Apache Sqoop is in charge of moving large datasets between different storage systems such as relational databases to Hadoop. Sqoop supports a large number of connectors such as JDBC to work with different data sources. Sqoop makes it easy to import existing data into Hadoop.
Sqoop supports the following databases:
- HSQLDB starting version 1.8
- MySQL starting version 5.0
- Oracle starting version 10.2
- Microsoft SQL
Sqoop provides several possibilities to import and export data from and to Hadoop. The service also provides several mechanisms to validate data.