When you integrate Sqoop with an already running Hadoop cluster, you may run into several issues, including the ones below. I hit them when I tried to import data from MySQL into my Hadoop instance.
Keep these points in mind:
1. Install the JDK; the JRE alone is not enough. Sqoop generates Java code during an import and needs the compiler (javac) to build it.
2. Get the JDBC driver for the database you are importing from, and copy the driver jar to $SQOOP_HOME/lib.
3. The Hadoop jar that is bundled with Sqoop may not be compatible with your Hadoop cluster. Replace the Hadoop jar bundled with Sqoop with the Hadoop core jar from your Hadoop installation.
4. Make sure that Sqoop is using the right Hadoop installation. If it is not, you may have to tweak the $SQOOP_HOME/usr/lib/sqoop/bin/configure-sqoop file.
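The four points above can be turned into a quick pre-flight check. What follows is only a rough sketch, not something that ships with Sqoop: the function names (has_jdk, find_jars, preflight) are made up for this post, the "mysql-connector" name fragment is an assumption about how your driver jar is named, and it assumes the cluster's Hadoop jars sit directly under $HADOOP_HOME, which depends on your Hadoop version and packaging. Comparing jar file names is just a heuristic for spotting a mismatch.

```python
import os
import shutil
from pathlib import Path


def has_jdk():
    """Return True if a Java compiler (javac) is on PATH; a JRE does not ship one."""
    return shutil.which("javac") is not None


def find_jars(lib_dir, name_fragment):
    """List jar file names in lib_dir whose name contains name_fragment."""
    lib = Path(lib_dir)
    if not lib.is_dir():
        return []
    fragment = name_fragment.lower()
    return sorted(p.name for p in lib.glob("*.jar") if fragment in p.name.lower())


def preflight(sqoop_home, hadoop_home, driver_fragment="mysql-connector"):
    """Collect warnings for points 1 to 3; all names here are illustrative."""
    warnings = []
    # Point 1: a JDK must be installed.
    if not has_jdk():
        warnings.append("javac not found on PATH: install a JDK, not just a JRE")
    # Point 2: the JDBC driver jar must be in Sqoop's lib directory.
    sqoop_lib = Path(sqoop_home) / "lib"
    if not find_jars(sqoop_lib, driver_fragment):
        warnings.append(f"no JDBC driver jar matching '{driver_fragment}' in {sqoop_lib}")
    # Point 3: heuristic only, compare Hadoop jar names on both sides.
    bundled = set(find_jars(sqoop_lib, "hadoop"))
    cluster = set(find_jars(hadoop_home, "hadoop"))  # assumed jar location
    if bundled and cluster and bundled.isdisjoint(cluster):
        warnings.append(
            f"Hadoop jars differ: Sqoop has {sorted(bundled)}, cluster has {sorted(cluster)}"
        )
    return warnings


if __name__ == "__main__":
    sqoop_home = os.environ.get("SQOOP_HOME")
    hadoop_home = os.environ.get("HADOOP_HOME")
    if not sqoop_home or not hadoop_home:
        print("Set SQOOP_HOME and HADOOP_HOME before running this check")
    else:
        for warning in preflight(sqoop_home, hadoop_home):
            print("WARNING:", warning)
```

Point 4 is not covered here, because which Hadoop installation Sqoop picks up is decided inside its own startup script, and that is easier to inspect by hand.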
Exception caused by points 3 and 4:
ERROR tool.ImportTool: Encountered IOException running import job: org.apache.hadoop.ipc.RPC$VersionMismatch: Protocol org.apache.hadoop.hdfs.protocol.ClientProtocol version mismatch. (client = 63, server = 61)
at org.apache.hadoop.ipc.RPC.getProxy(RPC.java:403)
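The two numbers at the end of the error message are the useful part: "client" is the protocol version of the Hadoop jar on Sqoop's classpath, and "server" is the version the cluster's NameNode speaks. If you want to pull them out of a log, a small sketch like this would do; the function name parse_version_mismatch and the regular expression are my own, written against the message format shown above.

```python
import re

# Matches the tail of the RPC$VersionMismatch message shown above.
MISMATCH = re.compile(r"version mismatch\. \(client = (\d+), server = (\d+)\)")


def parse_version_mismatch(log_line):
    """Return (client_version, server_version) as ints, or None if the line does not match."""
    match = MISMATCH.search(log_line)
    if match is None:
        return None
    return int(match.group(1)), int(match.group(2))


line = (
    "ERROR tool.ImportTool: Encountered IOException running import job: "
    "org.apache.hadoop.ipc.RPC$VersionMismatch: Protocol "
    "org.apache.hadoop.hdfs.protocol.ClientProtocol version mismatch. "
    "(client = 63, server = 61)"
)
print(parse_version_mismatch(line))  # prints (63, 61)
```

When the two numbers differ, Sqoop is loading a Hadoop jar that does not match the cluster, which is exactly what points 3 and 4 are meant to fix.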