Nutch运行后的问题
配置好,没有错误后运行
出现
crawl started in: crawl
rootUrlDir = urls
threads = 10
depth = 3
topN = 50
Injector: starting
Injector: crawlDb: crawl/crawldb
Injector: urlDir: urls
Injector: Converting injected urls to crawl db entries.
Exception in thread "main" java.io.IOException: Failed to get the current user's information.
at org.apache.hadoop.mapred.JobClient.getUGI(JobClient.java:717)
at org.apache.hadoop.mapred.JobClient.configureCommandLineOptions(JobClient.java:592)
at org.apache.hadoop.mapred.JobClient.submitJob(JobClient.java:788)
at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:1142)
at org.apache.nutch.crawl.Injector.inject(Injector.java:160)
at org.apache.nutch.crawl.Crawl.main(Crawl.java:113)
Caused by: javax.security.auth.login.LoginException: Login failed: Cannot run program "whoami": CreateProcess error=2, ?????????
at org.apache.hadoop.security.UnixUserGroupInformation.login(UnixUserGroupInformation.java:250)
at org.apache.hadoop.security.UnixUserGroupInformation.login(UnixUserGroupInformation.java:275)
at org.apache.hadoop.mapred.JobClient.getUGI(JobClient.java:715)
... 5 more
请问这是怎么回事?
谢谢了啊~
[解决办法]
修改下conf文件夹下的nutch-site.xml文件:
<property> <name>http.agent.name</name> <value>HD nutch agent</value> <description></description> </property> <property> <name>http.agent.description</name> <value>hpjianhua</value> <description></description> </property> <property> <name>http.agent.url</name> <value>http://www.163.com</value> <description></description> </property> <property> <name>http.agent.email</name> <value>hpjianhua@163.com</value> <description></description> </property>