The Altair Community is migrating to a new platform to provide a better experience for you. The RapidMiner Community will merge with the Altair Community at the same time. In preparation for the migration, both communities are on read-only mode from July 15th - July 24th, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here.
Options

Operator Crawl: Process failed

domodomo Member Posts: 1 Contributor I
Hi,
I have installed the lastest version 10.1.001 and I have a problem with the operator Crwal.
The process fail and here the error message.
I have checked the version of Java and the version is 1.8.0_361

  • Exception: java.lang.NoClassDefFoundError
  • Message: org/apache/tika/parser/html/HtmlParser
  • Stack trace:
  • edu.uci.ics.crawler4j.parser.TikaHtmlParser.(TikaHtmlParser.java:34)
  • edu.uci.ics.crawler4j.parser.Parser.(Parser.java:42)
  • edu.uci.ics.crawler4j.crawler.CrawlController.(CrawlController.java:85)
  • com.rapidminer.operator.web.crawler.CrawlerOperator.doWork(CrawlerOperator.java:269)
  • com.rapidminer.operator.Operator.execute(Operator.java:1024)
  • com.rapidminer.operator.execution.SimpleUnitExecutor.execute(SimpleUnitExecutor.java:77)
  • com.rapidminer.operator.ExecutionUnit$2.run(ExecutionUnit.java:804)
  • com.rapidminer.operator.ExecutionUnit$2.run(ExecutionUnit.java:799)
  • java.base/java.security.AccessController.doPrivileged(Native Method)
  • com.rapidminer.operator.ExecutionUnit.execute(ExecutionUnit.java:799)
  • com.rapidminer.operator.OperatorChain.doWork(OperatorChain.java:423)
  • com.rapidminer.operator.Operator.execute(Operator.java:1024)
  • com.rapidminer.Process.executeRoot(Process.java:1476)
  • com.rapidminer.Process.lambda$executeRootInPool$5(Process.java:1452)
  • com.rapidminer.studio.concurrency.internal.AbstractConcurrencyContext$AdaptedCallable.exec(AbstractConcurrencyContext.java:362)
  • java.base/java.util.concurrent.ForkJoinTask.doExec(Unknown Source)
  • java.base/java.util.concurrent.ForkJoinPool$WorkQueue.topLevelExec(Unknown Source)
  • java.base/java.util.concurrent.ForkJoinPool.scan(Unknown Source)
  • java.base/java.util.concurrent.ForkJoinPool.runWorker(Unknown Source)
  • java.base/java.util.concurrent.ForkJoinWorkerThread.run(Unknown Source)
Could you help me to solve the problem?
Thanks
Tagged:

Answers

  • Options
    ruhailaruhaila Member Posts: 48 Guru
    Perhaps share the .rpm file?
Sign In or Register to comment.