RapidMiner 9.7 is Now Available

Lots of amazing new improvements including true version control! Learn more about what's new here.

CLICK HERE TO DOWNLOAD

Text Processing

RhmanigRhmanig Member Posts: 9 Contributor I
Hi

I am using Process Document to tokenize text (plus transform case, filter stop words and generate n-grams). I wonder why RapidMiner does not make a use of free memory and CPU and the process takes such a log time.

The current data size is 1059MB and the process is running for almost 5 days :/ The system has four cores and 29GB RAM. on average it uses %46 of CPU and right now it uses 75% of memory (the memory usage is going up slowly).

Please explain if you know why.

Thanks

Answers

  • mschmitzmschmitz Administrator, Moderator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, University Professor Posts: 2,412  RM Data Scientist
    Hi!

    Could you provide me with the process itself? Are there any Loops inside?

    Cheers,

    Martin
    - Head of Data Science Services at RapidMiner -
    Dortmund, Germany
Sign In or Register to comment.