RapidMiner

RapidMiner

[SOLVED] Add custom Stopwords Dictionary

Regular Contributor

[SOLVED] Add custom Stopwords Dictionary

Dear sirs,

I am surpise of the very nice things that your product can do.
I am trying some new research parts in text mining in Greek language and I would like to know if it is possibe to add my custom stopwords dictionary and in what way.

Thank you in advance
Manolis
5 REPLIES
Community Manager

Re: Add custom Stopwords Dictionary

Hi!

You can easily use your dictionary. Just add a "Filter Stopwords (Dictionary)" operator from Text Processing/Filtering (Text mining extension) to your document processing and select the file.

Regards

Balázs
Regular Contributor

Re: Add custom Stopwords Dictionary

Thank you for your immidiate reply

Are there any specifications that I have to follow for the txt document? Like comma delimited or seperators among words?

Does UTF-8 supported?

Thank you in advance

Manolis
Super Contributor

Re: Add custom Stopwords Dictionary

The documentation of that operator contains all required information: one stop word per line, and the encoding (UTF-8 or whatever you like) can be selected with the encoding parameter.

Best regards,
Marius
Regular Contributor

Re: Add custom Stopwords Dictionary

Hi again after a long time,

1. Yes but is there any tutorial for how I can add a custom filter stop words operator?

2. Do I have to develop something? Is there any tutorial for that?

3. Where can I find the documentation of filter operator? Is this helpfull as a tutorial?

Sorry for the silly questions but I can not find the answer

Thank you in advance
Manos
Regular Contributor

Problem Solved

Problem Solved