RapidMiner

RapidMiner

what is the DictionaryStemmer file format

Contributor

what is the DictionaryStemmer file format

Hi,
I am trying to use the DictionaryStemmer feature of the text plugin, I am confused about the file format to be used the tutorial say "base form,  expr1 expr2"

while the java doc says: "targetExpression pattern1 pattern2"

am baically using string and used the javadoc format  for example,

food pizza burger lunch...
sports baseball golf football...

but it seem to be giving erroneous results. can someone point out the correct format.

Thanks
Angshu
1 REPLY
Elite

Re: what is the DictionaryStemmer file format

Hi Angshu,
unfortunately the description was erroneous. You will have to specify a format like that:
targetExpression : patter1 patter2 ...

Where patterX might be a regular expression like ".*pizza".

Greetings,
  Sebastian
Old World Computing - Establishing the Future

Check out the Jackhammer Extension for RapidMiner! Crunch more data easier and with up to 700% speed up! Available only here