The RapidMiner community is on read-only mode until further notice. Technical support via cases will continue to work as is. For any urgent licensing related requests from Students/Faculty members, please use the Altair academic forum here.
IE-extension (preprocessing operators)
Hello!
I've been trying to use the IE-extension in order to build my own entity "extractor" but this extension and its support are really, really poor.
To do some kind of POS-tagging, the only option I can see is the "TagListPreprocessing" operator. It receives a list of words as input (e.g. pronouns), defines a new attribute (e.g., isPronoun) and checks if each word of your data set matches any word contained in that list, assigning a 1 to the isPronoun attribute.
My question is: what kind of file extension needs that list?! I've tried with ".txt", no extension whatsoever, etc.and it does not work.
Anyway, is there any other alternative to perform POS-tagging?
Thanks in advance!
I've been trying to use the IE-extension in order to build my own entity "extractor" but this extension and its support are really, really poor.
To do some kind of POS-tagging, the only option I can see is the "TagListPreprocessing" operator. It receives a list of words as input (e.g. pronouns), defines a new attribute (e.g., isPronoun) and checks if each word of your data set matches any word contained in that list, assigning a 1 to the isPronoun attribute.
My question is: what kind of file extension needs that list?! I've tried with ".txt", no extension whatsoever, etc.and it does not work.
Anyway, is there any other alternative to perform POS-tagging?
Thanks in advance!
Tagged:
0