How do I tokenize an example set?

darwin_h_poritz · December 2018

I have an example set of 451 comments (rows). I want to generate mult-word tokens (n-Grams) from this example set. What is the sequence of operators?

Telcontar120 · December 2018

You'll first need to set your attribute to text data type (it is probably nominal by default), and then use the Process Documents from Data operator. Within that you will put the tokenize operator followed by the N-Grams operator. You may also wish to conduct other types of text preprocessing as well.

greg_lorincz79 · March 2021

Howdy, Stranger!

Quick Links

Categories

Altair RapidMiner Community

GET HELP. LEARN BEST PRACTICES. NETWORK WITH YOUR PEERS.

How do I tokenize an example set?

Answers