I created a process that reads files from disk using the Process Documents from Files operator. Nested inside it, I added the Transform Cases and Replace Tokens operators. All the connections are set up correctly, and using breakpoints I could see that they were doing what I wanted.
Here is my question. After running the entire process, why do I see the old text (without the case transformation and replacement) in the example set in the Results view, even though the breakpoints show the text actually being processed?
How do I see the output after all the transformations? When I connect the "wor" port to "res", I do seem to see the transformations in the final result. So what is the difference between the example set and the word list? When do you connect which one to the results?
My main question is: how do I see the transformed text? Even when I write to disk after the transformation, the text that gets written is unprocessed.
Process Documents is supposed to output the unmodified text, so everything is fine here. Usually the operator is used to calculate word frequencies, not to transform texts.
Since all you want to do is modify the text, you should probably work on the output of Process Documents and transform the text attribute with standard operators. Specifically, you will need the Replace operator. It accepts regular expressions, so you can do everything you are currently doing inside Process Documents.
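To make the idea concrete outside of RapidMiner, here is a minimal Python sketch of the same kind of transformation: lowercase a text column, then apply a regex replacement to it. The sample rows, the replacement rule, and the name transform_text are assumptions made up for illustration; none of them are part of your process or of RapidMiner.

```python
import re

def transform_text(text, pattern, replacement):
    """Lowercase the text, then apply a regex replacement to it."""
    return re.sub(pattern, replacement, text.lower())

# Hypothetical rows standing in for the text attribute of the example set.
rows = [
    {"text": "Hello World, this is RapidMiner"},
    {"text": "The COLOUR of the sky"},
]

# Hypothetical rule: replace "colour" with "color" after lowercasing.
for row in rows:
    row["text"] = transform_text(row["text"], r"colour", "color")

print([row["text"] for row in rows])
```

The point is only that the change is applied to the text attribute itself, so whatever you view or write to disk afterwards is the modified text.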
Another option in your particular case would be to take the "wor" output and add a WordList to Data operator, which converts the word list into an example set that contains the transformed texts.