Windows and UTF-8

eisioriginaleisioriginal Member Posts: 4 Contributor I
edited November 2018 in Help

i currently try to use Rapidminer to crawl some chinese content. I use the crawl web operator and store the crawled pages to my file system. I also use a content filter within my process.

When i set some chinese words within the content filter those characters are ??? when i reload the process within rapid miner. I also have wrong characters in the resulting crawled pages in my folder because the files are stored in ANSI Format.

I already tried the encoding option of rpid miner with no success. How can i run RapidMiner on windows in a way that its storing utf-8 files and process files?

Thank you

Sign In or Register to comment.