RAPIDMINER 9.7 BETA ANNOUNCEMENT

The beta program for the RapidMiner 9.7 release is now available. Lots of amazing new improvements including true version control!

CLICK HERE TO DOWNLOAD

"Web crawler vs special character"

KamilKamil Member Posts: 1 Contributor I
edited May 2019 in Help
Hi,

when try to grab some german websites special charcters, i.e. Ü Ä Ö and so on, are not being interpreted correctly by the operator:

"Erste Eindr�cke vom ..."

Is there some "hidden" preference to solve this problem?

Tagged:

Answers

  • fischerfischer Member Posts: 439  Maven
    Hi,

    unfortunately not. The encoding is specified in the HTTP header and also in the HTML itself. This may be (and often is) incorrectly configured by the site administrator. Currently there is no way to force an encoding in RM.

    Best,
    Simon
Sign In or Register to comment.