🦉 🎤   RapidMiner Wisdom 2020 - CALL FOR SPEAKERS   🦉 🎤

We are inviting all community members to submit proposals to speak at Wisdom 2020 in Boston.


Whether it's a cool RapidMiner trick or a use case implementation, we want to see what you have.
Form link is below and deadline for submissions is November 15. See you in Boston!

CLICK HERE TO GO TO ENTRY FORM

"Web crawler vs special character"

KamilKamil Member Posts: 1 Contributor I
edited May 23 in Help
Hi,

when try to grab some german websites special charcters, i.e. Ü Ä Ö and so on, are not being interpreted correctly by the operator:

"Erste Eindr�cke vom ..."

Is there some "hidden" preference to solve this problem?

Tagged:

Answers

  • fischerfischer Member Posts: 439  Guru
    Hi,

    unfortunately not. The encoding is specified in the HTTP header and also in the HTML itself. This may be (and often is) incorrectly configured by the site administrator. Currently there is no way to force an encoding in RM.

    Best,
    Simon
Sign In or Register to comment.