I have funny characters in my example sets. I suspect an encoding problem.

User13 Member Posts: 155  Guru

Problem:

The encoding settings of the database, of a database connection configured in RapidMiner Studio or Server, or of the JBoss instance that hosts RapidMiner Server are incorrect. Many file input operators also let you specify an encoding, which can be set incorrectly as well.
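To see how such a mismatch produces the funny characters, here is a minimal Python sketch. It is an illustration only and not part of RapidMiner: the helper names garble and repair are made up, and latin-1 stands in for whatever wrong charset is actually involved in your setup.

```python
# Illustration only: reproduce "funny characters" by decoding UTF-8 bytes
# with a mismatched charset, then undo the damage.

def garble(text: str, wrong_encoding: str = "latin-1") -> str:
    """Encode text as UTF-8, then decode the bytes with the wrong charset."""
    return text.encode("utf-8").decode(wrong_encoding)


def repair(garbled: str, wrong_encoding: str = "latin-1") -> str:
    """Reverse garble(): recover the original bytes and decode them as UTF-8."""
    return garbled.encode(wrong_encoding).decode("utf-8")


original = "Müller"
broken = garble(original)
print(broken)          # MÃ¼ller
print(repair(broken))  # Müller
```

If the values in your example set look like the first printed line, the data is probably stored as UTF-8 but read with a different charset somewhere along the way.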

Solution:


You should use the utf8 encoding wherever possible. Encoding settings can be made at the following levels:


  • Database
    : In MySQL, use "ALTER DATABASE xxx DEFAULT CHARACTER SET utf8"


  • Table
    : Newly created tables inherit the database's default character set. A different character set can be specified in the CREATE statement.


  • RapidMiner Studio JDBC connection
    : Set the appropriate connection properties (see below for a list). In RapidMiner Studio this is possible via Tools > Manage Database Connections > Advanced.

The encoding name you want to use is almost always utf8. The exact name of the JDBC property depends on the database. Known values are:


  • MySQL:
    characterEncoding


  • MS SQL Server via JTDS driver:
    CHARSET


  • Oracle:
    charset
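For MySQL, the same property can also be written into the JDBC URL as a query parameter instead of being set in the Advanced dialog. The following Python sketch only builds such a URL string to show its shape. The function name mysql_jdbc_url and the host, port and database values are hypothetical placeholders, and other drivers use a different URL syntax.

```python
from urllib.parse import urlencode


def mysql_jdbc_url(host: str, port: int, database: str, encoding: str = "utf8") -> str:
    """Build a MySQL JDBC URL that carries the characterEncoding property."""
    # characterEncoding is the MySQL property name listed above.
    query = urlencode({"characterEncoding": encoding})
    return f"jdbc:mysql://{host}:{port}/{database}?{query}"


# Hypothetical connection details, for illustration only.
print(mysql_jdbc_url("localhost", 3306, "mydb"))
# jdbc:mysql://localhost:3306/mydb?characterEncoding=utf8
```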

Processes can configure the encoding via parameters of input operators.


