I have funny characters in my example sets. I suspect an encoding problem.
Encoding settings of the database, the settings of a database connection configured in RapidMiner Studio or Server, or the JBoss instance that hosts RapidMiner Server are incorrect. Many file input operators can also specify an encoding.
You should use the
encoding wherever possible. Database settings can be made per
: In MySQL, use “ALTER DATABASE xxx DEFAULT CHARACTER SET utf8”
: Newly created tables will inherit from the default character set and can be otherwise specified in the CREATE statement.
RapidMiner Studio JDBC connection
: Set the appropriate connection properties (see below for a list). In RapidMiner Studio this is possible via Tools > Manage Database Connections > Advanced.
The encodingName you want to use is almost always utf8. What exactly the name of the JDCB property is, depends on the database. Known values are:
MS SQL Server via JTDS driver:
Processes can configure the encoding via parameters of input operators.