Options

Data with special character getting ignored while training

vipul_kumarvipul_kumar Member Posts: 20 Contributor I
edited November 2018 in Help

Hello,

 

I have a training model and i train that using csv data set. 
I came across one particular observation where if the row in csv data set contains special characters like \n,\r ot ^M (which can come easily if files are generated on windows and transfered to linux.) , the training process just ignores all these lines.
My csv data contains these specical characters on almost every line and hence the model is not getting trained with all these data as is ignoring these rows with special characters.

 

What is the way to instruct rapidminer server to allow these characters.

 

Please Note: I dont have control over the special characters coming in the CSV. I can do a lot of regular expression replacement in the csv to remove those special characters  but that is the last option.

Tagged:

Answers

  • Options
    sgenzersgenzer Administrator, Moderator, Employee, RapidMiner Certified Analyst, Community Manager, Member, University Professor, PM Moderator Posts: 2,959 Community Manager

    hello @vipul_kumar - I'd recommend posting your XML process here (see "Read Before Posting" on right when you reply) and attach your dataset. This way we can replicate what you're doing and help you better.

     

    Scott

     

     

Sign In or Register to comment.