Options

"Read CSV -- Problem with newline in quoted values"

caleb_sotelocaleb_sotelo Member Posts: 1 Contributor I
edited June 2019 in Help
The operator documentation for Read CSV states "this operator can read csv files, where all values of an example are writen into one line". Not surprisingly, my input CSV file is not parsed correctly when there are quoted values containing newline characters. The operator thinks a new data record is beginning..

There do not seem to be any helpful operator parameters. Is there any known solution?
Tagged:

Answers

  • Options
    MariusHelfMariusHelf RapidMiner Certified Expert, Member Posts: 1,869 Unicorn
    Hi,

    in the current implementation of the Read CSV operator, files are read one by one without parsing, i.e. by searching only for the line delimiter. Thus, data that contains linebreaks cannot be parsed correctly. I fear the only solution is to replace the linebreaks in the input data, or to export the data e.g. as XML, if you can control the export of your data.

    Best,
      Marius
Sign In or Register to comment.