RapidMiner 9.7 is Now Available

Lots of amazing new improvements including true version control! Learn more about what's new here.

CLICK HERE TO DOWNLOAD

Skipping comments in CSV loop import

tatianiiatatianiia Member Posts: 11 Contributor I
edited November 2019 in Help
Hi! I experience the following problem:

I am using "Loop Files"  and "Read CSV" to import a set of .csv files to RapidMiner.
Each file has roughly the following format:

none1,none2,none3
var1,var2,var3,var4
1,2,3,4
1,2,3,4

I want to skip the first line, so I annotate it as comment, and the second one - as name.
However, although the first line is not present in the output, it seems that the number of columns is equal to its capacity - so, there are 3 columns in the output instead of expected 4. So, this is my problem.

I read that one of the workarounds is naming every required column in dataset metadata information, but here I come across another obstacle: my .csv files have different number of attributes and I don't see any way to create a unified  list of columns in metadata information that will fit them all.

Are there any solutions for that? I have not found any, so I will be grateful for your help.

Answers

  • tatianiiatatianiia Member Posts: 11 Contributor I
    I have removed the first line with ''Remove document parts" operator, so everythings works fine now. But I still we be glad to hear some useful comments, since this solution does not seem to be ideal.
  • Marco_BoeckMarco_Boeck Team Lead Software Engineering Administrator, Moderator, Employee, Member, University Professor Posts: 1,938   RM Engineering
    Hi,

    unfortunately that is a bug. It is on our list and will be fixed in the future.

    Regards,
    Marco
Sign In or Register to comment.