RapidMiner

Struggling importing CSV from R Studio to Rapidminer

Learner II matteomartignon


Hello guys,

I am having a lot of problems importing my CSV from R Studio into RapidMiner.

The first 50 rows are completely gone.

 

Here is the command I used in R Studio to save my data frame:

write.table(musei, file = "musei_18_F.csv", quote = TRUE, sep = ",", row.names = FALSE)

 

[Screenshot: My CSV from R Studio]

[Screenshot: The same CSV imported in RapidMiner]

6 REPLIES
RM Certified Expert

Re: Struggling importing CSV from R Studio to Rapidminer

You have 31 warnings when importing the CSV file; look at the lower right of the second image. Click on that and see what it says.

Learner II matteomartignon

Re: Struggling importing CSV from R Studio to Rapidminer

Got it

Is there a way to fix this problem in R?

Thanks in advance

[Screenshot: Warnings]

RM Certified Expert

Re: Struggling importing CSV from R Studio to Rapidminer

In the last screenshot it looks like your separator might be a ";" instead of a ",". If so, it does not match the separator you are giving RapidMiner, and that would definitely cause problems. If you post a sample data file in addition to the screenshots, it would be easier to troubleshoot.

 

Brian T., Lindon Ventures - www.lindonventures.com
Analytics Consulting by Certified RapidMiner Analysts
Learner II matteomartignon

Re: Struggling importing CSV from R Studio to Rapidminer

Sure, this is my CSV.

It is a data frame I extracted with RFacebook. The function was "search.pages".

Thanks in advance,

matteo

RM Certified Expert

Re: Struggling importing CSV from R Studio to Rapidminer

Looking at the raw CSV, it is evident that some of the records contain many line-break characters, and that is what causes the parsing problem when RapidMiner imports those records. It's theoretically possible to clean this up in RapidMiner, but it is probably easier to do it in R or Python, using a short script to remove the extra characters before importing.
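As a rough sketch of what that cleanup could look like in R: the snippet below replaces line breaks inside text columns with a space before writing the file. The toy data frame and its column names are made up for illustration, and strip_line_breaks is a hypothetical helper, not part of any package. It only touches character columns, so factor columns would need converting with as.character() first.

```r
# Toy stand-in for the real data frame; the column names are invented.
musei <- data.frame(
  id = c("1", "2"),
  description = c("Museum of\nModern Art", "Open daily\r\n10-18"),
  stringsAsFactors = FALSE
)

# Hypothetical helper: replace each run of CR/LF characters in
# character columns with a single space.
strip_line_breaks <- function(df) {
  is_text <- vapply(df, is.character, logical(1))
  df[is_text] <- lapply(df[is_text], function(x) gsub("[\r\n]+", " ", x))
  df
}

musei_clean <- strip_line_breaks(musei)

# Same write.table call as in the original post, applied to the cleaned data.
write.table(musei_clean, file = "musei_18_F.csv", quote = TRUE, sep = ",",
            row.names = FALSE)
```

After this, every record should sit on exactly one physical line of the file, which is what a line-based CSV import expects.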

 

Brian T., Lindon Ventures - www.lindonventures.com
Analytics Consulting by Certified RapidMiner Analysts
RM Staff

Re: Struggling importing CSV from R Studio to Rapidminer

I've also looked at your CSV. As far as I can tell, the problem is in how the CSV is written. It lies entirely on the R side, so there is not much help we can give from our side.

 

I suggest saving the CSV and then reading it back into R with read.table() or similar. Once you manage to read the file back correctly, it will be ready to be read by RapidMiner.
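A minimal sketch of that round-trip check, assuming a small made-up data frame in place of the real one. It writes the file with the same write.table call as in the question, reads it back, and compares the number of records with the number of physical lines in the file. More lines than records is a hint that some fields contain embedded line breaks.

```r
# Toy stand-in for the real data frame; the column names are invented.
musei <- data.frame(
  id = c(1, 2),
  description = c("Museum of\nModern Art", "Open daily"),
  stringsAsFactors = FALSE
)

write.table(musei, file = "musei_18_F.csv", quote = TRUE, sep = ",",
            row.names = FALSE)

# Read the file back with matching separator and quote settings.
check <- read.table("musei_18_F.csv", header = TRUE, sep = ",",
                    quote = "\"", comment.char = "",
                    stringsAsFactors = FALSE)

# The record count should match the original data frame.
same_rows <- nrow(check) == nrow(musei)

# Physical lines in the file, excluding the header line.
physical_lines <- length(readLines("musei_18_F.csv")) - 1

# TRUE when at least one record spans more than one line.
has_multiline_records <- physical_lines > nrow(check)
```

R itself can read quoted fields that span several lines, so a clean round trip in R does not rule out multi-line records. The line count comparison is there to catch that case.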

 

Or you can put the code that generates the dataset inside the R scripting operator in RapidMiner; then you won't need to save a CSV file at all.
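A hedged sketch of what such a script might look like, assuming the Execute R operator from the R Scripting extension, which calls a function named rm_main and turns a returned data frame into an example set. The data frame built here is a made-up placeholder: in practice the body would hold the code that produces musei. The line-break cleanup is optional and only included to keep the text columns on one line.

```r
# rm_main is the entry point the Execute R operator calls.
# No arguments here, on the assumption that the operator has no input ports connected.
rm_main = function() {
  # Placeholder data: replace with the code that actually builds the data frame.
  musei <- data.frame(
    name = c("Museo A", "Museo B"),
    description = c("First line\nsecond line", "Single line"),
    stringsAsFactors = FALSE
  )

  # Optional: replace line breaks in character columns with a space.
  is_text <- vapply(musei, is.character, logical(1))
  musei[is_text] <- lapply(musei[is_text],
                           function(x) gsub("[\r\n]+", " ", x))

  # The returned data frame is delivered at the operator's output port.
  return(musei)
}
```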