Data import wizard should automatically estimate the column separator

yzanyzan Member Posts: 66 Unicorn
edited December 2018 in Product Feedback - Resolved

Reasoning: It should make imports easier.

1
1 votes

Fixed and Released · Last Updated

Comments

  • sgenzersgenzer Administrator, Moderator, Employee, RapidMiner Certified Analyst, Community Manager, Member, University Professor, PM Moderator Posts: 2,959 Community Manager

    hello @yzan - can you give us an example of this?


    Scott

     

  • yzanyzan Member Posts: 66 Unicorn

    RapidMiner has multiple wizards. I am specifically talking about the wizard provided by "Read CSV" operator after clicking on "Import Configuration Wizard", step 2 of 4, section "Column Separation".

     

    The desired functionality is implemented, for reference, in: https://docs.python.org/3/library/csv.html#csv.Sniffer, which is, I think, used behind https://github.com/okfn/messytables. Also, Calc automatically estimates the column separator in their import wizard.

     

    Of course, csv.Sniffer provides additional related estimates like whether the CSV file contains a header, or not. Currently, this has to be  set manually in the 3rd step of the wizard.

  • sgenzersgenzer Administrator, Moderator, Employee, RapidMiner Certified Analyst, Community Manager, Member, University Professor, PM Moderator Posts: 2,959 Community Manager

    ok got it.  Good suggestion.  Thank you.

  • sgenzersgenzer Administrator, Moderator, Employee, RapidMiner Certified Analyst, Community Manager, Member, University Professor, PM Moderator Posts: 2,959 Community Manager
Sign In or Register to comment.