Due to recent updates, all users are required to create an Altair One account to login to the RapidMiner community. Click the Register button to create your account using the same email that you have previously used to login to the RapidMiner community. This will ensure that any previously created content will be synced to your Altair One account. Once you login, you will be asked to provide a username that identifies you to other Community users. Email us at Community with questions.

Exporting the preprocessed text data into Excel File

lolollolol Member Posts: 5 Learner I

Hi,

I really need your help. I have created a predictive model. The dataset I'm using contains several text attributes that were merged into one text attribute/column during the preprocessing phase. This preprocessing involved steps such as tokenization, stopword removal, token filtering, stemming, and generating n-grams. Could you please guide me on how to unmerged back all those text attributes and  into different columns as they were before? Additionally, I would like to export the data into an Excel file so that I can compare the original unprocessed text with the preprocessed text. Thank you for your assistance.

Answers

  • ruhailaruhaila Member Posts: 48 Guru
    Hi.

    Since there's no .rmp or data files included in your post, I am merely assuming that you could use the SELECT ATTRIBUTES operator to unmerge attributes. Exporting Excel has the WRITE EXCEL operator. hth.
  • BalazsBaranyBalazsBarany Administrator, Moderator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert Posts: 955 Unicorn
    Hi!

    Usually the easiest way for getting back the initial data is creating an ID (if you don't have one yet) and multiplying the dataset. Then you do your arbitrary preprocessing, keep the ID attribute, and then join back after the processing.

    Regards,
    Balázs
Sign In or Register to comment.