Options

I want to duplicate data in RapidMiner

AdiletkgzAdiletkgz Member Posts: 5 Learner I
Would you mind to help me to find an operator that will duplicate (increase number of needed information) rows with data of specific types of labeled data.

Best Answers

Answers

  • Options
    lionelderkrikorlionelderkrikor Moderator, RapidMiner Certified Analyst, Member Posts: 1,195 Unicorn
    Hi @Adiletkgz,

    I have difficulties to understand. Can you provide an example of what you have and what you want to obtain ?

    Regards,

    Lionel
  • Options
    varunm1varunm1 Moderator, Member Posts: 1,207 Unicorn
    Hi @Adiletkgz

    Are you asking for making copies of same data to attach to different operators? If so, you can use multiply operator. This will give you same data as many copies you need.

    Thanks,
    Varun
    Regards,
    Varun
    https://www.varunmandalapu.com/

    Be Safe. Follow precautions and Maintain Social Distancing

  • Options
    AdiletkgzAdiletkgz Member Posts: 5 Learner I
    Hello friends!
    I have a data of clients, which are labeled as "Bad clients" and "Good clients". My problem is that I have a lot of "Good clients" and very few "Bad clients". I want to make copies of "Bad clients" in order to increase prediction accuracy. I plan to create credit scoring model by identifying predictors of "Bad clients". 
    Here an example of my data:
    att1   att2   att3   att4
    M      No     Yes   Good client
    M      Yes     Yes   Good client
    M      Yes     No   Good client
    M      Yes     No   Good client
    M      No     No   Good client
    M      No     Yes   Good client
    M      Yes     No   Bad client

  • Options
    AdiletkgzAdiletkgz Member Posts: 5 Learner I
    I want from this data, where I have limited "Bad clients":
    att1   att2   att3   att4
    M      No     Yes   Good client
    M      Yes     Yes   Good client
    M      Yes     No   Good client
    M      Yes     No   Good client
    M      No     No   Good client
    M      No     Yes   Good client
    M      Yes     No   Bad client

    To have this, where "Bad clients" are duplicated:
    att1   att2   att3   att4
    M      No     Yes   Good client
    M      Yes     Yes   Good client
    M      Yes     No   Good client
    M      Yes     No   Good client
    M      No     No   Good client
    M      No     Yes   Good client
    M      Yes     No   Bad client
    M      Yes     No   Bad client
    M      Yes     No   Bad client
    M      Yes     No   Bad client
    M      Yes     No   Bad client

    As a result I will have approximately equal amount of "Bad clients" and "Good clients"
  • Options
    AdiletkgzAdiletkgz Member Posts: 5 Learner I
    @lionelderkrikor @varunm1
    Sorry for inconvenience, I am new here = )
Sign In or Register to comment.