two data sets

[Deleted User][Deleted User] Posts: 0 Learner III
edited September 2019 in Help
Hi
I have two excel form of data and one of them has Label the other one doesnt have label column so is it possible to import both of them to RM and start use ML algorithms for them?
Thank you

mbs

Best Answers

  • varunm1varunm1 Moderator, Member Posts: 1,207 Unicorn
    Solution Accepted
    Hello @mbs

    Yes, you can. You do not need a label for importing. If both of these datasets are related, you can train the algorithm on a dataset with labels and use the apply model to predict labels for the dataset without labels.

    Hope this helps. Please inform if you have more questions.
    Regards,
    Varun
    https://www.varunmandalapu.com/

    Be Safe. Follow precautions and Maintain Social Distancing

  • varunm1varunm1 Moderator, Member Posts: 1,207 Unicorn
    Solution Accepted
    I attached a sample process using Titanic labeled and unlabelled datasets for your reference. You can import the .rmp file.

    Thanks
    Regards,
    Varun
    https://www.varunmandalapu.com/

    Be Safe. Follow precautions and Maintain Social Distancing

Answers

  • [Deleted User][Deleted User] Posts: 0 Learner III
    @varunm1

    Thank you for your quick answer. ;)
    I will try it

    regards 

    mbs
  • [Deleted User][Deleted User] Posts: 0 Learner III
    @varunm1

    Thank you very much  o:)

    best regards
    mbs

    ...................................................................................................................................................
    @varunm1

    از راهنمایی شما خیلی ممنونم

    با احترام
    mbs
  • [Deleted User][Deleted User] Posts: 0 Learner III
    @varunm1

    I import your rmp file but unfortunately I see a problem that shows some columns doesnt match with each other but I check the data and it doesnt have any problem. please guide me to solve it.

    Thank you

    mbs
  • [Deleted User][Deleted User] Posts: 0 Learner III
    edited October 2019
    yes
  • varunm1varunm1 Moderator, Member Posts: 1,207 Unicorn
    Hello @mbs

    Can you set a breakpoint before apply model by right clicking on the apply model operator? Once you set it, run the process and see if the attribute is available, if it is available see what the type (numeric, polynomial etc) of that attribute is. You can also see what type of attribute is coming from the training data set by setting break point after  read excel (training data).
    Regards,
    Varun
    https://www.varunmandalapu.com/

    Be Safe. Follow precautions and Maintain Social Distancing

  • [Deleted User][Deleted User] Posts: 0 Learner III
    @varunm1

    all my data is in binary form but I think this is not the problem of data and its like a bug and i couldnt handle it. :(

    mbs
  • varunm1varunm1 Moderator, Member Posts: 1,207 Unicorn
    You can send me your data and process in direct message for checking.
    Regards,
    Varun
    https://www.varunmandalapu.com/

    Be Safe. Follow precautions and Maintain Social Distancing

  • [Deleted User][Deleted User] Posts: 0 Learner III
    sure ;)

    I did it :)
  • [Deleted User][Deleted User] Posts: 0 Learner III
    @varunm1


    Hello

    I changed all the columns header (type dote between words) but still I see so many bugs :(

    And no thing work :/

    please help me to solve it

    regards 

    mbs
  • varunm1varunm1 Moderator, Member Posts: 1,207 Unicorn
    Can you post the images of those bugs?
    Regards,
    Varun
    https://www.varunmandalapu.com/

    Be Safe. Follow precautions and Maintain Social Distancing

  • [Deleted User][Deleted User] Posts: 0 Learner III
    names dont have any problem :/
  • varunm1varunm1 Moderator, Member Posts: 1,207 Unicorn
    If you have the attribute available in both data sets and they belong to the same data type (polnomial, integer etc), from my understanding its the Names issue (spaces). Lets see if @sgenzer or @mschmitz
    Regards,
    Varun
    https://www.varunmandalapu.com/

    Be Safe. Follow precautions and Maintain Social Distancing

  • MartinLiebigMartinLiebig Administrator, Moderator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, University Professor Posts: 3,503 RM Data Scientist
    Hi,
    this UserError just states, that the data set you apply the model on (the lowerone) doesnot contain the attribute WRITE.CONTACTS. Please verify that it exists andis not special.

    BR,
    Martin
    - Sr. Director Data Solutions, Altair RapidMiner -
    Dortmund, Germany
  • [Deleted User][Deleted User] Posts: 0 Learner III
    @mschmitz
    Hi

    In both data I have the same attribute " write.contacts".
    But still I see the error. :/

    regards 

    mbs

  • MartinLiebigMartinLiebig Administrator, Moderator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, University Professor Posts: 3,503 RM Data Scientist
    hi,
    keep in mind that captial letters play a role. is this writtena lso all in caps?

    Best,
    Martin
    - Sr. Director Data Solutions, Altair RapidMiner -
    Dortmund, Germany
  • varunm1varunm1 Moderator, Member Posts: 1,207 Unicorn
    edited September 2019
    Hello Martin, @mschmitz

    can the spaces before the start of names in excel impact RM attributes if we use read excel operator?
    Regards,
    Varun
    https://www.varunmandalapu.com/

    Be Safe. Follow precautions and Maintain Social Distancing

  • [Deleted User][Deleted User] Posts: 0 Learner III
    edited September 2019
    @mschmitz


    Yes you see the exact form of writing with captial letters in both of them.

    Thank you 

    mbs
Sign In or Register to comment.