FP-growth on marketing data

bernardo_pagnon Member, University Professor Posts: 64
edited March 2020 in Help
Hello,

Is it possible to apply FP-growth when the variables are polynominal? I am trying to apply the operator to this data set from the book Elements of Statistical Learning:

https://web.stanford.edu/~hastie/ElemStatLearn/

I am having issues because the variables are polynominal. Can FP-growth handle this data set?

Best,
Bernardo
Answers

  • yyhuang Administrator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, Member Posts: 364 RM Data Scientist
    Hi @bernardo_pagnon, which data set from the ESL textbook are you using? Are you able to share it here? Thanks!
  • bernardo_pagnon Member, University Professor Posts: 64
    I tried to share it, but I guess attachments of that kind are not supported here.
    You have to go here

    then click on "Data" (white button on the left), then on "Marketing", and then on the hyperlink "data".

    Regards,
    Bernardo



  • yyhuang Administrator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, Member Posts: 364 RM Data Scientist
    Thanks @bernardo_pagnon for pointing out the source data. I am still not sure what you want to extract from it. Did I get the right data set: the income survey data with 14 demographic attributes (direct links below)?
    https://web.stanford.edu/~hastie/ElemStatLearn/datasets/marketing.data
    https://web.stanford.edu/~hastie/ElemStatLearn/datasets/marketing.info.txt
  • bernardo_pagnon Member, University Professor Posts: 64
    Yes, it is the correct one. My goal was to extract relationships between the available attributes. I think I have identified the problem: different attributes use the same numbers, 1 through 5 for instance, to encode their polynominal values. RM cannot tell apart a "3" in two different attributes: the two values mean completely different things, but RM treats them as the same item.
    One solution would be to preprocess the data so that each attribute uses its own range of numbers:

    Attribute 1: 1-9
    Attribute 2: 10-11

    and so on.

    Best,
    Bernardo
  • bernardo_pagnon Member, University Professor Posts: 64
    Interesting, will try that!
    Thank you so much for your time and for your patience.

    Regards,
    Bernardo
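
The renumbering idea from the thread (attribute 1 uses 1-9, attribute 2 uses 10-11, and so on) can be sketched outside RapidMiner roughly as follows. This is only an illustration in plain Python, not a RapidMiner process: the function names `offset_codes` and `label_items`, the attribute names `income` and `sex`, and the toy rows are all assumptions made for the example, and it assumes 1-based integer codes with no missing values. The second function shows an alternative that may be easier to read in the resulting rules: turning every cell into an `attribute=value` item so that equal numbers in different attributes can no longer collide.

```python
from itertools import accumulate


def offset_codes(rows, cardinalities):
    """Shift each attribute's 1-based codes into its own disjoint range."""
    # offsets[j] = number of distinct values used by all attributes before column j
    offsets = [0] + list(accumulate(cardinalities))[:-1]
    return [[code + offsets[j] for j, code in enumerate(row)] for row in rows]


def label_items(rows, names):
    """Alternative: turn each cell into an 'attribute=value' item string."""
    return [{f"{name}={code}" for name, code in zip(names, row)} for row in rows]


# Hypothetical toy rows: column 0 takes codes 1-9, column 1 takes codes 1-2.
rows = [[3, 1], [9, 2], [3, 2]]

shifted = offset_codes(rows, cardinalities=[9, 2])
items = label_items(rows, names=["income", "sex"])

print(shifted)  # column 1 now uses 10-11 instead of 1-2
print(items)
```

Either output can then be treated as a list of transactions for a frequent-itemset miner; the labelled variant keeps the attribute name visible in every item, which the shifted numbers do not.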