RapidMiner

What is the impact of specifying a positive value in FP-Growth Parameter settings

SOLVED
Super Contributor

What is the impact of specifying a positive value in FP-Growth Parameter settings

Hello,

 

As per operator documentations, the positive value parameter of the FP-Growth operator is defined as follows -

 

"positive value

Description: This parameter determines which value of the binominal attributes should be treated as positive. The attributes with that value are considered as part of a transaction. If left blank, the ExampleSet determines which value is used."

I wish to know what the second part means - "If left blank,....". How does the exampleset determine which values are used? My dataset is comprised of semi-structured text-mined data and I'm trying to look for word associations. The output of Binominal to Polynominal operator gives me a word vector with true/false values. Should I still specify the positive value at FP growth? Thank you.

 

1 REPLY
Highlighted
RMStaff

Re: What is the impact of specifying a positive value in FP-Growth Parameter settings

Hi,

 

Yes, I would do it.  The reason is that in specific cases you might actually want to look for associations with "false", i.e. with things which are NOT happening.  So it is always better to specify it.  If you do not specify it, the first value in the internal value mapping of the example set it considered to be the positive (if I remember correctly).  But this is something you typically do not want to depend on since this order might change with different data loading or preparations.

 

Hope this helps,

Ingo


How to load processes in XML from the forum into RapidMiner: Read this!