Due to recent updates, all users are required to create an Altair One account to login to the RapidMiner community. Click the Register button to create your account using the same email that you have previously used to login to the RapidMiner community. This will ensure that any previously created content will be synced to your Altair One account. Once you login, you will be asked to provide a username that identifies you to other Community users. Email us at Community with questions.
Answers
You could try the Generalised Sequential Patterns operator ?
But it seems there is an error with this operator. When I try to view the results of the GSPset, I get: Can anyone reproduce this error?
Here is an example process with data from samples (only to demonstrate the error):
Chees Q-Dog
Feb 9, 2012 5:37:18 PM WARNING: Found only 2.0 sequences. Together with the small minimal support, this could result in very many patterns and a long calculation time.
Feb 9, 2012 5:37:18 PM INFO: Generating Candidates of length 1
Feb 9, 2012 5:37:18 PM INFO: Generated 190 candidates
Feb 9, 2012 5:37:18 PM INFO: Building Hashtree for counting candidates of length 2
Feb 9, 2012 5:37:18 PM INFO: Counting supporting sequences for candidates of length 2
Feb 9, 2012 5:37:18 PM INFO: Filtered Candidates. Remaining: 46
Feb 9, 2012 5:37:18 PM INFO: Generating Candidates of length 2
Feb 9, 2012 5:37:18 PM INFO: Generated 138 candidates
Feb 9, 2012 5:37:18 PM INFO: Building Hashtree for counting candidates of length 3
Feb 9, 2012 5:37:18 PM INFO: Counting supporting sequences for candidates of length 3
Feb 9, 2012 5:37:18 PM INFO: Filtered Candidates. Remaining: 35
Feb 9, 2012 5:37:18 PM INFO: Generating Candidates of length 3
Feb 9, 2012 5:37:18 PM INFO: Generated 14 candidates
Feb 9, 2012 5:37:18 PM INFO: Building Hashtree for counting candidates of length 4
Feb 9, 2012 5:37:18 PM INFO: Counting supporting sequences for candidates of length 4
Feb 9, 2012 5:37:18 PM INFO: Filtered Candidates. Remaining: 14
Feb 9, 2012 5:37:18 PM INFO: Generating Candidates of length 4
Feb 9, 2012 5:37:18 PM INFO: Generated 2 candidates
Feb 9, 2012 5:37:18 PM INFO: Building Hashtree for counting candidates of length 5
Feb 9, 2012 5:37:18 PM INFO: Counting supporting sequences for candidates of length 5
Feb 9, 2012 5:37:18 PM INFO: Filtered Candidates. Remaining: 2
Feb 9, 2012 5:37:18 PM INFO: Generating Candidates of length 5
Feb 9, 2012 5:37:18 PM INFO: Generated 0 candidates
Feb 9, 2012 5:37:18 PM INFO: Saving results.
Feb 9, 2012 5:37:18 PM INFO: Process //Data Files/Forum finished successfully after 0 s
Actually I've just posted elsewhere about association rules and referred here
http://rapid-i.com/rapidforum/index.php/topic,3619.msg13530.html#msg13530
So perhaps this operator suffers from the same problems as the association rules operator, and you could stave off disaster ( the Java choke messages ) by setting the frequency bar a bit higher.
As it happens I do a lot of exactly this," if A & B & C at T1 then is D true at T2 with sufficient frequency etc.". If you are looking for rules with only one item in the head then you can copy the attribute you want to represent D, move all the values up on example, do your itemset mining only and then look in the frequent itemsets for that new attribute. Simples!
Because I do this on ~1,000,000 timeslots I ended up writing an operator to outsource the actual itemset generation via a separate service, which is very fast at this, because it is CUDA based; also I needed to write an operator to do the next period peeking. It has revealed interesting stuff in my domain - but it did take quite an effort.
Good luck with your project.
Greetings,
Nils
I get the error when automatically switching to the results view. The process itself works fine, the problem ony occurs when showing the results. @haddock:
I did try lagging the attributes, unfortunately the offset between my attributes isn't constant. Therefore I wanted to use a time intervall
// Edit
I just uninstalled the newest version and reinstalled an older one (5.0.010) and now it seems to works. I wonder what will happen, if I update to the newest version again...
I should be gone with the next bugfix release...
Greetings,
Nils