Due to recent updates, all users are required to create an Altair One account to login to the RapidMiner community. Click the Register button to create your account using the same email that you have previously used to login to the RapidMiner community. This will ensure that any previously created content will be synced to your Altair One account. Once you login, you will be asked to provide a username that identifies you to other Community users. Email us at Community with questions.
Build a regression model FOR EACH example
leviavihay
Member Posts: 5 Learner I
in Help
Hi,
Following my previous question (https://community.rapidminer.com/discussion/55089), I'm posting a different question regarding my next step.
I got a data set in which each row is a series of [value, date] points.
My goal is to build a linear regression model for each row.
Is it possible?...
Thanks,
Avihay
Following my previous question (https://community.rapidminer.com/discussion/55089), I'm posting a different question regarding my next step.
I got a data set in which each row is a series of [value, date] points.
My goal is to build a linear regression model for each row.
Is it possible?...
Thanks,
Avihay
Tagged:
0
Answers
If you want something similar, you can also check out the "leave one out" cross-validation approach. This builds a model on n-1 examples (where n is your total example counts) and then validates that model on each example separately.
Lindon Ventures
Data Science Consulting from Certified RapidMiner Experts
I will go over the "Loop Examples" operator info, thanks.
Regarding you comment about whether it's even necessary - in this case each row is a different device. For each device I got different reads in different dates. I wish to build a linear regression model (for now) for each one to predict when it will go over a certain threshold (different one for each device)
I would certainly at least check the performance of such a combined model before I went down the road of building and managing many separate models.
Another significant problem with your approach is that it will be very difficult to measure or assess the accuracy of the approach over time, since you will only have one record for which you can validate the model in the future (presumably, although if you have multiple time periods from the same device then you might be able to increase your sample in that way).
Lindon Ventures
Data Science Consulting from Certified RapidMiner Experts