The Altair Community is migrating to a new platform to provide a better experience for you. The RapidMiner Community will merge with the Altair Community at the same time. In preparation for the migration, both communities are on read-only mode from July 15th - July 24th, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here.
Options

Modification request to LinearRegressionModel

mtgreenmtgreen Member Posts: 9 Contributor II
edited June 2019 in Help
I have a simple modification request.  I'd like to have the LinearRegressionModel (maybe other regression models) modified such that it does not require the input exampleSet exactly match the exampleset used to train the model.  Instead, I think that the inputed example set should merely contain all required attributes for the model (in any order).  The great news is that the change can be done without modifying any method signatures. 

This could be done very easily using a hashmap (attribute name paired with coefficient).  All of the required information to do this is provided in the constructor.  I'm happy to provide my tested code if that makes life simpler.  Just a (very) small way of giving back for such a great toolkit.

mtgreen

Answers

  • Options
    mtgreenmtgreen Member Posts: 9 Contributor II
    one additional note...

    My suggested method would require matching attribute names for the inputed example set. I can see why this may also be undesirable.  The code could be written to default to one (e.g., name match) and then switch to the second if all required attribute names are not in the provided exampleset.  You can get the best of both worlds.
  • Options
    landland RapidMiner Certified Analyst, RapidMiner Certified Expert, Member Posts: 2,531 Unicorn
    Hi,
    the behaviour you wished to have is in fact the behavior we aim at. But some, especially the early implemented learners, are not adapted yet. All I can say: its on our ToDo.

    Greetings,
      Sebastian
  • Options
    landland RapidMiner Certified Analyst, RapidMiner Certified Expert, Member Posts: 2,531 Unicorn
    Hi,
    the linear regression model now uses the attributes names of needed attributes for calculation on apply time.
    Hence it not only works on extended data sets, instead it even copes with example sets with fewer attributes than on training time, if only attribute having a weight of 0 were removed.

    Greetings,
      Sebastian
Sign In or Register to comment.