Symmetric mean absolute percentage error
Dear All,
I think it would be good to include the SMAPE measure in the RapidMiner regression performance operator.
http://en.wikipedia.org/wiki/SMAPE
This measure is more robust than correlation.
It allows regression results from different data sets to be compared.
It is symmetrical. (Well, not really, but better than MAPE.)
Best regards,
Wessel
edit:
More detailed discussion:
http://www.buseco.monash.edu.au/ebs/pubs/wpapers/2005/wp13-05.pdf
I'm not really sure what error measure is best.
But it is clear that a measure is needed which is somewhat independent of the data set.
For example, a measure that ranges from 0 to 1 instead of from -infinity to +infinity.
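To make the bounded range concrete, here is a rough Python sketch of one SMAPE variant. This is only an illustration, not how RapidMiner computes anything: the function name `smape` is my own, I am assuming the variant that divides by |actual| + |forecast| (which stays between 0 and 1; other variants divide by half of that and range up to 2), and I am assuming a term where both values are zero counts as zero error.

```python
def smape(actual, forecast):
    """Mean of |f - a| / (|a| + |f|), one of several SMAPE variants.

    Each term lies in [0, 1], so the mean does too.
    """
    if len(actual) != len(forecast) or not actual:
        raise ValueError("inputs must be non-empty and of equal length")
    total = 0.0
    for a, f in zip(actual, forecast):
        denom = abs(a) + abs(f)
        # Assumption: a == f == 0 is treated as a perfect prediction.
        total += abs(f - a) / denom if denom else 0.0
    return total / len(actual)


# Same absolute miss of 10 on an actual value of 100:
over = smape([100.0], [110.0])   # 10 / 210
under = smape([100.0], [90.0])   # 10 / 190
print(over, under)
```

The two results differ, which is the "not really symmetrical" part: with this variant an under-forecast is penalized slightly more than an over-forecast of the same size.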
second edit:
What constitutes a good error measure? First, a good error measure is clearly interpretable and summarizes the cost related to the error. Second, a good error measure is robust to outliers. Third, a good error measure is stable if only a few data points are used. Finally, a good error measure is unaffected by units of measurement.
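The last criterion (unaffected by units of measurement) is easy to check numerically. A small sketch, with made-up numbers and my own helper names `smape` and `rmse`, again assuming the SMAPE variant that divides by |actual| + |forecast|: rescaling the data from metres to millimetres leaves SMAPE unchanged, while RMSE grows by the same factor of 1000.

```python
import math


def smape(actual, forecast):
    # Assumed variant: mean of |f - a| / (|a| + |f|); 0/0 terms count as 0.
    terms = [
        abs(f - a) / (abs(a) + abs(f)) if (abs(a) + abs(f)) else 0.0
        for a, f in zip(actual, forecast)
    ]
    return sum(terms) / len(terms)


def rmse(actual, forecast):
    # Root mean squared error, which carries the units of the data.
    return math.sqrt(
        sum((f - a) ** 2 for a, f in zip(actual, forecast)) / len(actual)
    )


# Made-up example values, first in metres, then the same data in millimetres.
actual_m = [1.2, 3.4, 5.0, 7.5]
forecast_m = [1.0, 3.9, 4.5, 8.0]
actual_mm = [1000 * x for x in actual_m]
forecast_mm = [1000 * x for x in forecast_m]

print("SMAPE:", smape(actual_m, forecast_m), smape(actual_mm, forecast_mm))
print("RMSE: ", rmse(actual_m, forecast_m), rmse(actual_mm, forecast_mm))
```

This only demonstrates the unit criterion; it says nothing about robustness to outliers or stability on small samples.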