Due to recent updates, all users are required to create an Altair One account to login to the RapidMiner community. Click the Register button to create your account using the same email that you have previously used to login to the RapidMiner community. This will ensure that any previously created content will be synced to your Altair One account. Once you login, you will be asked to provide a username that identifies you to other Community users. Email us at Community with questions.

forecast SVR

sarah_mi88sarah_mi88 Member Posts: 4 Learner I
edited November 2019 in Help
Hi everyone,

I want to apply support vector regression with sales data for training from 2016-2017 and for testing from 2018 (label date). My aim is to see the forecast value for the next 4 periods. But operator "apply forecast" doesn't work and operator "Performance (Regression)" doesn't evaluate labels of type date. For parameters choosen see screenshot below. If any data are missing, pls comment. What do I have to do??

Thx and cheers,
Sarah
Tagged:

Best Answer

Answers

  • varunm1varunm1 Member Posts: 1,207 Unicorn
    Hello @sarah_mi88

    It says that you set a column with date data type as label column. Did you set that? Can you provide .rmp file (File --> Export Process) and dara for us to debug?
    Regards,
    Varun
    https://www.varunmandalapu.com/

    Be Safe. Follow precautions and Maintain Social Distancing

  • sarah_mi88sarah_mi88 Member Posts: 4 Learner I
    Hello @varunm1

    thanks for your help! I specified the label and get now the prediction values. But how do I find "good" values for gamma, C, epsilon/nu and p? (nu-SVR or epsilon-SVR, I want to do regression). What is common practice? Doing CV? But how? See .rmp file in attachment. Currently the prediction doesn't include trend, seasonality; the predicted value is the same for the whole test interval.

  • sarah_mi88sarah_mi88 Member Posts: 4 Learner I
    Thank you so much. Really appreciating it. Besides I get this error. Can you help me with that too? (attached xlsx)
  • varunm1varunm1 Member Posts: 1,207 Unicorn
    Hello @sarah_mi88

    This error comes when your dataset has irregular information in the date column. For time series, you need to have a monotonically increasing date column (you should not have the same date and time repeating in your dataset).

    Based on the dataset you gave (very small dataset). Please find the working process. 
    Regards,
    Varun
    https://www.varunmandalapu.com/

    Be Safe. Follow precautions and Maintain Social Distancing

  • sarah_mi88sarah_mi88 Member Posts: 4 Learner I
    Hello Varun

    ok. Sorry for the stupid question but why is the value always the same ? (no trend, saisonality,  same prediction for Q1-4)
  • varunm1varunm1 Member Posts: 1,207 Unicorn
    I observed that the model is doing worse. If you see the squared correlation value from the performance it is zero which means the model is not at all good. This may be due to fewer data in your dataset (7 examples is very small). Try simple models like GLM and see how it goes, you can also look at time series models like ARIMA.
    Regards,
    Varun
    https://www.varunmandalapu.com/

    Be Safe. Follow precautions and Maintain Social Distancing

Sign In or Register to comment.