How can I use deep learning with windowing operator when the horizon is larger than one?

hsanchez · June 2020

Hello Guys,
I am trying to use the example process "s&p 500 regression using windowing and convolution". It works well for predicting the next day's price when the Windowing operator uses horizon = 1; however, if the horizon is larger than 1 (a forecast a few days ahead), the Deep Learning operator fails.
Question: Do you have an example where I can use deep learning, windowing and horizon > 1? I would be happy if the example "s&p 500 regression using windowing and convolution" could be modified to handle horizon > 1. I am aiming to forecast the price for the next few minutes, so I need horizon > 1.

I have also tried the same Deep Learning operator used in the example above, this time with multi-horizon forecasting, and the same problem occurs. Deep learning can't handle horizon > 1. I am not an expert in the Deep Learning operator, but I think the apparent limitation is associated with multi-label handling?
Other operators, such as Gradient Boosted Trees, work well with horizon > 1.

Below I have attached the process.

<?xml version="1.0" encoding="UTF-8"?><process version="9.6.000">

<description align="center" color="transparent" colored="false" width="126">Reducing the data to the attribute we want to predict: 'Close', which is the closing price of the respective stocks.</description>

</operator>

<description align="center" color="transparent" colored="false" width="126">Often normalizing data helps a neural network to perform better.</description>

</operator>

<description align="center" color="transparent" colored="false" width="126">Using windowing to convert the data into a form that displays one entry as an attribute with the preceding 30 entries as additional attributes.</description>

</operator>

</enumeration>

<description align="center" color="transparent" colored="false" width="126">Split data into training and test.</description>

</operator>

</process>

<description align="center" color="transparent" colored="false" width="126">Data Preparation: Normalization, Windowing, Label Setting</description>

</operator>

</process>

<?xml version="1.0" encoding="UTF-8"?><process version="9.6.000">

</operator>

</operator>

<description align="center" color="transparent" colored="false" width="126">Often architectures using convolutional layers end with a fully-connected layer before the last layer.</description>

</operator>

<description align="center" color="transparent" colored="false" width="126">Since regression is performed, one neuron and the 'None (identity)' activation function have to be used.</description>

</operator>

<description align="center" color="gray" colored="true" height="63" resized="false" width="712" x="75" y="448">This network architecture uses convolutional and pooling layers in combination with standard fully-connected layers.</description>

<description align="center" color="yellow" colored="false" height="407" resized="false" width="167" x="75" y="32">A convolutional layer uses a sliding window to only take a subset of the provided information into account. This is done multiple times (= activation map count), while automatically changing the so-called kernel that is used as a mask for windowing. This method has the advantage of being able to focus on local patterns.</description>

<description align="center" color="yellow" colored="false" height="313" resized="false" width="183" x="269" y="34">A pooling layer eases the training process by reducing the information. Here only the maximum value of each 2x2 kernel window (created in the previous Convolutional Layer) is kept.</description>

</process>

<description align="center" color="transparent" colored="false" width="126">Open the Deep Learning operator by double-clicking on it to discover the layer setup.</description>

</operator>

</process>

jacobcybulski · July 2020

I see your problem. The issue is that RM Deep Learning (both built-in and the extensions) currently does not support returning results as tensors, i.e. multiple labels or vectors of them. This means that with Deep Learning forecasting you are limited to a horizon of one, regardless of whether you use a CNN or an LSTM. However, as @Telcontar120 suggested, RM has multi-horizon operators, both in the new Forecasting extension and in the Time Series extension, which could solve your problem using classical forecasting algorithms.
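To make the multi-label point concrete, here is a minimal plain-Python sketch of what windowing produces once the horizon is larger than one. The function name `make_windows` and the toy price list are made up for illustration, and the sketch assumes "horizon" means the next `horizon` values directly after the window; it is not the RapidMiner operator itself.

```python
def make_windows(series, window_size, horizon):
    """Turn a series into (features, labels) rows.

    Each row uses `window_size` past values as features and the next
    `horizon` values as labels, so horizon > 1 gives several labels per row.
    """
    rows = []
    last_start = len(series) - window_size - horizon
    for start in range(last_start + 1):
        features = series[start:start + window_size]
        labels = series[start + window_size:start + window_size + horizon]
        rows.append((features, labels))
    return rows


# Hypothetical toy data, not real prices.
prices = [10, 11, 12, 13, 14, 15, 16]
for features, labels in make_windows(prices, window_size=3, horizon=2):
    print(features, "->", labels)
```

With `horizon=2` every row carries two label values instead of one, which is the case a single-label learner cannot take directly.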

Telcontar120 · July 2020

If forecasting the S&P Index was easy then there would be a lot of rich data scientists :-)
You might want to look at the new Forecasting extension, which has some automated operators for both univariate and multivariate forecasting.

lionelderkrikor · July 2020

Hi @hsanchez

I agree with Brian, and I can't resist quoting Pierre Dac:

"...Forecasting is difficult, especially when it comes to the future..."

Regards,

Lionel

jacobcybulski · July 2020

Another possibility is to use the Python extension: prepare your data in RM, do all your Deep Learning magic in Python with TensorFlow (for example), and then return the multiple-horizon output as a vector back to RM.
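As a rough illustration of the "return the horizon as a vector" idea, the sketch below deliberately avoids TensorFlow and uses only the standard library. It is a stand-in, not the approach described above: a single linear layer with one output neuron per horizon step and identity activation, trained by plain stochastic gradient descent. All names (`train_multi_output`, `predict`) and the toy series are assumptions for illustration; a real network would add the convolutional or recurrent layers in front of this output layer.

```python
import random


def train_multi_output(rows, window_size, horizon, epochs=1000, lr=0.05, seed=0):
    """Fit one linear layer with `horizon` identity-activated outputs.

    rows: list of (features, labels) pairs, as produced by windowing.
    Returns (weights, biases); weights[k] maps a window to horizon step k.
    """
    rng = random.Random(seed)
    weights = [[rng.uniform(-0.1, 0.1) for _ in range(window_size)]
               for _ in range(horizon)]
    biases = [0.0] * horizon
    for _ in range(epochs):
        for features, labels in rows:
            for k in range(horizon):
                pred = sum(w * x for w, x in zip(weights[k], features)) + biases[k]
                err = pred - labels[k]
                # Gradient step on the squared error of output neuron k.
                for i in range(window_size):
                    weights[k][i] -= lr * err * features[i]
                biases[k] -= lr * err
    return weights, biases


def predict(weights, biases, features):
    """Return the whole horizon as one vector, one value per step."""
    return [sum(w * x for w, x in zip(wk, features)) + bk
            for wk, bk in zip(weights, biases)]


# Hypothetical, already normalized toy series: 0.0, 0.1, ..., 1.0.
series = [i / 10 for i in range(11)]
rows = [(series[s:s + 3], series[s + 3:s + 5]) for s in range(len(series) - 4)]
weights, biases = train_multi_output(rows, window_size=3, horizon=2)
print(predict(weights, biases, [0.8, 0.9, 1.0]))
```

On this exactly linear toy series the two outputs should land close to 1.1 and 1.2, i.e. the next two steps; the list returned by `predict` is the kind of vector that would be handed back to RM.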

hsanchez · July 2020

Hello Guys, thank you very much for taking the time to look at my post and for answering my question with good humor, @Telcontar120. Thanks also to @lionelderkrikor for the comments and to @jacobcybulski for his answer; I would say that @jacobcybulski answered my question. Yes, sure, stock market prediction is difficult, but that does not stop us from trying and challenging ourselves with something very close to "can we forecast the emotions of a group of people?". Each stock groups people by a common interest and behavior. I would say each stock represents, let's say, an average emotion of the people forming that stock. Yes, sure, it is chaotic/random, but there should be a way to get it right "some times". I am not a market specialist, I am just curious, and I have never read a book about the stock market. With curiosity as the driver and RapidMiner as a tool of hope, I decided to try something that yielded, let's say by luck, 400 dollars. Yes, that will not make you rich, @Telcontar120, but I was able to buy tons of satisfaction. By the way, I used GBT, and I would say that algorithm rocks!!! It is wonderful.
My process: pull the intraday data using Python -> apply STL to remove some noise -> normalize the series (I am using multiple variables) -> weight by PCA to focus on the variables that really matter -> windowing (to train, validate, and keep my last row for applying the model), plus a second, parallel windowing to enrich the data (feature generation) with parameters such as min, max, standard deviation, etc. Split the data into three parts: train, evaluate the model, and keep the last row as unseen data. Run multi-horizon forecasting using GBT (GBT, wow, when you tune it!) and measure multi-horizon performance. Apply the model to the unseen data and measure multi-horizon performance again. Finally, tune ARIMA and apply it to the series to compare its forecasting performance with that of the GBT model.
Then you can get some satisfaction when "some times" you get it right.
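The parallel windowing step for feature generation could look roughly like the sketch below. It is only an illustration in plain Python; the function name `window_stats`, the choice of statistics, and the toy values are assumptions, and the process itself does this with RapidMiner operators.

```python
import statistics


def window_stats(series, window_size):
    """For each full window, return its min, max, mean, and sample std dev.

    window_size must be at least 2, because the sample standard deviation
    is undefined for a single value.
    """
    stats = []
    for start in range(len(series) - window_size + 1):
        window = series[start:start + window_size]
        stats.append({
            "min": min(window),
            "max": max(window),
            "mean": statistics.mean(window),
            "std": statistics.stdev(window),
        })
    return stats


# Hypothetical toy values, not real intraday prices.
for row in window_stats([1.0, 2.0, 3.0, 4.0, 6.0], window_size=3):
    print(row)
```

Each dictionary can be joined to the matching windowed row as extra attributes before training.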
What did I learn?

RapidMiner did a great job with this Time Series extension.
Each stock has its own emotion and behavior.
There is no such thing as a "free lunch": you have to develop one model per stock. Each stock has its own personality and emotions.
A well-tuned GBT can surprise you.

One remark: I am not an expert in time series or in the stock market. I am just curious, and the process described above may be missing steps.
Thank you guys for your time.
