The Altair Community is migrating to a new platform to provide a better experience for you. The RapidMiner Community will merge with the Altair Community at the same time. In preparation for the migration, both communities are on read-only mode from July 15th - July 24th, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here.

Prediction Model + Result Analysis

asiddiqasiddiq Member Posts: 25 Contributor II
I have (24 Columns, and 5100 Rows) Data that contain the following attributes [Dengue Fever Data(district name, gender, nationality, week and year of record the case), Air quality Data (temperature, Humidity, rainfall, and other)], for the period between 2010 to 2018. I would like to create a prediction model that involve the following steps:
1. Dimensionality reduction
2. Clustering
3. Linear regression.
4. Time Series Analysis.

I have tried simple design but I got the following result, and I'm not sure if my work is right to not!?

Best Answer


  • Options
    hbajpaihbajpai Member Posts: 102 Unicorn
    Hi @asiddiq,

    The result you shared shows the Linear regression model and it shows the coefficient of your variable as well as the importance of the variable. Since, you have the data for Dengue fever, are you trying to predict how many people will suffer for it based on a time series prediction? I am unable to follow your motivation for Dimensionality reduction and Clustering. Can you please elaborate?
    Also, from your problem statement feature engineering in terms of seasonality and weather patterns would be an essential step for developing predictive model.
  • Options
    asiddiqasiddiq Member Posts: 25 Contributor II
    I would like to predict future patents and future location risk areas. The reduce dimension and clusters are work together to replace the missing values by using the k-nearest method. is it clear!
Sign In or Register to comment.