Options

# "Newbie: clustering with k-means and then using multipe regression linear"

abudcoolabiz
Member Posts:

**3**Contributor I
Hi, I'm a student from Indonesia.

Currently I'm working on my assignment to predict lead time manufacturing on make to order manufacture. The dependent attribute is lead time (continuous). and the independent variable are routing, material, etc.

I'm going to use clustering (k-means) to make a few groups of data. and then I will make rules of each group using multiple regression linear.

I'm very new to this software. So the first question is, Can I do these method with Rapid Miner? Then how can I accomplished it? try with simple and detail step please... thank you so much for the attention. and I really hope you can help me.

best regards,

Currently I'm working on my assignment to predict lead time manufacturing on make to order manufacture. The dependent attribute is lead time (continuous). and the independent variable are routing, material, etc.

I'm going to use clustering (k-means) to make a few groups of data. and then I will make rules of each group using multiple regression linear.

I'm very new to this software. So the first question is, Can I do these method with Rapid Miner? Then how can I accomplished it? try with simple and detail step please... thank you so much for the attention. and I really hope you can help me.

best regards,

Tagged:

0

## Answers

2,531Unicornfor getting more familiar with the software, I really recommend to do the online tutorial available in the welcome screen.

If you have a basic understanding of the software, here is it, how it works for your assignment. But because I'm not getting paid for solving your exercises, I must hold this quite short, but you will get an idea how to work this out.

Here's a process setup, which will first load data (in this case it will be generated randomly). Then KMeans is used as clustering algorithm, which will assign each example to a cluster. This cluster attribute is then used as label, hence the target variable for a following linear regression. You might exchange operator with an analog operator, for example use LinearDiscriminantAnalysis instead of linear regression, or use another clustering operator. I hope that this will help you, solving your problem.

Greetings,

Sebastian

3Contributor Ii really appreciate it.

I've read the tutorial but its getting me dizzy cos english is not my main language. but i'm still good tough hahaha.

so, i can build a model which is clustering the data and then i made a rule on each cluster data set?

thanks again.

nawaf, Indonesia.

3Contributor Iit works... wow great! thanks a lot pal.