
"Item Distribution Performance Bug?"

Fabian_Wewers Member Posts: 3 Contributor I
edited May 2019 in Help
Hello everybody,

I am working on a small project and found something puzzling. In my opinion, the Item Distribution Performance (Cluster Performance) should be one if the whole data set belongs to exactly one cluster (out of n) and zero if the data is uniformly distributed across the clusters.
Unfortunately, my observations with the GiniCoefficient showed a different result.
To find out what was going wrong, I read the GiniCoefficient.java file and tried to implement the function in OpenOffice Calc.

My example for testing my implementation:
Three clusters with C1=108, C2=247, C3=44 members.
Both my Gini coefficient and RapidMiner's came out as 0.997, but the clusters are nearly uniformly distributed, so the result should not be close to one!
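
For comparison, the same cluster sizes can be run through the textbook Gini coefficient (mean absolute difference over all ordered pairs, divided by twice the mean). This is only a minimal sketch and not the code in GiniCoefficient.java; the names TextbookGini and gini are made up for illustration, and since RapidMiner may use a different formula, the values need not match either the 0.997 above or any spreadsheet result.

```java
import java.util.Locale;

final class TextbookGini {
    // Textbook Gini coefficient over k cluster sizes:
    //   G = sum_i sum_j |x_i - x_j| / (2 * k^2 * mean)
    // Assumption: this is the generic definition, NOT RapidMiner's implementation.
    static double gini(double[] x) {
        int k = x.length;
        double sum = 0.0;
        for (double v : x) {
            sum += v;
        }
        if (k == 0 || sum == 0.0) {
            return 0.0; // no clusters or no members: treat as perfectly even
        }
        double mean = sum / k; // mean cluster size, divided by the number of clusters
        double absDiff = 0.0;
        for (double a : x) {
            for (double b : x) {
                absDiff += Math.abs(a - b);
            }
        }
        return absDiff / (2.0 * k * k * mean);
    }

    public static void main(String[] args) {
        double[][] cases = {
            {108, 247, 44},  // cluster sizes from the post
            {133, 133, 133}, // perfectly uniform
            {399, 0, 0}      // everything in one cluster
        };
        for (double[] c : cases) {
            System.out.println(String.format(Locale.ROOT, "%.3f", gini(c)));
        }
        // prints 0.339, 0.000, 0.667
    }
}
```

Under this unnormalized definition, the single-cluster case gives (k-1)/k rather than one; multiplying by k/(k-1) would map it to one. Whether RapidMiner applies such a normalization is not stated here.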
I then tried to find the error and compared the Gini function with the squared error function. I think the mean in GiniCoefficient.java is computed incorrectly, because it always evaluates to one:
double mean = sum / n;
The n in line 39 should be x.length; otherwise n is 399 (the number of rows) in my example and not 3 (the number of clusters). With n changed to 3, my result for the given example is 0.491.
double mean = sum / x.length;
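
The effect of the two denominators can be checked in isolation. The following is a minimal sketch, assuming x holds the three cluster sizes and n holds the total number of rows, as described in this post; the class and method names are hypothetical and do not come from GiniCoefficient.java.

```java
final class MeanDenominator {
    // Sum of all cluster sizes.
    static double sum(double[] x) {
        double s = 0.0;
        for (double v : x) {
            s += v;
        }
        return s;
    }

    // Mean as described for the suspected bug: divide by the total row count n.
    static double meanOverRows(double[] x, double n) {
        return sum(x) / n;
    }

    // Proposed fix: divide by the number of clusters, x.length.
    static double meanOverClusters(double[] x) {
        return sum(x) / x.length;
    }

    public static void main(String[] args) {
        double[] x = {108, 247, 44}; // cluster sizes from the example
        double n = sum(x);           // assumption: n equals the row count, 399

        System.out.println(meanOverRows(x, n));  // prints 1.0
        System.out.println(meanOverClusters(x)); // prints 133.0
    }
}
```

When n equals the sum of the cluster sizes, the first mean is one for any input, which matches the observation that it "always" becomes one.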
Can anybody else confirm my test? I hope that I have made a mistake :-).

Greetings


Fabian

Answers

  • land RapidMiner Certified Analyst, RapidMiner Certified Expert, Member Posts: 2,531 Unicorn
    Hi Fabian,

    I hope so too, but please file a bug report on bugs.rapid-i.com anyway. We will check it as soon as possible.

    Greetings,
      Sebastian