Very strange - parameter changes during csv generation

spokspok · October 2014

Hi,

I'm rather new to this and am trying text mining on a few hundred documents. I tokenize, filter stopwords, apply the Porter stemmer, filter by length, transform to lower case, and write the result to CSV.

In the CSV file, some stemmed words have probabilities in the range of 10e+11, although they have probabilities well below 1 in the example set table in RapidMiner (as they should, in my opinion).

This effect is reproducible and seems to happen cumulatively for certain parameters (words).

The effect also occurs if I copy and paste the example set matrix from RapidMiner into Excel.

What is going wrong?

Many thanks for any assistance

die_eike · August 2019

Hi, I know this post is very old, but I recently had the same problem. It is probably a bug. I compared the results in RapidMiner with those in the CSV file. It's an error in the scientific (E) notation. Suppose RapidMiner shows a result with a negative exponent (e.g. 10E-11). When these results are written to CSV, the exponent sign suddenly changes to plus (e.g. 10E+11). That's the error. Any solutions, RapidMiner team?
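
For anyone who wants to check whether an exported file is affected, here is a minimal Python sketch. It assumes a semicolon-separated export and that every numeric weight should stay at or below 1, as described in the original post. The function name `find_suspicious_cells`, the column names, and the sample values are made up for illustration and do not come from RapidMiner.

```python
import csv
import io


def find_suspicious_cells(csv_text, delimiter=";", limit=1.0):
    """Return (row, column, value) for numeric cells whose magnitude exceeds `limit`."""
    hits = []
    reader = csv.reader(io.StringIO(csv_text), delimiter=delimiter)
    header = next(reader)
    for row_no, row in enumerate(reader, start=1):
        for name, cell in zip(header, row):
            try:
                value = float(cell)
            except ValueError:
                continue  # nominal (non-numeric) cell, nothing to check
            if abs(value) > limit:
                hits.append((row_no, name, value))
    return hits


# Hypothetical export: one weight has a positive exponent where a
# negative one would be expected.
sample = "word_a;word_b\n0.12;1.0E11\n3.0E-11;0.4\n"
for row_no, name, value in find_suspicious_cells(sample):
    print(f"row {row_no}, column {name}: {value:.1e}")
```

Any cell this reports can then be compared by hand against the value shown in the RapidMiner example set view.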

Marco_Boeck · August 2019

Hi,

May I ask which RapidMiner version you are using? I tested this against the latest version, and it runs fine. I created an example set with the Data Editor, and simply added 2 rows, one with 1E+11 and one with 1E-11, both in a numerical and in a nominal column. The CSV result is as expected:

"SciNo";"SciNoNom"
1.0E11;"1E+11"
1.0E-11;"1E-11"

Regards,
Marco
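
The test in the reply above can also be verified programmatically. The following Python sketch parses the CSV output quoted there and checks that the numeric column and the nominal column describe the same number in every row, which would fail if an exponent sign had flipped during export. The helper name `exponent_signs_match` is hypothetical, and the column names are taken from the quoted output.

```python
import csv
import io

# CSV text as quoted in the reply above (semicolon-separated, quoted strings).
exported = '"SciNo";"SciNoNom"\n1.0E11;"1E+11"\n1.0E-11;"1E-11"\n'


def exponent_signs_match(csv_text, numeric_col="SciNo", nominal_col="SciNoNom"):
    """True if each numeric cell equals the number spelled out in the nominal cell."""
    reader = csv.DictReader(io.StringIO(csv_text), delimiter=";")
    return all(float(row[numeric_col]) == float(row[nominal_col]) for row in reader)


print(exponent_signs_match(exported))
```

A flipped sign in the numeric column (for example `1.0E11` next to `"1E-11"`) would make the check return `False`.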
