Options

"Bug in SparseFormatExampleSource?"

Legacy UserLegacy User Member Posts: 0 Newbie
edited May 2019 in Help

I read in some examples from a DB and vectorized them with the text tools. The resulting data set looked like this:

image

I saved it to a file with an ExampleSetWriter. Then I read it back in using a SparseFormatExampleSource. The result was:

image

Note that the word 'weight' has been _inserted_ into the Name column, pushing the other values down. But its value seems to have _overwritten_ the Statistics column.

Bug? Could it have to do with 'weight' being a special weighting column? I couldn't reproduce the problem with the sample newsgroup data. Maybe it didn't contain the word 'weight'.
Tagged:

Answers

  • Options
    Legacy UserLegacy User Member Posts: 0 Newbie
    To clarify, I didn't add any weighting column... Just sourced the text, applied the StringTextImput operators, saved, then tried to read in the file. Using both write and read operators creates two datasets, the original and the newly read one. So it's easy to see the differences.
  • Options
    IngoRMIngoRM Administrator, Moderator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, Community Manager, RMResearcher, Member, University Professor Posts: 1,751 RM Founder
    Hi,

    did you try loading the data with and witout defining the .aml file?

    Cheers,
    Ingo
Sign In or Register to comment.