When I try to store a dataset on a remote server, I get the following error message. It seems that it is trying to add the attribute names into a metadata table that already contains them. How do I clear these metadata tables out?
Solved! Go to Solution.
Is it possible to attach the more verbose error from inside the log view for that process? You should be able to access it from clicking the log icon next to the completion times in your process scheduler.
This fuller log segment will give us an idea of the specific error
Sure, here is the full error from the log file.
SEVERE: Process failed: com.rapidminer.operator.UserError: Cannot store data in repository at entry '../data/model_table_test'. Reason: Cannot store example set in database as es_31: com.microsoft.sqlserver.jdbc.SQLServerException: Violation of PRIMARY KEY constraint 'PK__es_31_me__1842C92D2CCBCF70'. Cannot insert duplicate key in object 'dbo.es_31_meta'. The duplicate key value is (STATUS).. Jul 19, 2017 12:56:02 PM <unknown> <unknown> SEVERE: Here: Process (Process) subprocess 'Main Process' +- Retrieve sample_table (Retrieve) +- prep table (Subprocess) subprocess 'Nested Process' | +- Set Role (Set Role) | +- Generate ID (Generate ID) | +- Nominal to Text (Nominal to Text) +- Multiply (Multiply) +- Filter disk vars (Select Attributes) +- Select id & desc (Select Attributes) +- Token and Filter (Process Documents from Data) subprocess 'Vector Creation' | +- Filter Tokens (by Content) (Filter Tokens (by Content)) | +- Transform Cases (Transform Cases) | +- Remove xslt (Execute Script) | +- Tokenize (Tokenize) | +- Filter Tokens (by Length) (Filter Tokens (by Length)) | +- Filter junk out (Execute Script) +- Execute GenAttr (Execute Process) +- Join (Join) +- Generate Attributes (Generate Attributes) +- Select Attributes (Select Attributes) ==> +- store model_table (Store)
the Primary key in the Meta data tables contains the Attribute names which need to be unique in RapidMiner (and they definitely are in your RapidMiner object you want to store).
Nevertheless I suppose there are characters in these names which may not be correctly interpreted by your database.
Just for testing purposes, could you please try and use the Operator Rename by Generic Names to rename all attributes.
The resulting ExampleSet should look like 'att_1', 'att_2',....
It turns out that there were two attributes that contained the same name and it makes sense that these were causing a violation of the primary key constraint in the database. I have removed some unnecessary columns and it now works.