How to delete attributes or rows of the exampleset automatically
Hi guys,
I am a new member of the RapidMiner community and would like to know how I can automatically remove several attributes or rows that contain certain kinds of values. For my application I don't need the timestamps and would like to delete them from my example set. Thank you.
Best Answer
lionelderkrikor (RapidMiner Certified Analyst, Member)
Hi @sharki,
That was an interesting one to solve!
The idea is to replace the timestamp values with the character "?", using a regex to capture the timestamp values in each attribute. For this I use the Generate Attributes operator with the following expression:
if(finds(eval(concat("att_",%{iteration})),"(0[1-9]|[1-2][0-9]|3[0-1]).(0[1-9]|1[0-2]).[0-9]{4} (2[0-3]|[01][0-9]):[0-5][0-9]"),"?",eval(concat("att_",%{iteration})))
Then I loop over the attributes to remove the example(s) that contain the character "?".
Note that the attribute names have to be "att_1", "att_2", "att_3", etc., but according to the last screenshot of your first post this is already the case.
You just have to put the process in the attached file at the end of your own process.
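For readers who want to see the logic outside RapidMiner, here is a minimal Python sketch of the same two steps: mark every value that contains a timestamp with "?", then drop the rows that contain a "?". It is only an illustration, not the attached process. The function names and the sample rows are made up for this example, and the regex is copied from the expression above with its dots left unescaped, so they match any separator character.

```python
import re

# Same pattern as in the expression above (DD.MM.YYYY HH:MM).
# The dots are unescaped, as in the original, so they match any character.
TIMESTAMP = re.compile(
    r"(0[1-9]|[1-2][0-9]|3[0-1]).(0[1-9]|1[0-2]).[0-9]{4} "
    r"(2[0-3]|[01][0-9]):[0-5][0-9]"
)


def mark_timestamps(rows):
    """Replace every value that contains a timestamp with "?"."""
    return [
        {name: "?" if TIMESTAMP.search(str(value)) else value
         for name, value in row.items()}
        for row in rows
    ]


def drop_marked_rows(rows):
    """Keep only the rows in which no value equals "?"."""
    return [row for row in rows if "?" not in row.values()]


# Hypothetical example data, using the att_1, att_2 naming from the thread.
rows = [
    {"att_1": "Temperature", "att_2": "21.5"},
    {"att_1": "01.03.2019 10:20", "att_2": "01.03.2019 10:30"},
    {"att_1": "Pressure", "att_2": "1.02"},
]

cleaned = drop_marked_rows(mark_timestamps(rows))
print(cleaned)
```

One caveat with this approach: a row that legitimately contains "?" as a value would be dropped as well, so it only works if "?" does not otherwise occur in the data.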
Hope this helps,
Regards,
Lionel
PS: Also attached is the .xls file I used to create a fictitious example set representative of yours.
Answers
I may have an idea. Can you share your dataset?
In addition, can you elaborate:
What exactly do you want to do?
- If an attribute contains at least one date, do you remove this attribute?
- If a row contains at least one date, do you remove the row?
Regards,
Lionel
Here is my process so far.
<?xml version="1.0" encoding="UTF-8"?>
After the Combine Documents operator I get a dataset that looks like this:
The example set looks like this before the transpose:
and like this at the end of the whole process:
If you take a look at my first picture, you can probably see what I intend with my setup. If I could clear the timestamps out of the rows, I would get exactly the same dataset as in the first picture. Because I work with a dynamic dataset, I would like to know how to delete the rows, columns, or unwanted values in my example set automatically, so that I don't have to delete the rows, columns, attributes, or values by hand. Sorry for the long answer. I hope I have expressed clearly what I would like to do.
What is the complete pattern of your timestamp? (i.e. DD.MM.YYYY, or something else?)
In the screenshot you shared the timestamp is truncated, so I cannot determine it.
Regards,
Lionel
The pattern of the timestamp is DD.MM.YYYY HH:MM, I guess. Every ten minutes the application records the values of the parameters, which are measured by several sensors.
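If the pattern really is DD.MM.YYYY HH:MM, one way to check individual values is to try parsing them with that format. The small Python sketch below assumes exactly this format; the helper name `is_timestamp` is made up for illustration. Unlike a plain regex, parsing also rejects impossible dates such as 31.02.

```python
from datetime import datetime


def is_timestamp(value, fmt="%d.%m.%Y %H:%M"):
    """Return True if value parses as a DD.MM.YYYY HH:MM timestamp."""
    try:
        datetime.strptime(value.strip(), fmt)
        return True
    except ValueError:
        # Raised for non-matching text and for impossible dates.
        return False


for sample in ["01.03.2019 10:20", "31.02.2019 10:20", "Temperature"]:
    print(sample, is_timestamp(sample))
```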
Good luck with your study!
Regards,
Lionel