Due to recent updates, all users are required to create an Altair One account to login to the RapidMiner community. Click the Register button to create your account using the same email that you have previously used to login to the RapidMiner community. This will ensure that any previously created content will be synced to your Altair One account. Once you login, you will be asked to provide a username that identifies you to other Community users. Email us at Community with questions.
Log File Import
Hey Everyone,
I'm trying to use RapidMiner to Analyze Some Logged Events which I am getting from a System. Unfortunately, the Logs are not very well sorted and need cleaning up. I have tried the csv-import module on the textfile-logs which I have, but even with regex-matching I find it to be not customizable enough to cover all items in the log. Is there a similar extension or Plugin with more flexible Definitions which I could use to import logs into a Table?
Eventually, the lines look a bit like
<some Information> :Event :Timestamp : <some item containing "::" sometimes>: Action
but not all do, there are simple one-worder lines as well.
Do I need to buid external Data Preprocessing, or can this be done within RM?
I tried searching the forum and the extension marketplace as well with limited success. Any recommendations?
Best regards,
Lino
0
Answers
Could you share your log file, or a least a sample of this log file, in order we can
better understand and try to find the more relevant solution ?
Regards,
Lionel
I builded a process which extract the information like that :
the process :
Regards,
Lionel
What are the extension / type of your simple text/logfile ?
Regards,
Lionel
Lindon Ventures
Data Science Consulting from Certified RapidMiner Experts
Your static example (with id1 / id3) is possible using the Lag Series operator of the Values Series extension (to install from MarketPlace).
But I understand , you are searching a more general use : Can you describe the "rules" to create your new attribute and the "rule(s)" to subtracting the time values ?
Regards,
Lionel
- You are looking only at the "BLUETOOTH_CONTROL " action ?
- If I good understand for example, the first BLUETOOTH_CONTROL is associated to the first "mouse pressed" ?
and you want calculate the timestamp difference between these 2 events ? and write this value in a new attribute at the level of the "BLUETOOTH_CONTROL" row ?
- Is the "BLUETOOTH_SOURCE" action is associated to the second "mouse pressed" ?
Regards,
Lionel
OK, your explanations are clear : no need of drawing
Unfortunately, I didn't find a solution with RapidMiner's native operators, so I propose
a process using a Python script. In the results, you have a new column called "timestamp_diff" :
To execute this process, you need to :
- install Python on your computer.
- install the Python Scripting extension (from the marketplace).
The process :
Hope it helps,
Regards,
Lionel
Don't hesitate to post your future questions, here in the community.
Good continuation...
Best regards,
Lionel