RapidMiner 9.7 is Now Available
Lots of amazing new improvements including true version control! Learn more about what's new here.
Reading non-standard data files structures - pls help
I am evaluating RapidMiner as a solution to performing research and applicatioin prototyping. It's important to have an easy way to import data easily and manipulate it into the structure I need it before storing the result to a DB - I need to create this capability to work repetetively for many files.
However, I have hit an early block, as although I can read in data from a file containing a standard table, I hit issues if the file contains a slightly different structure. Is there a straightforward way to read in csv and excel data when the header structure is either not standard or even repeats (e.g. multiple data sets in one file appended one after another)
I have provided one example of one of the data files below, in which one of the columns is time, however there is no date column as the date is instead stored as meta data in the top of the file. I need to add the date to the time to create a date-time column but I can't find a straightforward way to read in the different parts of the data file - meta data and column data - separately and consequently perform the data transformation to create a new table to store to the DB.
Any advice would be welcome.
|Offset||AIR/GROUND||GMT (HH:MM:SS)||PRESENT POSN LATITUDE (DEG)||PRESENT POSN LONGITUDE (DEG)||ALTITUDE (FEET)|