Information for bachelorthesis
at the moment I'am writting my bachelorthesis for a german company.
My subject is to show some possibilities how huge amounts of data can be summarized. The data aren't stored in a database, they arrive for example in a email box with pdf-format or office(word/excel)format. The person who sends the data shouldn't have any work to change or fit the data in a special format.
Is it possible to use a rapidminer programm to get the crucial information out of a mass of data? and can I track information back to the document??
I would be very greatful if i get some inforamtions.