It looks like you're new here. Sign in or register to get started.
In MS Excel spreadsheets with macros/VBA enabled, have to be saved as so called xlsm - files.
Is there a solution available to read those files with the excel-reader-operator (only the data, not the VBA part)?
If the data is in a worksheet, then yes. If the data gets generated somehow from the VBA script, then probably not.
I was reading online somewhere that the xlsm files are kind of XML like. Can't guarentee that it'll work but try loading it with the Read XML operator. Just select "all files" when you import it and give it try.
Thank you Thomas, If I do so, RM tells something about invalid XML. Meanwhile I write out xlsx - files for RM via VBA. But it's a little bit anoying to do this within an automation pipeline with n files.
As developer (not an Java-expert) I could imagine that this implementation could be an relatively easy task, if a all-in-one library is used (simply add another extension here and there). My guess is, a sheet stays as sheet, regardless which format is used.
The advantage of Rapidminer-xlsm compatibility is obvious. It would provide greater standard- interoperability between MS Excel and RM, without the need to convert everything produced in Excel on pro grade level in other languages like VBA.
Thanks god there is already an Excel writer in RM, which is an awesome opportunity.
This might be a job for some Groovy Script, which I'm in the process of learning so I'm not help there. Maybe @BalazsBarany has some experience in this. .
Did you try to read your Excel files with Read Excel? At a customer, we process .xlsm files with RapidMiner all the time. They're just standard .xlsx files but with macros.
In my RM, no Read Excel operator nor Read Excel with Format operator understands .xlsm files.
Is this a kind of 'Schmarn'?
@BalazsBarany do we talk about the same RM-version (7.4.000)?
It worked for a long time before 7.4.
Maybe there are different kinds of .xlsm files and yours aren't compatible. The ones I processed are quite simple.
Then it could be a version thingy. I use the latest RM- and Excel versions.
Worth to investigate by the RM-team.
I have the same issue trying to read data from any XLSM file, I always get: "Error loading sheet content. Reason: Unable to recognize OLE stream."
I have found what appears to work, using the Read Excel operator...