🎉 🎉. RAPIDMINER 9.8 IS OUT!!! 🎉 🎉
RapidMiner 9.8 continues to innovate in data science collaboration, connectivity and governance
"Any tips on optimizing the Read XML operator?"
I've a rather lengthy process that at one point reads an XML file using the ReadXML operator and I've found this is a bottle neck in execution speed.
The XML file is only 1,000 records in total with 20 attributes of which the operator only extracts 4 of these fields. Yet it takes around 1minute 30 seconds to run each time. (Doesn't sound like much, but it's going to loop over several hundred of these files)
Are there any tips on speeding up execution time of this operator?
Would it help if I turned off Parse Numbers, Read not matching values as missings or changed data management from double_array to a different value?