RapidMiner

Using "Read Documents from Files" for Hundreds of Single-Document Classes

Emi
Contributor

Hi,

I have about a thousand documents in a single folder and would like to process them with the "Read Documents from Files" operator. As I understand it, I need a separate folder for each document if I want each document to be its own class. Is there another operator, or a smarter way, to achieve the same thing? Manually creating a folder for each document, and then manually entering each folder as a class in the "text directories" parameter of the operator, seems impractical.

Thank you.
1 REPLY
Super Contributor

Re: Using "Read Documents from Files" for Hundreds of Single-Document Classes

Hello

The file name is provided as metadata, so you can keep all the documents in one folder and parse the file name with the Generate Attributes operator to derive a class label for each document.
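
To illustrate the idea outside RapidMiner, here is a minimal Python sketch of the same parsing step: it derives one label per document from the file name by dropping the extension. The function names `label_from_filename` and `labels_for_folder` are hypothetical, and the assumption that the label is simply the name without its extension is mine. Inside RapidMiner you would apply the equivalent string operation to the file-name metadata attribute (probably called `metadata_file`, but check your example set).

```python
from pathlib import Path
from typing import Dict


def label_from_filename(file_name: str) -> str:
    """Return the file name without its final extension (assumed labeling rule)."""
    return Path(file_name).stem


def labels_for_folder(folder: str) -> Dict[str, str]:
    """Map each regular file in `folder` to a label derived from its name."""
    return {
        path.name: label_from_filename(path.name)
        for path in sorted(Path(folder).iterdir())
        if path.is_file()  # skip subdirectories
    }
```

Calling `labels_for_folder` on the single folder that holds all the documents yields one distinct label per file, provided no two files share a name apart from the extension.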

Regards

Andrew