05-16-2017 07:36 AM
05-16-2017 08:06 AM - edited 05-16-2017 08:08 AM
If those files are in different directories, you could use the Process Documents from Files operator. That way you can tag each directory with a label, so that when you build a model (e.g. Naive Bayes) you can see how well the documents in each class are classified.
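If it helps to see the directory-as-label idea outside of RapidMiner, here is a rough Python sketch of it. This is not what the operator does internally, just an illustration. It assumes one subdirectory per class containing plain `.txt` files, and the function name `load_labeled_documents` is made up for this example.

```python
from pathlib import Path


def load_labeled_documents(root):
    """Return (text, label) pairs; the label is the subdirectory name."""
    pairs = []
    # Assumed layout: root/<label>/<document>.txt
    for class_dir in sorted(Path(root).iterdir()):
        if not class_dir.is_dir():
            continue  # ignore stray files sitting next to the class folders
        for doc in sorted(class_dir.glob("*.txt")):
            text = doc.read_text(encoding="utf-8", errors="replace")
            pairs.append((text, class_dir.name))
    return pairs
```

The resulting pairs could then be handed to whatever learner you prefer, with the second element of each pair as the class label.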
Since the links are in an XLS file, you could use the Get Pages operator in conjunction with a loop to extract each URL, get the page, and save it.
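For comparison, the same loop (take each URL, get the page, save it) could look roughly like this in plain Python. It is only a sketch and assumes the URLs have already been read out of the spreadsheet into a list. The names `fetch` and `save_pages` and the `page_0001.html` naming scheme are invented for the example.

```python
import urllib.request
from pathlib import Path


def fetch(url, timeout=30):
    """Download one page and return its raw bytes."""
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return resp.read()


def save_pages(urls, out_dir, fetcher=fetch):
    """Fetch every URL and write each page to its own file in out_dir."""
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    saved = []
    for i, url in enumerate(urls, start=1):
        url = url.strip()
        if not url:
            continue  # skip empty spreadsheet cells
        try:
            body = fetcher(url)
        except (OSError, ValueError) as err:
            # Network and HTTP errors are OSError subclasses;
            # a malformed URL raises ValueError.
            print(f"skipping {url}: {err}")
            continue
        target = out / f"page_{i:04d}.html"
        target.write_bytes(body)
        saved.append(target)
    return saved
```

The file number follows the position of the URL in the list, so a failed download leaves a gap in the numbering rather than shifting the later files.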
05-16-2017 08:14 AM