[SOLVED] Using file names

LokyLoky Member Posts: 9 Contributor II
edited November 2018 in Help
Hi there,

After crawling various URLs I end up with a bunch of files and their names are based on some URL attributes.

Now, when I want to process those documents I'd like to to extract their name so I can use it in my final results set. Is there any way I can do that? Maybe some Macro inside Process Documents?

Thanks in advance for your answers.

Answers

  • MariusHelfMariusHelf RapidMiner Certified Expert, Member Posts: 1,869 Unicorn
    Hi Loky,

    after application of the Process Documents from File operator the resulting example set contains the filename and path as special attributes. Maybe you can get along with that?

    Kind regards,
    Marius
  • LokyLoky Member Posts: 9 Contributor II
    Thanks for your reply.

    Hmm... will try this out and see what I get.
  • LokyLoky Member Posts: 9 Contributor II
    Aha, found my problem!

    I wasn't able to use metadata_file because within my Process Document I had a JSON to XML operator.

    My solution  was to duplicate the process document, to remove the JSON to XML from the duplicated one and after I have processed the data I need I joined everything back together.
Sign In or Register to comment.