The Altair Community is migrating to a new platform to provide a better experience for you. The RapidMiner Community will merge with the Altair Community at the same time. In preparation for the migration, both communities are on read-only mode from July 15th - July 24th, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here.
Options

Is there an operator to include only listed text?

amytamyt Member Posts: 1 Newbie
edited June 2019 in Help
Hello, I am new to RapidMiner. I have used the Filter Stopwords (dictionary) to exclude. Is there an operator that will do the opposite? Create a list of words that you want to identify and exclude everything else?
Thank you

Answers

  • Options
    MarlaBotMarlaBot Administrator, Moderator, Employee, Member Posts: 57 Community Manager
    Hi @amyt - this is MarlaBot. I found these great videos on our RapidMiner Academy that you may find helpful:
    Instructional Video: Text Association Rules (Viewing time: ~10m)
    Instructional Video: Loading Text into RapidMiner (Viewing time: ~6m)
    Please LIKE my comment if it helps! 👇

    MarlaBot <3
  • Options
    kaymankayman Member Posts: 662 Unicorn
    Sort of. If you have a wordlist you like to focus upon you just add this in to the 'wor' port of your 'process documents from data' operator.

    This will only take the words in your list into consideration so you could consider this a filter.
  • Options
    MartinLiebigMartinLiebig Administrator, Moderator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, University Professor Posts: 3,525 RM Data Scientist
    Hi @amyt
    Or use the word list for Filter Tokens (Example Set) and invert the filter. That was the reason i've added this invert filter option - to allow white listings.

    BR,
    Martin
    - Sr. Director Data Solutions, Altair RapidMiner -
    Dortmund, Germany
Sign In or Register to comment.