Due to recent updates, all users are required to create an Altair One account to login to the RapidMiner community. Click the Register button to create your account using the same email that you have previously used to login to the RapidMiner community. This will ensure that any previously created content will be synced to your Altair One account. Once you login, you will be asked to provide a username that identifies you to other Community users. Email us at Community with questions.

Wordnet

rfeigelrfeigel Member Posts: 18 Contributor II
edited November 2018 in Help
Is Wordnet supported in v. 6.5? If so, does anyone have clear, simple instructions for how to install it. I've installed it and pointed the dictionary to a folder where I've unzipped Wordnet 3.0. When I run it against a simple text file, I get no results or error messages. Any help is appreciated.

Answers

  • JEdwardJEdward RapidMiner Certified Analyst, RapidMiner Certified Expert, Member Posts: 578 Unicorn
    Can you share your process for how you connected? 
    It works for me, was just using it 30 minutes ago alongside Extract Sentiment, Aylien, & Python (for Chinese) to create a hybrid multi-lingual sentiment classification system. 
    My next step will be adding ApacheSolr for improved tokenization, Moses http://www.statmt.org/moses/ for a fallback 'translate to English' sentiment analysis and moving the Python script into a Java operator so it works faster. 
  • rfeigelrfeigel Member Posts: 18 Contributor II
    Here's the script:

    <?xml version="1.0" encoding="UTF-8" standalone="no"?>
    <process version="6.5.002">
      <context>
        <input/>
        <output/>
        <macros/>
      </context>
      <operator activated="true" class="process" compatibility="6.5.002" expanded="true" name="Process">
        <parameter key="logverbosity" value="init"/>
        <parameter key="random_seed" value="2001"/>
        <parameter key="send_mail" value="never"/>
        <parameter key="notification_email" value=""/>
        <parameter key="process_duration_for_mail" value="30"/>
        <parameter key="encoding" value="SYSTEM"/>
        <process expanded="true">
          <operator activated="true" class="wordnet:open_wordnet_dictionary" compatibility="5.3.000" expanded="true" height="60" name="Open WordNet Dictionary" width="90" x="112" y="210">
            <parameter key="resource_type" value="directory"/>
            <parameter key="directory" value="F:\Data\Rapidminer 6.5 Repository\Wordnet 3\dict"/>
            <parameter key="repository_folder" value="//6_5 Repository/Wordnet 3/dict/"/>
          </operator>
          <operator activated="true" class="text:read_document" compatibility="6.5.000" expanded="true" height="60" name="Read Document" width="90" x="112" y="75">
            <parameter key="file" value="C:\Users\User\Desktop\Test.txt"/>
            <parameter key="extract_text_only" value="true"/>
            <parameter key="use_file_extension_as_type" value="true"/>
            <parameter key="content_type" value="txt"/>
            <parameter key="encoding" value="SYSTEM"/>
          </operator>
          <operator activated="true" class="wordnet:find_synonym_wordnet" compatibility="5.3.000" expanded="true" height="76" name="Find Synonyms (WordNet)" width="90" x="313" y="120">
            <parameter key="use_prefix" value="true"/>
            <parameter key="synset_word_prefix" value="syn:"/>
            <parameter key="maximum_recursion_depth" value="1"/>
            <parameter key="multiple_meanings_per_word_policy" value="Take all meanings per token"/>
            <parameter key="multiple_synsets_policy" value="Take only first synset per meaning"/>
            <parameter key="multiple_synset_words_policy" value="Take only first synset word"/>
            <parameter key="concatenation" value="Concatenate result per synset"/>
            <parameter key="keep_original_tokens" value="false"/>
            <parameter key="keep_unmatched_tokens" value="false"/>
            <parameter key="take_ID_instead_of_words" value="false"/>
            <parameter key="work_on_type_noun" value="true"/>
            <parameter key="work_on_type_verb" value="true"/>
            <parameter key="work_on_type_adjective" value="true"/>
            <parameter key="work_on_type_adverb" value="true"/>
          </operator>
          <connect from_op="Open WordNet Dictionary" from_port="dictionary" to_op="Find Synonyms (WordNet)" to_port="dictionary"/>
          <connect from_op="Read Document" from_port="output" to_op="Find Synonyms (WordNet)" to_port="document"/>
          <connect from_op="Find Synonyms (WordNet)" from_port="document" to_port="result 1"/>
          <portSpacing port="source_input 1" spacing="0"/>
          <portSpacing port="sink_result 1" spacing="0"/>
          <portSpacing port="sink_result 2" spacing="0"/>
        </process>
      </operator>
    </process>
Sign In or Register to comment.