Access e-mails with the Text Processing Extension

David_ADavid_A Moderator, Employee, RMResearcher, Member Posts: 177  RM Research
edited December 2018 in Knowledge Base

Nowadays e-mails might be the most common source of text you encounter every day. They are also a common source of data for text mining applications, e.g., spam detection or sentiment analysis.

Having a direct access to an e-mail account and processing mails automatically can be a great boon when putting text mining tasks into production. New incoming mails can be processed and their content scored, which can even trigger more reactions.


Read Documents (Mail) 

The Text Processing Extension available at the RapidMiner marketplace includes an Operator that allows exactly that:

Read Documents (Mail).

It provides access using the common IMAP protocoll or the older POP3 standard. All you need to know is the host address, the log-in data and the used protocol. Also quite important is the folder parameter, as you seldom want to download the complete inbox folder.

 

Connection properties

Unfortunately it is not always as simple as it could be. Some e-mail provider requires you to set additional parameters to access their services and it is sometimes hidden at their sites.

Below are the settings for the two most common providers: Microsoft Outlook (Office365) and Gmail.


Outlook 365 

 

Parameter settings:

 

 Outlook_parameterspng

Connection properties:

ReadDocuments_Outlook_propertiespng

 

Gmail

Parameter settings:

Gmail_parameterspng

 

Connection properties:

ReadDocuments_GMail_propertiespng

 

 

Similar is the Process Documents from Mail Store Operator, which requires the same properties settings. Whith this Operator it is possible to directly work on single e-mails. The Operator also allows to download attached files

DocMusherTelcontar120yyhuangEdin_KlapicPavithra_RaoBalazsBarany

Comments

  • osadasliitosadasliit Member Posts: 3 Contributor I

    Hi 

    I am new to rapidminer how do you access Gmail parameter settings forms. in send mail operator only provide 

    to 

    supbject

    html check box

    body text

    headers 

     

    Gmail

    Parameter settings:  form

  • osadasliitosadasliit Member Posts: 3 Contributor I

    Hi 

    if I want to send mail (not read mail) how do we setup smtp information in rapidminer studio for gmail let say we use send mail operator 

    to 

    supbject

    html check box

    body text

    headers 

     

    thanks

     

Sign In or Register to comment.