Due to recent updates, all users are required to create an Altair One account to login to the RapidMiner community. Click the Register button to create your account using the same email that you have previously used to login to the RapidMiner community. This will ensure that any previously created content will be synced to your Altair One account. Once you login, you will be asked to provide a username that identifies you to other Community users. Email us at Community with questions.

Enron Email Dataset

stevefarrstevefarr Member Posts: 93 Maven
edited November 2018 in Knowledge Base

http://www.cs.cmu.edu/~enron/

 

All you text miners - this is the classic dataset. This data was originally made public, and posted to the web, by the Federal Energy Regulatory Commission during its investigation.

 

Some young whippersnapper in the office asked me who Enron were recently - oh how time flies.

Tagged:

Answers

  • robinrobin Member Posts: 100 Guru

    How would you recommend reading this data set in? I have been playing with it for a number of years now and it has been sitting in my archives since 2015. 

     

    Problem I have is that each of the markers that could be used to define the fileds are present in the text as well. As an example the "To:" field would be one of the fields that one would want to extact from the data, however this filed is also present in the mail body. 

Sign In or Register to comment.