RapidMiner

Contributor II carl
Contributor II

Sourcing text mining data from a web search page or Kindle account

Is it possible to use, say, a newspaper search page (e.g. http://www.thetimes.co.uk/search?) to pull in all the full articles as a data source for text mining?  And is it possible to pull in the full text of all purchased Kindle books from ones Kindle account?  If so, what would be the Extension options to enable this?

2 REPLIES
RM Staff
RM Staff
Solution

Re: Sourcing text mining data from a web search page or Kindle account

Hi Carl,

 

i do not think that you can do this on the kindle books. It might be possible to read EPUB ebooks somehow, but I am not sure.

 

For the page. There are some ways. The built in web crawler of Web Mining extension is able to do some things, but it's not the easiest way to do. The other options are:

- Mozenda Extension

- (Maybe) Zapier 

 

Aylien also provides a News API which might be helpful for you.

 

~Martin

--------------------------------------------------------------------------
Head of Data Science Services at RapidMiner
Learner I 3AlphaDataEntry
Learner I

Re: Sourcing text mining data from a web search page or Kindle account

Hi there, 

I don't think you can mine the data of kindle books. Yes, building web crawler of web mining extension is a way but is is difficult task to do.

Although there are many ways for web mining, it is always preferrable to take help of professioanls to ensure that your output is accurate. 

Twitter Feed