Boilerplate text analysis - text mining
Dear community
I'm new to text mining with RM and would like to know whether it is possible to build a process in RM that suits my research question. I would like to create a process that searches for boilerplate language in documents.
In detail, I'd like to input management reports from different companies (PDF files) and compare them regarding their use of boilerplate language or templates, i.e. whether they use the same sentences or passages in their reports, only with different numbers or years.
I would really appreciate any ideas.
Thank you very much in advance!
Regards
Mo
These are the reports of one company for the years 2017 to 2019. I would like to compare them with regard to their use of boilerplate language.
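To make the idea concrete, here is a minimal Python sketch of the comparison described above. It is not a RapidMiner process, and it assumes the text has already been extracted from the PDF files into plain strings. The function names (`split_sentences`, `normalize`, `shared_boilerplate`), the `<NUM>` placeholder, and the `min_words` threshold are all hypothetical choices made for illustration. The sketch replaces numbers and years with a placeholder, so that sentences differing only in their figures become identical, and then reports the sentences that two reports share.

```python
import re


def split_sentences(text):
    # Naive splitter: break after '.', '!' or '?' when followed by whitespace.
    # Decimal numbers such as "5.2" are not split, because no whitespace follows the dot.
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def normalize(sentence):
    # Lowercase, then replace every number (including "1,000" or "5.2")
    # with a placeholder so that only the wording is compared.
    s = re.sub(r"\d+(?:[.,]\d+)*", "<NUM>", sentence.lower())
    # Collapse repeated whitespace left over from PDF extraction.
    return re.sub(r"\s+", " ", s).strip()


def shared_boilerplate(doc_a, doc_b, min_words=5):
    # Sentences that appear in both documents once numbers are masked.
    # Very short sentences are skipped because they match too easily.
    norm_a = {normalize(s) for s in split_sentences(doc_a)}
    norm_b = {normalize(s) for s in split_sentences(doc_b)}
    return {s for s in norm_a & norm_b if len(s.split()) >= min_words}


if __name__ == "__main__":
    # Made-up example sentences, not taken from any real report.
    report_a = "Revenue increased by 5.2 percent in 2017. We opened a new plant in Ohio."
    report_b = "Revenue increased by 3.1 percent in 2018. Our CEO retired in spring."
    for sentence in sorted(shared_boilerplate(report_a, report_b)):
        print(sentence)
```

Exact matching after masking only catches sentences reused word for word. Passages that were lightly reworded would need a fuzzier comparison, for example word n-gram overlap or a similarity measure between sentences.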