Due to recent updates, all users are required to create an Altair One account to login to the RapidMiner community. Click the Register button to create your account using the same email that you have previously used to login to the RapidMiner community. This will ensure that any previously created content will be synced to your Altair One account. Once you login, you will be asked to provide a username that identifies you to other Community users. Email us at Community with questions.

extract sentences and relate to a tag

cliors16cliors16 Member Posts: 1 Learner I
Hi. First of all I introduce myself, my name is Carlos from Colombia. I'm new to Rapidminer and I'm not very good at English either.

I thank who can help me.

I have a data set with two columns. The first column contains texts of labor profiles. The second column contains the salary.

I would like to create a model with RapidMiner to extract the most recurring job profiles, but I don't want keywords, but phrases or sentences. On the other hand, I would like to relate the results obtained with the salary (this could be done through a linear regression model, I think).

Somebody could help me?

Answers

  • Pavithra_RaoPavithra_Rao Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, Member Posts: 123 RM Data Scientist
    Hi @cliors16

    You could use the 'Process document operator' to create a word vector.
    Word vector gives the table of all the words in the documents along with its frequency of the number of times each of this word appears in each document.

    Here's a quick tutorial use case, on how to convert text into a dataset, that can be further used for modeling
    https://academy.rapidminer.com/learn/video/applying-a-model-to-categorize-documents

    Hope this helps.

    Cheers,
    Pavithra
Sign In or Register to comment.