The Altair Community is migrating to a new platform to provide a better experience for you. The RapidMiner Community will merge with the Altair Community at the same time. In preparation for the migration, both communities are on read-only mode from July 15th - July 24th, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here.
Options

extract sentences and relate to a tag

cliors16cliors16 Member Posts: 1 Newbie
Hi. First of all I introduce myself, my name is Carlos from Colombia. I'm new to Rapidminer and I'm not very good at English either.

I thank who can help me.

I have a data set with two columns. The first column contains texts of labor profiles. The second column contains the salary.

I would like to create a model with RapidMiner to extract the most recurring job profiles, but I don't want keywords, but phrases or sentences. On the other hand, I would like to relate the results obtained with the salary (this could be done through a linear regression model, I think).

Somebody could help me?

Answers

  • Options
    Pavithra_RaoPavithra_Rao Administrator, Moderator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, Member Posts: 123 RM Data Scientist
    Hi @cliors16

    You could use the 'Process document operator' to create a word vector.
    Word vector gives the table of all the words in the documents along with its frequency of the number of times each of this word appears in each document.

    Here's a quick tutorial use case, on how to convert text into a dataset, that can be further used for modeling
    https://academy.rapidminer.com/learn/video/applying-a-model-to-categorize-documents

    Hope this helps.

    Cheers,
    Pavithra
Sign In or Register to comment.