Due to recent updates, all users are required to create an Altair One account to login to the RapidMiner community. Click the Register button to create your account using the same email that you have previously used to login to the RapidMiner community. This will ensure that any previously created content will be synced to your Altair One account. Once you login, you will be asked to provide a username that identifies you to other Community users. Email us at Community with questions.
HR sourcing
I have two excel files containing profiles of the employees. First is employees data and second file contain position we want to fill.
Excel file one
Id name skills department language exp
Excel file two positions
skills department language exp
I want to match positions with employee profile
Like this
Id name skills department language
1 a x fin eng
exp
3 years similarity index .30
Excel file one
Id name skills department language exp
Excel file two positions
skills department language exp
I want to match positions with employee profile
Like this
Id name skills department language
1 a x fin eng
exp
3 years similarity index .30
0
Best Answer
-
lionelderkrikor RapidMiner Certified Analyst, Member Posts: 1,195 Unicorn
Hi @asn4293,
Executed with the fictive exampleset, I supplied, and the 4 attributes, you have done, I have similarities between 0 and 1
but how said in my previous post, there are errors in the results, and for the moment I don't know why.
Regards,
Lionel
1
Answers
Hi @asn4293,
If I good understood, your second Excel file is empty and you want to fill it with the information(s) of the first one :
the second Excel file is equal to the first Excel file but there are not the Id ,and the name ?
Regards,
Lionel
Hi again @asn4293,
Can you share some of your data of your 2 Excel files, please ?
NB : Anonymize the data if it's sensitive subject.
Regards,
Lionel
Hi again @asn4293,
A first element of response :
I used the Cross Distance operator with CosineSimilarity as numerical measure.
Here a process with the Id of the employee, the Id of the position (I created an Id for the position) and the similarity between the employee and the position :
and here you can find my fictive example set :
https://drive.google.com/open?id=1k0HvIMv-cti_UaHu_jZjcXQzygfiG56V
I hope it helps,
Regards,
Lionel
Hi again @asn4293,
Don't consider the process, I share it too fast, without doing all the checks.
Although I think this is the right method, this process is giving false results :
- The first example of the "Employees" example set has always a similarity of 1
- The similarities associated to the "Employees" don't change when we change the caracteristics of the "Position"
If someone has an idea....
Here the process :
Thanks you,
Regards,
Lionel
Hi @lionelderkrikor
I was Able to find out the solution as mentioned by you by finetuning the model using nominal measures and using simple matching similarity in Cross Differences Operator.
Which gives me follwing results: