
Big problem converting text to feature vectors

m_keshavarz_com Member Posts: 28 Contributor I
edited December 2018 in Help
Hello,

I have a thousand sentences. I want to turn the text into feature vectors using word2vec instead of TF-IDF, and then cluster the sentences based on their words, but I do not know how.

My process is attached. Right now, word2vec clusters the extracted words themselves, not the sentences those words appear in. What I want is to cluster the sentences.

How can I extract word vectors with word2vec and then cluster the sentences according to them?

Any help would be useful to me. Thanks.
Vc.zip 31.3K

Answers

  • MartinLiebig Administrator, Moderator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, University Professor Posts: 3,517 RM Data Scientist

    Hi,

    word2vec maps words to vectors, not sentences. To get sentence vectors, you need to aggregate the word vectors of each sentence, and this is not trivial to do.
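
As a rough illustration of one common aggregation, here is a minimal Python sketch that averages the word vectors of each sentence (mean pooling). It is not the RapidMiner process from the attachment. The `WORD_VECTORS` table and the `sentence_vector` helper are hypothetical names with made-up toy values; in practice the vectors would come from a trained word2vec model.

```python
from typing import Dict, List, Optional

# Hypothetical toy embeddings (assumption): real vectors would come from
# a trained word2vec model and have many more dimensions.
WORD_VECTORS: Dict[str, List[float]] = {
    "cats": [1.0, 0.0],
    "dogs": [0.5, 0.0],
    "stocks": [0.0, 1.0],
    "bonds": [0.0, 0.5],
}


def sentence_vector(
    sentence: str, vectors: Dict[str, List[float]]
) -> Optional[List[float]]:
    """Average the vectors of all known words in the sentence.

    Returns None when no word of the sentence has a vector.
    """
    tokens = sentence.lower().split()
    known = [vectors[t] for t in tokens if t in vectors]
    if not known:
        return None
    dim = len(known[0])
    # Component-wise mean over the word vectors found in the sentence.
    return [sum(v[i] for v in known) / len(known) for i in range(dim)]


sentences = ["Cats and dogs", "Stocks and bonds", "Nothing known here"]
sentence_vectors = [sentence_vector(s, WORD_VECTORS) for s in sentences]
```

Each sentence that yields a vector can then be passed to a clustering step such as k-means in place of its TF-IDF row. Plain averaging ignores word order and gives frequent words the same weight as rare ones, which is part of why the aggregation is not trivial.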

     

    BR,

    Martin

    - Sr. Director Data Solutions, Altair RapidMiner -
    Dortmund, Germany