RapidMiner

How to correct spelling mistakes

Learner III jozeftomas_2020


Hi members of the forum,

I want to correct spelling mistakes using WordNet,
but I do not know how to set up my own process for this purpose.
I have a Twitter account,
and I want to correct the misspelled words in tweets.
Does anyone know how?
Thanks a lot

4 REPLIES
RM Certified Analyst

Re: How to correct spelling mistakes

Hi @jozeftomas_2020,

Spelling correction is a difficult topic in itself (should "thear" be corrected to "there" or to "their"? is "armour" intended, or is it a misspelling of "armoire"?), and its complexity comes from comparing each word with its possible counterparts. If you have the Python extension and Anaconda installed properly, you should take a look at Peter Norvig's spelling corrector write-up (it comes with code). That's what I would do, at least.
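
To give an idea of what that approach looks like, here is a minimal sketch in the spirit of Norvig's corrector, not his exact code. The tiny `CORPUS` string is an assumption made only so the example runs on its own; a real corrector needs word counts from a large text, and the function names are chosen for illustration.

```python
import re
from collections import Counter

# Toy corpus: an assumption for illustration only. A usable corrector needs
# word frequencies taken from a large body of text.
CORPUS = """
there is a house over there and their dog is in the house
they left their armour in the armoire and went there again
"""

WORDS = Counter(re.findall(r"[a-z]+", CORPUS.lower()))
LETTERS = "abcdefghijklmnopqrstuvwxyz"


def edits1(word):
    """Return every string that is one edit away from `word`."""
    splits = [(word[:i], word[i:]) for i in range(len(word) + 1)]
    deletes = [left + right[1:] for left, right in splits if right]
    transposes = [left + right[1] + right[0] + right[2:]
                  for left, right in splits if len(right) > 1]
    replaces = [left + c + right[1:]
                for left, right in splits if right for c in LETTERS]
    inserts = [left + c + right for left, right in splits for c in LETTERS]
    return set(deletes + transposes + replaces + inserts)


def known(words):
    """Keep only the candidates that appear in the corpus."""
    return {w for w in words if w in WORDS}


def correct(word):
    """Pick the most frequent known word within two edits, else the input."""
    word = word.lower()
    candidates = (known([word])
                  or known(edits1(word))
                  or known(e2 for e1 in edits1(word) for e2 in edits1(e1))
                  or {word})
    return max(candidates, key=lambda w: WORDS[w])


if __name__ == "__main__":
    for tweet_word in ["thear", "hous", "there"]:
        print(tweet_word, "->", correct(tweet_word))
```

With this toy corpus, `correct("thear")` returns "their", because "their" is one edit away while "there" needs two. That is exactly the kind of ambiguity mentioned above: the result depends entirely on the corpus and on how candidates are ranked.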

 

If, on the other hand, you want a list of spelling suggestions rather than automatic correction, I would use Levenshtein distance and/or phonetic encoding to find similar words.
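
As a rough illustration of that idea, the sketch below ranks dictionary words by Levenshtein distance and uses a Soundex code as a tie-breaker. Soundex is only one possible phonetic encoding and is my choice here, not the only option; the `suggest` helper, its `max_distance` parameter and the small vocabulary are hypothetical.

```python
# Soundex digit for each consonant; vowels, h, w and y have no digit.
SOUNDEX_CODES = {
    letter: digit
    for digit, letters in {"1": "bfpv", "2": "cgjkqsxz", "3": "dt",
                           "4": "l", "5": "mn", "6": "r"}.items()
    for letter in letters
}


def soundex(word):
    """Return the 4-character American Soundex code of `word`."""
    word = "".join(ch for ch in word.lower() if ch.isalpha())
    if not word:
        return ""
    code = word[0].upper()
    prev = SOUNDEX_CODES.get(word[0], "")
    for ch in word[1:]:
        digit = SOUNDEX_CODES.get(ch, "")
        if digit and digit != prev:
            code += digit
        if ch not in "hw":  # h and w do not separate repeated codes
            prev = digit
    return (code + "000")[:4]


def levenshtein(a, b):
    """Edit distance counting insertions, deletions and substitutions."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution
        prev = curr
    return prev[-1]


def suggest(word, vocabulary, max_distance=2):
    """Hypothetical helper: list close words, nearest and same-sounding first."""
    scored = []
    for candidate in vocabulary:
        distance = levenshtein(word.lower(), candidate.lower())
        if distance <= max_distance:
            same_sound = soundex(word) == soundex(candidate)
            scored.append((distance, not same_sound, candidate))
    return [candidate for _, _, candidate in sorted(scored)]


if __name__ == "__main__":
    vocabulary = ["there", "their", "armour", "armoire", "house"]
    print(suggest("thear", vocabulary))
```

For "thear" and this vocabulary, the list comes back as "their" followed by "there": both share the Soundex code T600 with the misspelling, but "their" is one edit away and "there" is two. The final choice is still left to a person or to extra context.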

 

Hope this helps,

 

Rodrigo Fuentealba Cartes
Senior Software Developer & Data Scientist at The Pegasus Group Company S. A. - Chile
https://www.pegasus.cl/
Learner III jozeftomas_2020

Re: How to correct spelling mistakes

Hello,
Thank you very much for your help.
I do not know Python;
I know just a little R.
I saw this site:

https://cran.r-project.org/web/packages/hunspell/vignettes/intro.html

Do you know how to use Twitter as a data source?

Also, about the algorithms you named: is there a program for them? Which operator should I use?
Please help.
Thank you

RM Certified Analyst

Re: How to correct spelling mistakes

Read what I sent you. It contains links regarding R programming too.

An old wise man's piece of advice: great data scientists know what to ask, when to ask and how to ask. You know the "what" part and that is a good beginning, but the "when" part should come after a bit of reading and trying to come up with a solution yourself, and that gives you the "how". If you haven't tried anything yet (I won't judge you, but it looks like that's the case), there is no boat for others to jump into and help you, and it looks like you are trying to get someone to do your job. You may think I'm being rude now, but if you follow my advice you will learn a lot and have fun in the process. It's for your own benefit, trust this old man.

Another thing: don't be afraid of code, it doesn't hurt. The fact that it's called Python doesn't mean it's venomous or that it will strangle you :) Read what I sent, try it, and if you get stuck in the middle, come back, mention me and I'll do my best to get you out of the mud.

Best wishes,
Rodrigo Fuentealba Cartes
Senior Software Developer & Data Scientist at The Pegasus Group Company S. A. - Chile
https://www.pegasus.cl/
Learner III jozeftomas_2020

Re: How to correct spelling mistakes

Hello friends,
I am not upset about what you said; I take it as experience and advice.
I understood the general idea of WordNet,
but I do not understand how to use it
(Synonyms, Hyponyms or Hypernyms).
Can I correct spelling with it?
Or can I use it for semantic clustering, that is, grouping documents with the same content?
I searched a lot but did not find an answer.
I am a beginner and need your help.
I loaded my data and applied stop word removal, tokenization and stemming, and I used WordNet, but I could not cluster the documents and I could not remove the spelling mistakes.
This is for my own project. I do not have time to learn Python, and on the other hand, I have to show my teacher what can be done with this program.
Thanks a lot