text mining using in execute python

hmhsinghmhsing Member Posts: 29 Maven
edited December 2018 in Help

I hope to input a text file into Execute Python and do word segmentation. The python scripts is as following.  The result shows parsing failed, can anyone give me some suggestion? Thanks!  

 

import pandas
import jieba
import csv

def rm_main():
    ff = open("C:/Users/user/Desktop/speech.txt").read()
    seglist = jieba.cut(ff, cut_all=False)

f = open("text.csv","w") 
w = csv.writer(f) 
w.writerows(seglist) 
f.close()

Best Answer

  • Thomas_OttThomas_Ott RapidMiner Certified Analyst, RapidMiner Certified Expert, Member Posts: 1,761 Unicorn
    Solution Accepted

    Are you working in a Windows environment? If so then you Pandas and your Jeiba module need to be installed under an Anaconda distribution. Verify that.

Answers

  • MartinLiebigMartinLiebig Administrator, Moderator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, University Professor Posts: 3,497 RM Data Scientist

    Hi,

    please have a look into the log window. There should be more information on the error.

    Best,

    Martin

    - Sr. Director Data Solutions, Altair RapidMiner -
    Dortmund, Germany
  • hmhsinghmhsing Member Posts: 29 Maven

    The error message is:

    The script could not be parsed. Please check your Python script:ModuleNotFoundError:No module named 'jieba'

    I have already installed jieba package, and similar coding works in Python.

    Thanks!  

Sign In or Register to comment.