🎉 🎉 RAPIDMINER 9.10 IS OUT!!! 🎉🎉

Download the latest version helping analytics teams accelerate time-to-value for streaming and IIOT use cases.

CLICK HERE TO DOWNLOAD

text mining using in execute python

hmhsinghmhsing Member Posts: 23  Maven
edited December 2018 in Help

I hope to input a text file into Execute Python and do word segmentation. The python scripts is as following.  The result shows parsing failed, can anyone give me some suggestion? Thanks!  

 

import pandas
import jieba
import csv

def rm_main():
    ff = open("C:/Users/user/Desktop/speech.txt").read()
    seglist = jieba.cut(ff, cut_all=False)

f = open("text.csv","w") 
w = csv.writer(f) 
w.writerows(seglist) 
f.close()

Best Answer

  • Thomas_OttThomas_Ott RapidMiner Certified Analyst, RapidMiner Certified Expert, Member Posts: 1,760   Unicorn
    Solution Accepted

    Are you working in a Windows environment? If so then you Pandas and your Jeiba module need to be installed under an Anaconda distribution. Verify that.

    hmhsing

Answers

  • mschmitzmschmitz Administrator, Moderator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, University Professor Posts: 3,085  RM Data Scientist

    Hi,

    please have a look into the log window. There should be more information on the error.

    Best,

    Martin

    - Head of Data Science Services at RapidMiner -
    Dortmund, Germany
  • hmhsinghmhsing Member Posts: 23  Maven

    The error message is:

    The script could not be parsed. Please check your Python script:ModuleNotFoundError:No module named 'jieba'

    I have already installed jieba package, and similar coding works in Python.

    Thanks!  

Sign In or Register to comment.