How to Segment Text Effectively Using TextTiling with Python 3.x

koushikdeb2009koushikdeb2009 Member Posts: 2 Contributor I
edited December 2018 in Help

I have written these line of codes

 

f = open("1.ref","r")
alice=f.read()
tt = nltk.tokenize.TextTilingTokenizer()
tiles = tt.tokenize(alice[0:2000])
print(tiles) #Total text in a single valued list

I need to consider full text.If I omit [0:2000] ,then getting an error

TypeError: slice indices must be integers or None or have an index method

Spoiler
 

While printing tiles ,I am getting full text.I need to show segmented text.

Answers

  • Thomas_OttThomas_Ott RapidMiner Certified Analyst, RapidMiner Certified Expert, Member Posts: 1,761 Unicorn

    @koushikdeb2009 can you post the XML process using the </> button? It will be helpful to see how you formed the python script in Execute Python operator. 

  • koushikdeb2009koushikdeb2009 Member Posts: 2 Contributor I

    There is no XML process.Its program to implement Text Tile process.If there is any code sample to implement texttiling properly with python 3.x ,then please send the link.It will be great help to my project.

  • Thomas_OttThomas_Ott RapidMiner Certified Analyst, RapidMiner Certified Expert, Member Posts: 1,761 Unicorn

    @koushikdeb2009 we usually see python executed in RapidMiner via the Execute Python operator, hence me asking if you can post your XML process. If you're strictly working in python, then I'd suggest visiting a forum related to Text Tiling. 

Sign In or Register to comment.