The Altair Community is migrating to a new platform to provide a better experience for you. The RapidMiner Community will merge with the Altair Community at the same time. In preparation for the migration, both communities are on read-only mode from July 15th - July 24th, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here.
Options

How to Segment Text Effectively Using TextTiling with Python 3.x

koushikdeb2009koushikdeb2009 Member Posts: 2 Contributor I
edited December 2018 in Help

I have written these line of codes

 

f = open("1.ref","r")
alice=f.read()
tt = nltk.tokenize.TextTilingTokenizer()
tiles = tt.tokenize(alice[0:2000])
print(tiles) #Total text in a single valued list

I need to consider full text.If I omit [0:2000] ,then getting an error

TypeError: slice indices must be integers or None or have an index method

Spoiler
 

While printing tiles ,I am getting full text.I need to show segmented text.

Answers

  • Options
    Thomas_OttThomas_Ott RapidMiner Certified Analyst, RapidMiner Certified Expert, Member Posts: 1,761 Unicorn

    @koushikdeb2009 can you post the XML process using the </> button? It will be helpful to see how you formed the python script in Execute Python operator. 

  • Options
    koushikdeb2009koushikdeb2009 Member Posts: 2 Contributor I

    There is no XML process.Its program to implement Text Tile process.If there is any code sample to implement texttiling properly with python 3.x ,then please send the link.It will be great help to my project.

  • Options
    Thomas_OttThomas_Ott RapidMiner Certified Analyst, RapidMiner Certified Expert, Member Posts: 1,761 Unicorn

    @koushikdeb2009 we usually see python executed in RapidMiner via the Execute Python operator, hence me asking if you can post your XML process. If you're strictly working in python, then I'd suggest visiting a forum related to Text Tiling. 

Sign In or Register to comment.