The Altair Community is migrating to a new platform to provide a better experience for you. The RapidMiner Community will merge with the Altair Community at the same time. In preparation for the migration, both communities are on read-only mode from July 15th - July 24th, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here.

Question regarding the feasibility of a Table data Extraction project with RM

pblack476pblack476 Member Posts: 83 Maven
edited February 2020 in Help
Would it be feasible with RM to extract tables from PDFs? I realize the PDFs might be converted to something else first but would it be possible with RM to run through the entire text of a financial report and identify table data and extract it to examplesets using RM?

I am thinking of trying it out but would like to hear from more seasoned people if they think it is reasonably feasible or if there is a hard wall along the way that I am not yet seeing.

Best Answer


  • Options
    pblack476pblack476 Member Posts: 83 Maven
    @kayman wow. that extension just does it perfectly. Thanks very much.
  • Options
    sgenzersgenzer Administrator, Moderator, Employee, RapidMiner Certified Analyst, Community Manager, Member, University Professor, PM Moderator Posts: 2,959 Community Manager
    kudos to RM Research team for that one  :)
Sign In or Register to comment.