Due to recent updates, all users are required to create an Altair One account to login to the RapidMiner community. Click the Register button to create your account using the same email that you have previously used to login to the RapidMiner community. This will ensure that any previously created content will be synced to your Altair One account. Once you login, you will be asked to provide a username that identifies you to other Community users. Email us at Community with questions.

Read PDF Error BouncyCastleProvider

johan_CGjohan_CG Member Posts: 19 Contributor II
edited August 2019 in Help
Hi everybody,

I've got an unknow error when I run my RapidMiner process on some folder.
To make a long story short, the goal of my process can be describe as following:
 Starting points:
  • Excel file .xls with "Open File"
  • the path of a directory to scan with "Set macro"
Outputs of the process
  • a PDF report
  • several ExampleSet
The process counts words, keywords, calculates ratios ...
It works very well on some folders and on others I've got the following pop-up message:

Process failed
org/bouncycastle/jce/provider/BouncyCastleProvider
Reason: org.bouncycastle.jce.provider.BouncyCastleProvider
The process works on HTML and PDF files but in all folder there are also xls, csv, doc.

Can somebody help me please?
Thanks in advance
Johan

P.S.: I can't put the process because it's too long.
Tagged:

Answers

  • johan_CGjohan_CG Member Posts: 19 Contributor II
    Hi Everybody,

    I've got some new information about my problem.
    The error occurs when I try to analyze PDF which is "protected".

    Can somebody help me to solve this problem?

    Thanks in advance
    Johan
  • Marco_BoeckMarco_Boeck Administrator, Moderator, Employee, Member, University Professor Posts: 1,996 RM Engineering
    Hi,

    unfortunately protected PDFs are currently not supported by the Text Extension, so you will have to make sure there are no protected PDFs in a folder you're trying to process.
    Thank you for reporting it however, I created an internal ticket for this issue.

    Regards,
    Marco
  • johan_CGjohan_CG Member Posts: 19 Contributor II
    Hi Marco,

    Thank you for your reply.
    I found a small freeware to convert PDFs to Text and runs on protected also.
    http://www.colorpilot.com/extract-pdf-text.html
    Moreover this software is open source, maybe it can be usefull for a futur release of Text Processing Extension  ;)

    Regards,
    Johan
Sign In or Register to comment.