Get Pages - Connection Reset Error
I am using the Get Pages operator to process the home page URLs of roughly 4,000 websites listed in an Excel file. I have received the following message several times:
Could Not Read Document
Reason: Connection Reset
I cannot figure out what is causing this. I ran the process on a subset of the URLs (~200) without any problems, so I know the process itself works. I also tried it with 1,500 URLs and ran into the same problem.
What is causing this? The only thing I have been able to find was a Stack Overflow post about a similar SQL error.
The post basically said that this is a Java-based error indicating that the connection between your Java process and the database service has been lost. It says this can happen for many different reasons, including a lost network connection or a shortage of available resources, among others.
This seems like a vague error code. How do I troubleshoot what is causing it? It happens repeatedly, so it seems like an environment, resource, or data problem. The only causes I can come up with are:
- Internet connection problems - I ran it both in the office and at home to test for this. The error appeared multiple times in both locations.
- Data issue - Something odd about the data is causing the system to lose the connection, maybe by exceeding read time limits.
- Community License Restriction - The license has some sort of resource restriction that causes a connection reset. This seems odd, since I have come across other, more specific licensing errors in the past.
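One way I could imagine separating these causes is to replay the same URL list outside the Get Pages operator and log which sites fail and how. Below is a rough sketch of that idea, not anything taken from the tool itself: it uses Python's standard `urllib`, and the helper names `classify_error`, `check_urls`, and `fetch_with_urllib`, the 10-second timeout, and the delay between requests are all my own assumptions. If the same handful of sites reset the connection here too, that would point to those servers (the data) rather than the license or my network.

```python
import socket
import time
import urllib.error
import urllib.request


def classify_error(exc):
    """Turn an exception into a short label for the log (hypothetical helper)."""
    if isinstance(exc, ConnectionResetError):
        return "connection reset"
    if isinstance(exc, (socket.timeout, TimeoutError)):
        return "timeout"
    # HTTPError is a subclass of URLError, so it has to be checked first.
    if isinstance(exc, urllib.error.HTTPError):
        return f"http {exc.code}"
    if isinstance(exc, urllib.error.URLError):
        # URLError keeps the underlying socket error in .reason
        if isinstance(exc.reason, ConnectionResetError):
            return "connection reset"
        if isinstance(exc.reason, (socket.timeout, TimeoutError)):
            return "timeout"
        return f"url error: {exc.reason}"
    return f"other: {type(exc).__name__}"


def check_urls(urls, fetch, delay_seconds=0.0):
    """Call fetch(url) for every URL and record 'ok' or the failure label.

    One failing site does not stop the run, so the full list is always covered.
    """
    results = []
    for url in urls:
        try:
            fetch(url)
            results.append((url, "ok"))
        except Exception as exc:
            results.append((url, classify_error(exc)))
        # Optional pause so the requests are not fired back to back.
        time.sleep(delay_seconds)
    return results


def fetch_with_urllib(url, timeout=10):
    """Download one page with the standard library; the timeout is an assumed value."""
    with urllib.request.urlopen(url, timeout=timeout) as response:
        response.read()


# Example call (needs network access; the URLs would come from the Excel file):
# for url, status in check_urls(["https://example.com"], fetch_with_urllib, 0.5):
#     print(url, status)
```

Comparing the failing URLs across two or three runs should show whether the resets follow particular sites or appear at random once the list gets long.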
Any information that you could provide would be much appreciated.