basic xpath problem
Hello,
I'm trying to extract data with xpath from an html page.
I have:
Create Document => Extract Information
Create Document:
<html>
<head>
<title>TITLE</title>
</head>
<body>BODY</body>
</html>
Extract Information configured with:
query type: xpath
attribute type: nominal
xpath queries: //title
namespace: n/a
ignore CDATA: true
assume html: true
Result:
attribute name: ?
What am I doing wrong? >:(
Answers
I can't use a plain path like //title. I have to use, for example:
//h:title/text()
text() extracts only the text from the title tag,
and I have to use the h: prefix because the document is HTML, right?
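For anyone who wants to see the namespace effect outside RapidMiner, here is a minimal sketch in Python using only the standard library. It is not RapidMiner's implementation. It assumes the parsed page ends up in the XHTML namespace (which would explain why the h: prefix is needed), so the test document declares that namespace explicitly. ElementTree does not support text() in paths, so the sketch reads the .text attribute instead.

```python
import xml.etree.ElementTree as ET

# Assumption: the HTML is treated as XHTML, so every element lives in this namespace.
XHTML_NS = "http://www.w3.org/1999/xhtml"

doc = """<html xmlns="http://www.w3.org/1999/xhtml">
<head>
<title>TITLE</title>
</head>
<body>BODY</body>
</html>"""

root = ET.fromstring(doc)

# Map the prefix "h" to the XHTML namespace, mirroring the h: prefix in the query.
ns = {"h": XHTML_NS}

# An unprefixed path looks for elements in no namespace, so it finds nothing.
unprefixed = root.findall(".//title")

# The prefixed path matches the namespaced title element.
prefixed = root.findall(".//h:title", ns)

# Equivalent of text(): take the element's text content.
title_text = prefixed[0].text

print(len(unprefixed), len(prefixed), title_text)
```

The unprefixed query coming back empty is the same symptom as the "?" result above: the element exists, but under a namespaced name.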