Due to recent updates, all users are required to create an Altair One account to login to the RapidMiner community. Click the Register button to create your account using the same email that you have previously used to login to the RapidMiner community. This will ensure that any previously created content will be synced to your Altair One account. Once you login, you will be asked to provide a username that identifies you to other Community users. Email us at Community with questions.
citation parsing
hi
I would like to citation analysis and of each string citation the name of author, title, date, etc. to pay.
But I do not know from what operators and how do I use.
I got used to extract information operator, but only the first string is extracted from the file.
Help me please
0
Answers
Hi neda,
can you get the citation as bibtex and use Read BibTex?
~Martin
Dortmund, Germany
I have a file with this content
(<author> A. Cau, R. Kuiper, and W.-P. de Roever. </author> <title> Formalising Dijkstra's development strategy within Stark's formalism. </title> <editor> In C. B. Jones, R. C. Shaw, and T. Denvir, editors, </editor> <booktitle> Proc. 5th. BCS-FACS Refinement Workshop, </booktitle> <date> 1992. </date><author> M. Kitsuregawa, H. Tanaka, and T. Moto-oka. )
Of course this is a big file and i have use svm and crf for Assessment file And to compare the two methods together.
But do not know how to do that !!!!!
Hi Neda,
have you tried to read it in with Read XML?
~martin
Dortmund, Germany
hi
type of my file is txt and i dont know how set operate read xml.
i convert my file with html and set xpath for attribute:
//author
//title
//date
and ....
but xpath for exampel????
then set xml how i do???
Dear Neda,
your file format is very similar to XML. if you replace the first ( with <xml> and the last ) with </xml> it might be possible to read it in. However your posted file has the problem, that there are two authors. This is of course a bit strange.
Another way to read it in would be a parsing in RM. Please have a look at the attached process. You can built similar things with the Process Documents from Files operator to parse all your files.
~
Martin
Dortmund, Germany
hi
i undrestanf for citation parsing should use of svm^structer. but this operator is'nt on list of operator. do you know how add this alogoritm to rapidminer?
tanks
Dear Neda,
are you referring to a struct SVM: https://www.cs.cornell.edu/people/tj/svm_light/svm_struct.html ? If so you really take out the big guns first. The struct svm is possibly the most complex solution you could think about.
I know that Katharina Morik from the CS chair in dortmund had some eye on the topic, but I think there is no integration into RM yet. You would need to integrate it using Java yourself.
~Martin
Dortmund, Germany