Algorithm/Lib for analysing references in scientific documents needed

Question

JochenM 0 Newbie Poster

17 Years Ago

Hi folks,

I need a library to analyse the references in a scientific document. The library should be able to identify citations in the full text (for instance [1], [2], ... or Author A (1995), ... Author B & C (1968), ...), and it should be able to identify the elements of each entry in the reference list. For instance, if the reference list looks like this:

...
Smith, J. 1982, A new method for reference analysing, Journal of Information Technology, vol. 23, no. 5, pp. 234-238.
...

the library/algorithm should return the elements of that entry as, for instance, an array/list/...

Do libraries like this exist in C++, Java, C# or whatever? I would also pay money for it if necessary.

Best regards
Jochen
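
For illustration, the in-text patterns mentioned in the question ([1], [2], ... and Author (1995), Author B & C (1968)) can be approximated with regular expressions. This is only a minimal Python sketch, not an existing library: the names NUMERIC, AUTHOR_YEAR and find_citations are made up for this example, and it assumes that each author is a single capitalised surname and that years have four digits. Real documents will contain citation styles it misses.

```python
import re

# Numeric markers such as [1] or [2, 5]; ranges like [3-7] are accepted too.
NUMERIC = re.compile(r"\[(\d+(?:\s*[,-]\s*\d+)*)\]")

# Author-year markers such as "Smith (1995)" or "Miller & Jones (1968)".
# Assumption: every author is one capitalised word of two or more letters.
AUTHOR_YEAR = re.compile(
    r"\b([A-Z][A-Za-z'-]+(?:\s+(?:&|and)\s+[A-Z][A-Za-z'-]+)?)\s+\((\d{4})[a-z]?\)"
)


def find_citations(text):
    """Return (kind, matched_text) pairs in order of appearance."""
    hits = []
    for m in NUMERIC.finditer(text):
        hits.append((m.start(), "numeric", m.group(0)))
    for m in AUTHOR_YEAR.finditer(text):
        hits.append((m.start(), "author-year", m.group(0)))
    # Sort by position so both citation styles come back interleaved.
    return [(kind, matched) for _, kind, matched in sorted(hits)]


if __name__ == "__main__":
    sample = "As shown in [1] and [2, 5], Smith (1995) and Miller & Jones (1968) disagree."
    for kind, matched in find_citations(sample):
        print(kind, matched)
```

Two separate patterns are used so that each citation style can be tuned, or switched off, independently.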

algorithm c++

All 5 Replies

Salem 5,265 Posting Sage

17 Years Ago

What format(s) of documents are you expected to deal with?
- DOC
- PDF
- etc etc

ithelp 757 Posting Virtuoso

17 Years Ago

The format is important; otherwise, how will you know that "Smith" is a name and not part of the title itself? You need to specify the format first and then parse the documents to extract all the information required.

JochenM 0 Newbie Poster · Answer 1 · 2008-05-21T15:22:31+00:00

The format actually doesn't really matter. I would prefer PDF, but if it's DOC, plain text or whatever, I will find a way to convert it.

dan_e6 0 Light Poster · Answer 2 · 2008-05-21T17:23:59+00:00

If these documents are going to have an expected format for the references and it won't change at all, then it shouldn't be too hard to write a program that can read it all. I don't see what the real problem is, unless you're not a programmer and need someone to make this for you. But just realize that the format of the references cannot change at all. It needs to be in the exact order, otherwise you'll get garbage results from the program (you might get "1992" in the surname field, for example, lol).
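
As a rough illustration of the fixed-format case described above, here is a minimal Python sketch that handles exactly one layout, the one used by the sample entry in the question ("Surname, I. YEAR, Title, Journal, vol. V, no. N, pp. A-B."). The names REFERENCE and parse_reference and the field names are assumptions made for this example. Any entry that deviates from the layout is rejected with None instead of being split into the wrong fields.

```python
import re

# One rigid layout only:
#   Surname, I. YEAR, Title, Journal, vol. V, no. N, pp. A-B.
REFERENCE = re.compile(
    r"^(?P<surname>[^,]+),\s*(?P<initials>(?:[A-Z]\.\s*)+)\s*(?P<year>\d{4}),\s*"
    r"(?P<title>.+?),\s*(?P<journal>[^,]+),\s*"
    r"vol\.\s*(?P<volume>\d+),\s*no\.\s*(?P<issue>\d+),\s*"
    r"pp\.\s*(?P<first_page>\d+)\s*-\s*(?P<last_page>\d+)\.?$"
)


def parse_reference(line):
    """Return a dict of fields, or None if the line does not fit the layout."""
    m = REFERENCE.match(line.strip())
    if m is None:
        return None
    # Strip stray whitespace captured along with the initials.
    return {name: value.strip() for name, value in m.groupdict().items()}


if __name__ == "__main__":
    entry = ("Smith, J. 1982, A new method for reference analysing, "
             "Journal of Information Technology, vol. 23, no. 5, pp. 234-238.")
    print(parse_reference(entry))
    # A different citation style is not recognised at all.
    print(parse_reference("J. Smith (1982). A new method for reference analysing."))
```

Returning None for anything unexpected is a deliberate choice here: it is easier to spot an unparsed entry than a year that silently ended up in the surname field.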

JochenM 0 Newbie Poster · Answer 3 · 2008-05-23T16:34:52+00:00

dan_e6 wrote: "but just realize that the format of the references cannot change at all."

Of course the format will vary from document to document; otherwise I wouldn't have asked here.

But I found what I was looking for:
http://fas.sfu.ca/fas/CitationParser/
http://aye.comp.nus.edu.sg/parsCit
