Algorithm/Lib for analysing references in scientific documents needed

Please support our C++ advertiser: Intel Parallel Studio Home
Reply

Join Date: May 2008
Posts: 3
Reputation: JochenM is an unknown quantity at this point 
Solved Threads: 0
JochenM JochenM is offline Offline
Newbie Poster

Algorithm/Lib for analysing references in scientific documents needed

 
0
  #1
May 20th, 2008
hi folks,

i need a library to analyse references of a scientific document. the lib should be able to identify references in the full text (for instance [1], [2], ... or Author A (1995), ... Author B & C (1968), ...) and it should be able to identify the elements in the reference list. For instance if the reference list looks like this:

...
Smith, J. 1982, A new method for reference analysing, Journal of Information Technology, vol. 23, no. 5, pp. 234-238.
...

the library/algorithm should return for instance an array/list/... like this

|surname| |initial| |year| |title| |journal| ....

Do libraries like this exist in C++, Java, C# or whatever? I would also pay money for it if necessary.

Best regards
Jochen
Reply With Quote Quick reply to this message  
Join Date: Dec 2005
Posts: 5,850
Reputation: Salem has a reputation beyond repute Salem has a reputation beyond repute Salem has a reputation beyond repute Salem has a reputation beyond repute Salem has a reputation beyond repute Salem has a reputation beyond repute Salem has a reputation beyond repute Salem has a reputation beyond repute Salem has a reputation beyond repute Salem has a reputation beyond repute Salem has a reputation beyond repute 
Solved Threads: 751
Team Colleague
Salem's Avatar
Salem Salem is offline Offline
Void main'ers are DOOMed

Re: Algorithm/Lib for analysing references in scientific documents needed

 
0
  #2
May 20th, 2008
What format(s) of documents are you expected to deal with?
- DOC
- PDF
- etc etc
Reply With Quote Quick reply to this message  
Join Date: May 2008
Posts: 3
Reputation: JochenM is an unknown quantity at this point 
Solved Threads: 0
JochenM JochenM is offline Offline
Newbie Poster

Re: Algorithm/Lib for analysing references in scientific documents needed

 
0
  #3
May 21st, 2008
the format actually doesn't really matter. i would prefer pdf but if it's doc, plain text or whatsoever i will find a way to transform it.
Reply With Quote Quick reply to this message  
Join Date: May 2006
Posts: 1,857
Reputation: ithelp is a name known to all ithelp is a name known to all ithelp is a name known to all ithelp is a name known to all ithelp is a name known to all ithelp is a name known to all 
Solved Threads: 120
ithelp's Avatar
ithelp ithelp is offline Offline
Posting Virtuoso

Re: Algorithm/Lib for analysing references in scientific documents needed

 
0
  #4
May 21st, 2008
format is important else how will you know smith is name but not part of title itself, you need to specify the format first and then parse the documents to find out all the informations required.
Reply With Quote Quick reply to this message  
Join Date: Apr 2008
Posts: 26
Reputation: dan_e6 is an unknown quantity at this point 
Solved Threads: 0
dan_e6 dan_e6 is offline Offline
Light Poster

Re: Algorithm/Lib for analysing references in scientific documents needed

 
0
  #5
May 21st, 2008
if these documents are going to have an expected format for the stuff and it wont change at all. then it shouldn't be too hard to write a program that can read it all. i dont see what the real problem is unless you're not a programmer and need someone to make this for you. but just realize that the format of the references cannot change at all. it needs to be in the exact order otherwise you'll get garbage results from the program (you mite get "1992" in the surname field for example lol)
Reply With Quote Quick reply to this message  
Join Date: May 2008
Posts: 3
Reputation: JochenM is an unknown quantity at this point 
Solved Threads: 0
JochenM JochenM is offline Offline
Newbie Poster

Re: Algorithm/Lib for analysing references in scientific documents needed

 
0
  #6
May 23rd, 2008
but just realize that the format of the references cannot change at all.
of course format will vary from document to document. otherwise i wouldn't have asked here.

but i found what i was looking for
http://fas.sfu.ca/fas/CitationParser/
http://aye.comp.nus.edu.sg/parsCit
Reply With Quote Quick reply to this message  
Reply

This thread is more than three months old.
Perhaps start a new thread instead?
Message:



Other Threads in the C++ Forum
Thread Tools Search this Thread



Tag cloud for C++
About Us | Contact Us | Advertise | DaniWeb | Acceptable Use Policy | RSS Feed

©2003 - 2009 DaniWeb® LLC