Algorithm for Syntactic Tree Parser

Question

swinefish 5 Junior Poster in Training

15 Years Ago

Hey all

I'm trying to write a Multiple Document Summary (MDS) system as proposed by Chali et al. (2009), see attached. And after a lot of reading I've managed to wrap my head around a lot of the concepts.
However, one of the things I need to make is a component to parse sentences into syntactic trees. I've been looking around on Google for a while and I haven't found anything really useful. There are systems which draw the trees given a specific notation, but I have no way of creating that notation. A reference to a parser is given in the text (Charniak (1999)) however I can only read the abstract of this article not the full text.
If anybody could perhaps suggest an algorithm (or at least point me in the right direction) I would be really grateful.

Thanks in advance
M

data-structure

This attachment is potentially unsafe to open. It may be an executable that is capable of making changes to your file system, or it may require specific software to open. Use caution and only open this attachment if you are comfortable working with pdf files.

Complex_Question_Answering_Unsupervised_Learning.pdf (405.75 KB)

Edited 15 Years Ago by swinefish because: Attached Document

2 Contributors
1 Reply
147 Views
1 Week Discussion Span
Latest Post 15 Years Ago Latest Post by jk451

Reply to this topic

Be a part of the DaniWeb community

We're a friendly, industry-focused community of developers, IT pros, digital marketers, and technology enthusiasts meeting, networking, learning, and sharing knowledge.

jk451 0 Newbie Poster · Answer 1 · 2009-12-20T21:07:47+00:00

Hey all
I'm trying to write a Multiple Document Summary (MDS) system as proposed by Chali et al. (2009), see attached. And after a lot of reading I've managed to wrap my head around a lot of the concepts.
However, one of the things I need to make is a component to parse sentences into syntactic trees. I've been looking around on Google for a while and I haven't found anything really useful. There are systems which draw the trees given a specific notation, but I have no way of creating that notation. A reference to a parser is given in the text (Charniak (1999)) however I can only read the abstract of this article not the full text.
If anybody could perhaps suggest an algorithm (or at least point me in the right direction) I would be really grateful.
Thanks in advance
M

There are many natural language parsing implementations.

If you can, I very much suggest using NLTK -- Natural Language Toolkit. This is, however, implemented in Python, though e.g. with Java, you can interface via Jython.

This has the huge advantage that it has ready-to-go POS-tagger and natural grammar parser implementation, though you will still have to spend a fair amount of time figuring out how to actually use them.

Otherwise, if you got a random standalone parser from the web, it is likely that it would require tokenized, POS-tagged input.

Note though, that they do not produce a 100 % correct result, because e.g. due to ambiguities it does not exist, plus if it did, they would still approximate it.