Large-Scale Cross-Document Coreference Using Distributed Inference and Hierarchical Models, by Sameer Signh, Amarnag Subramanya, Fernando Pereira, and Andrew McCallum, ACL 2011.

I thought this paper was really interesting. They did things that are very similar to what I’m currently working on. So, this is a bit longer than most of my posts about research papers.


Read more...

Linguistic Structure Prediction, by Noah Smith.

Ok, this isn’t actually a paper, it’s a book. But I just finished it, and it’s a really good book, I thought.


Read more...

An Entity-Level Approach to Information Extraction, by Haghighi and Klein, ACL 2010.

This is the last of H&K’s coreference resolution papers, and as you can see, it’s not actually about coference resolution, per se. I include it in the group because essentially all they do is take their NAACL 2010 system and modify it slightly to perform a template-filling task for information extraction. That’s very similar to the kind of extension I’m looking at, though I’m taking a pretty different approach than they used.


Read more...

Coreference resolution in a modular, entity-centered model, by Haghighi and Klein, NAACL 2010.

This won the best paper award at NAACL that year, and I saw the presentation. I was pretty excited about the work then, and it looks like I still am, because I’m working on extending it. I have the code that ran this system, and I’m working on getting it to run with my data, then I can add some of my ideas to it.


Read more...

Simple coreference resolution with rich syntactic and semantic features, by Haghighi and Klein, EMNLP 2009.

This is the second of H&K’s coreference resolution papers. In a complete departure from their previous work, the system in this paper is completely deterministic and works with just a few syntactic and semantic cues. Basically, they come up with a candidate set of antecedents for each mention and select the one that is closest in the parse tree.


Read more...