Some Recent Publications of the Natural Processing
Group
Listed below are some of the more recent publications of
the faculty and graduate students in the
Natural Language Processing Group (Computer Science Department) at Johns
Hopkins University. (At the bottom are links to further
language-related publications by our close colleagues at JHU.)
2001
- J. Eisner. ``Smoothing a Probabilistic Lexicon via Syntactic
Transformations.''
Forthcoming.
- R. Florian, G. Ngai. ``
Multidimensional Transformation-Based Learning''
In Proceedings of the Fifth Computational Natural Language Learning Workshop
(CoNNL01), pp 1-8, July 2001.
- G. Mann, D. Yarowsky. ``
Multipath Translation Lexicon Induction via Bridge Languages''
To appear in Proceedings of the Second Conference of the North American
Chapter of the Association for Computational Linguistics, Pittsburgh,
PA.
- G. Ngai. ``Maximizing Resources for Corpus-Based Natural Language
Processing.''
PhD Disssertation, Johns Hopkins University.
- G. Ngai , R. Florian. ``
Transformation-Based Learning in the Fast Lane.''
To appear in Proceedings of the Second Conference of the North American
Chapter of the Association for Computational Linguistics, Pittsburgh,
PA.
- D. Yarowsky, G. Ngai. ``Inducing Multilingual POS Taggers and NP Bracketers
via Robust Projection across Aligned Corpora''
To appear in Proceedings of the Second Conference of the North American
Chapter of the Association for Computational Linguistics, Pittsburgh,
PA.
2000
- E. Brill, J. C. Henderson and G. Ngai. ``
Automatic Grammar Induction: Combining, Reducing and Doing Nothing'',
International Workshop on Parsing Technology
- S. Cucerzan and D. Yarowsky. ``
Language Independent Minimally Supervised Induction of Lexical Probabilities
.''
In Proceedings of the 38th Annual Meeting of the Associations for Computational
Linguistics, Hong Kong, pages 270-277.
- J. Eisner. ``
Bilexical Grammars and Their Cubic-time Parsing Algorithms. ''
In Harry Bunt and Anton Nijholt (eds.), Advances in Probabilistic and
Other Parsing Technologies, pp. 29-62. Kluwer Academic Publishers, October.
- J. Eisner and G. Satta. ``
A Faster Parsing Algorithm for Lexicalized Tree-Adjoining Grammars.
''
Proceedings of the 5th Workshop on Tree-Adjoining Grammars and Related
Formalisms (TAG+5), Paris, May.
- J. Eisner. ``
Easy and Hard Constraint Ranking in Optimality Theory: Algorithms and Complexity.
''
In Jason Eisner, Lauri Karttunen and Alain Thériault (eds.), Finite-State
Phonology: Proceedings of the 5th Workshop of the ACL Special Interest Group
in Computational Phonology (SIGPHON),
- J. Eisner. ``
Directional Constraint Evaluation in Optimality Theory.''
In Proceedings of the 18th International Conference on Computational
Linguistics (COLING 2000), Saarbrücken, August, pp. 257-263.
- R. Florian, J. C. Henderson and G. Ngai. ``
Coaxing Confidences Out of an Old Friend: Probabilistic Classifications from
Transformation Rule Lists'' ,
In Proceedings of the Fifth Conference on Empirical Methods in Natural
Language Processing, Hong Kong
- J. C. Henderson. ``Exploiting Diversity for Natural Language Parsing.''
PhD Dissertation, Johns Hopkins University.
- L. Mangu. ``Finding Consensus In Speech Recognition.''
PhD Dissertation, Johns Hopkins University.
- L. Mangu, E. Brill and A. Stolcke.
Finding Consensus in Speech Recognition: Word Error Minimization and Other
Applications of Confusion Networks.
In Computer, Speech and Language , 14(4):373-400.
- G. Ngai and D. Yarowsky.
``Rule Writing or Annotation: Cost-efficient Resource Usage for Base Noun
Phrase Chunking.''
In Proceedings of the 38th Annual Meeting of the Associations for Computational
Linguistics, Hong Kong, pages 117-125.
- P. Resnik and D. Yarowsky. ``
Distinguishing Systems and Distinguishing Senses: New Evaluation
Methods for Word Sense Disambiguation.''
In Natural Language Engineering, 5(2), pp. 113-133.
- S. Khudanpur, J. Wu,
Maximum Entropy Techniques for Exploiting Syntactic, Semantic and Collocational
Dependencies in Language Modeling,
Computer Speech and Language, pp. 355-372, Oct. 2000.
- J. Wu and S. Khudanpur,
Efficient Training Methods for Maximum Entropy Language Modeling,
Proceedings of ICSLP2000, Vol. 3, pp. 114-117, Oct. 2000, Beijing,
China.
- D. Yarowsky. ``
Hierarchical Decision Lists for Word Sense Disambiguation.''
In Computers and the Humanities, 34(2):179-186.
- D. Yarowsky. ``Word Sense Disambiguation.''
In R. Dale, H. Moisl and H. Somers (eds.)
The Handbook of Natural Language Processing. New York: Marcel Dekker,
pp. 629-654.
- D. Yarowsky and R. Wicentowski. ``
Minimally Supervised Morphological Analysis by Multimodal Alignment.''
In Proceedings of the 38th Annual Meeting of the Associations for Computational
Linguistics, Hong Kong, pages 207-216.
1999
- E. Brill and G. Ngai.
Man [and Woman] vs. Machine: A Case Study in Base Noun Phrase Learning
In Proceedings of the 37th Annual Meeting of the Association for Computational
Linguistics, Maryland, College Park.
- S. Cucerzan and D. Yarowsky. ``
Language Independent Named Entity Recognition Combining Morphological and
Contextual Evidence.''
In Proceedings, 1999 Joint SIGDAT Conference on Empirical Methods in
NLP and Very Large Corpora, pp. 90-99.
- J. Eisner and G. Satta. ``
Efficient Parsing for Bilexical Context-Free Grammars and Head
Automaton Grammars. ''
In Proceedings of the 37th Annual Meeting of the Association
for Computational Linguistics, University of Maryland, June, pp. 457-464.
- J. Eisner. ``
Doing OT in a Straitjacket.''
Talk handout, UCLA Linguistics Dept., June..
- R. Florian, D. Yarowsky. ``
Dynamic Non-local Language Modeling via Hierarchical Topic-Based Adaptation
'',
In Proceedings of the 37th Annual Meeting of the Association for Computational
Linguistics, College Park, MD, USA
- J. C. Henderson and E. Brill. ``Exploiting Diversity in Natural Language
Processing: Combining Parsers.''
In Proceedings of the Fourth Conference on Empirical Methods in Natural
Language Processing. College Park, Maryland.
- L. Mangu, E. Brill and A. Stolcke. ``
Finding Consensus Among Words: Lattice-Based Word Error Minimization.
''
In Proc. of EUROSPEECH'99, Budapest, Hungary.
- L. Mangu and E. Brill. ``
Lattice Compression in the Consensual Post-Processing Framework. ''
In Proc. of SCI/ISAS'99, Orlando, Florida.
- L.Mangu, E. Brill and A. Stolcke. ``
Improve Accuracy by Local Consensus.''
In Proc. Hub-5 Conversational Speech Understanding Workshop, Linthicum,
MD.
- S. Armstrong, K. Church, P. Isabelle, E. Tzoukermann and D. Yarowsky
(Eds.), Natural Language Processing Using Very Large Corpora.
Kluwer Academic Publishers.
- S. Khudanpur and J. Wu.
A Maximum Entropy Language Model to Integrate N-Grams and Topic Dependencies
for Conversational Speech Recognition.
Proceedings of ICASSP'99, pp. 553-556, March 14-19, 1999, Phoenix.
- J. Wu and S. Khudanpur,
Combining Nonlocal, Syntactic and N-Gram Dependencies in Language Modeling.
Proceedings of Eurospeech'99, vol 5, pp2179-2182, September 6-10,
1999, Budapest, Hungary.
- D. Yarowsky. ``
Corpus-based Techniques for Restoring Accents in Spanish and French Text
.''
In Natural Language Processing Using Very Large Corpora. Kluwer Academic
Publishers, pp. 99-120.
- D. Yarowsky, R. Florian. ``
Taking the Load Off the Conference Chairs: Towards a Digital Paper-Routing
Assistant,''
In Proceedings of the Fourth Conference on Empirical Methods in Natural
Language Processing, College Park, MD, USA
1998
- E. Brill, R. Florian, J. C. Henderson, and L. Mangu. ``
Beyond N-Grams: Can Linguistic Sophistication Improve Language Modeling?
''
In Proceedings of COLING-ACL'98, Montreal, Canada.
- E. Brill and J. Wu.
Classifier Combination for Improved Lexical Disambiguation
Processings of COLING-ACL'98, pp 191-195, Auguest 10-14, 1998, Montreal
Canada.
- L. Mangu, E. Brill and A. Stolcke. ``
Searching for Consensus to Improve Recognizer Output.''
In Proc. Hub-5 Conversational Speech Recognition Workshop, Linthicum,
MD.
1997
- C. Chelba, D. Engle, F. Jelinek, V. Jimenez, S. Khudanpur, L. Mangu,
H. Printz, E. Ristad, R. Rosenfeld, A. Stolcke, D. Wu. ``
Structure and Performance of a Dependency Language Model. ''
In Proc. EUROSPEECH'97, 2775-2778, Rhodes, Greece.
- J. Eisner. ``
Efficient Generation in Primitive Optimality Theory. ''
In Proceedings of the 35th Annual Meeting of the Association for Computational
Linguistics and the 8th Conference of the European Association for Computational
Linguistics, Madrid, July, pp. 313-320.
- J. Eisner. ``
FootForm Decomposed: Using Primitive Constraints in OT. ''
In MIT Working Papers in Linguistics, vol. 31, edited by Benjamin
Bruening.
- J. Eisner. ``Bilexical grammars and a cubic-time probabilistic
parser.''
In Proceedings of the International Workshop on Parsing Technologies
, MIT, September, pp. 54-65.
- W. Kim and Myoung--Wan Koo.
Statistical Corpus Analysis for KT--TREASURE : Korea Telecom Train ticket
REservation Aid System based Upon speech REcognition,
In Proc. 14th Int. Conf. on Speech Processing pp. 319-323.
- W. Kim and Myoung--Wan Koo.
A Korean Speech Corpus for Train Ticket Reservation Aid System Based On
Speech Recognition,
In 5th Proc. European Conf. on Speech Communication and Technology,
Vol. 4, pp. 1723-1726.
- L. Mangu and E. Brill. ``
Automatic Rule Acquisition for Spelling Correction. ''
In Proceedings of the Fourteenth International Conference on Machine
Learning, ICML'97, Nashville, Tennessee.
- P. Resnik and D. Yarowsky. ``
A Perspective on Word Sense Disambiguation Methods and Their Evaluation
.''
In Proceedings of SIGLEX '97, Washington, DC, pp. 79-86.
- G. Satta and J. C. Henderson. ``String Transformation Learning.''
In Proceedings of the 35th Conference of the Association for Computational
Linguistics. Madrid, Spain.