next up previous
Next: About this document ... Up: Discovery of Linguistic Relations Previous: 5. Contributions

Bibliography

Baker, 1979
Baker, J. (1979).
Trainable grammars for speech recognition.
In Speech communication papers presented at the 97th Meeting of the Acoustical Society, pages 547-550.

Baum, 1972
Baum, L. E. (1972).
An inequality and associated maximization technique in statistical estimation for probabilistic functions of markov processes.
Inequalities, 3:1-8.

Beeferman et al., 1997
Beeferman, D., Berger, A., and Lafferty, J. (1997).
A model of lexical attraction and repulsion.
In ACL/EACL '97.

Briscoe and Waegner, 1992
Briscoe, T. and Waegner, N. (1992).
Robust stochastic parsing using the inside-outside algorithm.
In AAAI '92 Workshop on Probabilistically-Based Natural Language Processing Techniques, pages 39-53.

Brown et al., 1992
Brown, P. F. et al. (1992).
Class-based n-gram models of natural language.
Computational Linguistics, 18(4):467-479.

Carroll and Charniak, 1992a
Carroll, G. and Charniak, E. (1992a).
Learning probabilistic dependency grammars from labeled text.
In Probabilistic Approaches to Natural Language, Papers from 1992 AAAI Fall Symposium, pages 25-31.

Carroll and Charniak, 1992b
Carroll, G. and Charniak, E. (1992b).
Two experiments on learning probabilistic dependency grammars from corpora.
In Workshop Notes, Statistically Based NLP Techniqies, AAAI, pages 1-13.

Charniak, 1993
Charniak, E. (1993).
Statistical language learning.
MIT Press.

Charniak, 1997
Charniak, E. (1997).
Statistical parsing with a context-free grammar and word statistics.
In AAAI'97.

Chen, 1996
Chen, S. F. (1996).
Building probabilistic models for natural language.
PhD thesis, Harvard University.

Chomsky, 1957
Chomsky, N. (1957).
Syntactic Structures.
Mouton.

Chomsky, 1965
Chomsky, N. (1965).
Aspects of the theory of syntax.
MIT Press.

Collins, 1996
Collins, M. J. (1996).
A new statistical parser based on bigram lexical dependencies.
In Proceedings of the 34th Annual Meeting of the ACL.

Cormen et al., 1990
Cormen, T. H., Leiserson, C. E., and Rivest, R. L. (1990).
Introduction to Algorithms.
MIT Press and McGraw-Hill.

Cover and Thomas, 1991
Cover, T. M. and Thomas, J. A. (1991).
Elements of Information Theory.
John Wiley and Sons, Inc.

de Marcken, 1995
de Marcken, C. G. (1995).
On the unsupervised acquisition of phrase-structure grammars.
In Third Workshop on Very Large Corpora.

de Marcken, 1996
de Marcken, C. G. (1996).
Unsupervised language acquisition.
PhD thesis, MIT.

Fujisaki et al., 1989
Fujisaki, T., Jelinek, F., et al. (1989).
A probabilistic parsing method for sentence disambiguation.
In Proceedings of the 1st International Workshop on Parsing Technologies, pages 85-94.

Gaifman, 1965
Gaifman, H. (1965).
Dependency systems and phrase-structure systems.
Information and Control, 8:304-337.

Graham et al., 1994
Graham, R. L., Knuth, D. E., and Patashnik, O. (1994).
Concrete Mathematics.
Addison-Wesley, 2 edition.

Harary, 1969
Harary, F. (1969).
Graph Theory.
Addison-Wesley.

Hudson, 1984
Hudson, R. A. (1984).
Word Grammar.
B. Blackwell.

Jelinek, 1985
Jelinek, F. (1985).
Markov source modeling of text generation.
In Skwirzinski, J. K., editor, The Impact of Processing Techniques on Communications, pages 569-598. Martinus Nijhoff.

Lari and Young, 1990
Lari, K. and Young, S. (1990).
The estimation of stochastic context-free grammars using the inside-outside algorithm.
Computer Speech and Language, 4(1):35-56.

Lee, 1997
Lee, L. (1997).
Similarity-based approaches to natural language processing.
PhD thesis, Harvard University.

Magerman, 1995
Magerman, D. M. (1995).
Statistical decision-tree models for parsing.
In Proceedings of the 33rd Annual Meeting of the ACL.

Mel'cuk, 1988
Mel'cuk, I. A. (1988).
Dependency Syntax: Theory and Practice.
SUNY.

Pearl, 1988
Pearl, J. (1988).
Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference.
Morgan Kaufmann.

Pereira and Tishby, 1992
Pereira, F. and Tishby, N. (1992).
Distributional similarity, phase transitions and hierarchical clustering.
In Probabilistic Approaches to Natural Language, Papers from 1992 AAAI Fall Symposium, pages 108-112.

Pereira and Schabes, 1992
Pereira, F. C. and Schabes, Y. (1992).
Inside-outside reestimation from partially bracketed corpora.
In Proceedings of the 30th Annual Meeting of the Association for Computational Linguist, pages 128-135.

Quirk et al., 1985
Quirk, R., Greenbaum, S., Leech, G., and Svartvik, J. (1985).
A Comprehensive Grammar of the English Language.
Longman.

Rabiner and Juang, 1986
Rabiner, L. and Juang, B. (1986).
An introduction to hidden markov models.
IEEE ASSP Magazine, pages 4-16.

Schank and Colby, 1973
Schank, R. C. and Colby, K. M. (1973).
Computer Models of Thought and Language.
Freeman.

Shannon, 1948
Shannon, C. E. (1948).
A mathematical theory of communication.
The Bell System Technical Journal, 27.

Shannon, 1951
Shannon, C. E. (1951).
Prediction and entropy of printed english.
The Bell System Technical Journal, 30:50-64.

Sharman et al., 1990
Sharman, R., Jelinek, F., and Mercer, R. (1990).
Generating a grammar for statistical training.
In Proceedings of the Third DARPA Speech and Natural Language Workshop, pages 267-274.

Sleator and Temperley, 1991
Sleator, D. and Temperley, D. (1991).
Parsing english with a link grammar.
Technical Report CMU-CS-91-196, CMU.

Sleator and Temperley, 1993
Sleator, D. and Temperley, D. (1993).
Parsing english with a link grammar.
In Third international workshop on parsing technologies.

Stolcke, 1994
Stolcke, A. (1994).
Bayesian learning of probabilistic language models.
PhD thesis, University of California at Berkeley.

Viterbi, 1967
Viterbi, A. J. (1967).
Error bounds for convolutional codes and an asymptotically optimal decoding algorithm.
IEEE Transactions on Information Processing, 13:260-269.

Zipf, 1949
Zipf, G. K. (1949).
Human Behavior and the Principle of Least Effort.
Addison-Wesley.



Deniz Yuret
1998-10-07