AT&T

AT&T Labs Research
AT&T Labs Research - Our People

 

Home
People
Projects
Research Areas
Resources

Mugshot

Church, Kenneth W.

   

Email: kwc@research.att.com

URL: http://www.research.att.com/~kwc

Phone: +1 973 360-8620
Fax: +1 973 360-8077
Room: D235
Address: 180 PARK AVE, P.O. BOX 971, FLORHAM PARK, NJ 07932-0971 UNITED STATES

Dept. Head, AT&T Labs Research

 


CV

 

(NOTE: Some of the older papers are hard to find in electronic form. Links marked with an asterisk (*) are as similar to the published paper as I could find.)

 

Journals

 

1.    Church, K., and Patil, R. (1982) “Coping with Syntactic Ambiguity or How to Put the Block in the Box on the Table,” American Journal of Computational Linguistics, Volume 8, Number 3-4, July-December.

2.    Church, K. (1987) “Phonological Parsing and Lexical Retrieval” Cognition, Volume 25, pp. 53-69. ps

3.    Church, K., and Hanks, P., “Word Association Norms, Mutual Information and Lexicography,” Computational Linguistics, Vol 16:1, pp. 22-29, (1991). ps

4.    Church, K., and Gale, W., “A Comparison of the Enhanced Good-Turing and Deleted Estimation Methods for Estimating Probabilities of English Bigrams,” Computer Speech and Language, 5:1, (1991).

5.    Church, K., and Gale, W. (1991) “Probability Scoring for Spelling Correction,” Statistics and Computing. ps (missing figures)

6.    Church, K., review of Aarts, J., and Meijs, W. (eds.) (1991) “Theory and Practice in Corpus Linguistics” Rodopi, Amesterdam, Computational Linguistics.

7.    Gale, W., and Church, K. (1993) “A Program for Aligning Sentences in Bilingual Corpora,” Computational Linguistics, 19:1, pp. 75-102. ps (figures missing) text (containing useful appendix) see related conference paper

8.    Gale, W., Church, K. and Yarowsky, D. (1993) “A Method for Disambiguating Word Senses in a Large Corpus,” Computers and Humanities, 26: 415-439. ps

9.    Church, K. and Helfman, J. (1993) “Dotplot: a Program for Exploring Self-Similarity in Millions of Lines of Text and Code,” The Journal of Computational and Graphical Statistics, 2:2, pp. 153-174. ps related pages

10.Church, K. and Mercer, R. (1993) “Introduction to the Special Issue on Computational Linguistics Using Large Corpora,” Computational Linguistics, 19:1, pp. 1-24. ps

11.Church, K. and Hovy, E. (1993) “Good Applications for Crummy Machine Translation,” Machine Translation, vol 8, pp. 239-258. ps

12.Church, K. and Rau, L. (1995) “Commercial Applications of Natural Language Processing,” Communications of the ACM, pp. 71-79. ps

13.Church, K. and Gale, W. (1995) “Poisson Mixtures,” Journal of Natural Language Engineering. ps

14.Isabelle, P. and Church, K. (eds.) (1997) special issue of Machine Translation, vol 12:1/2, Kluwer.

15.Dagan, I., Church, K. (1997), “Termight: Coordinating Humans and Machines in Bilingual Terminology Acquisition,” Machine Translation, vol 12:1/2, pp. 89-107.

16.Yamamoto, M. and Church, K (2001) “Using Suffix Arrays to compute Term Frequency and Document Frequency for All Substrings in a Corpus,” Computational Linguistics, vol 27:1, pp. 1-30, MIT Press. pdf related links

 

Patents

 

1.    4,829,580 “Text analysis system with letter sequence recognition and speech stress assignment arrangement.”

2.    5,146,405 “Methods for part-of-speech determination and usage.”

3.    5,283,833 “Method and apparatus for speech processing using morphology and rhyming.”

4.    5,541,836 “Word disambiguation apparatus and methods.”

5.    5,572,423 “Method for correcting spelling using error frequencies.”

6.    5,608,622 “System for analyzing translations.”

7.    5,647,023 “Method of nonlinear filtering of degraded document images.”

 

Conference Proceedings

 

1.    Church, K., (1979) “Co-ordinate Squares: A Solution to Many Chess Pawn Endgames,” (abbreviated version of B.S. Thesis), International Joint Conference on Artificial Intelligence, Tokyo, Japan.

2.    Church, K. (1980) “On Parsing Strategies and Closure,” Association for Computational Linguistics. pdf (direct from ACL)

3.    Church, K. (1983) “A Finite-State Parser for Use in Speech Recognition,” Association for Computational Linguistics. pdf (direct from ACL)

4.    Church, K. (1983) “Allophonic and Phonotactic Constraints are Useful,” International Joint Conference on Artificial Intelligence, Karlsruhe, West Germany.

5.    Church, K. (1983) “A Finite-State Parser for Use in Speech Recognition,” Journal of the Acoustical Society of America, Supplement 1, Vol. 74.

6.    Ejerhed, E., and Church, K. (1983) “Finite State Parsing,” in Karlsson, F. (ed), Papers from the Seventh Scandinavian Conference of Linguistics, University of Helsinki.

7.    Church, K. (1985) “Stress Assignment in Letter to Sound Rules for Speech Synthesis,” Association for Computational Linguistics. pdf (direct from ACL) ps

8.    Church, K. (1986) “Morphological Decomposition and Stress Assignment for Speech Synthesis,” Association for Computation Linguistics. pdf (direct from ACL)

9.    Church, K. (1987) “Three Classes of + Boundaries,” The Eleventh International Congress of Phonetic Sciences, Estonia, USSR. ps

10.Church, K. (1988) “A Stochastic Parts Program and Noun Phrase Parser for Unrestricted Text,” Second Conference on Applied Natural Language Processing, Austin, Texas, pp. 136-143. pdf (direct from ACL) ps

11.Koskenniemi, K., and Church K. (1988) “Complexity, Two-Level Morphology and Finnish,” Coling, Budapest, Hungary. ps

12.Church, K. (1989) “Syntactic Parsing May Not Help Speech Recognition Very Much,” in Working Notes of the AAAI Symposium: Text-Based Intelligent Systems.

13.Church, K., and Hanks, P. (1989) “Word Association Norms, Mutual Information and Lexicography,” Association for Computational Linguistics, Vancouver, Canada. pdf (direct from ACL)

14.Church, K., and Gale, W. (1989) “Enhanced Good-Turing and Cat-Cal: Two New Methods for Estimating Probabilities of English Bigrams,” Second Darpa Workshop on Speech and Natural Language, Cape Cod. ps

15.Church, K., and Hindle, D. (1990) “Collocational Constraints and Corpus-Based Linguistics,” in Working Notes of the AAAI Symposium: Text-Based Intelligent Systems.

16.Church, K., and Gale, W. (1990) “Poor Estimates of Context are Worse than None,” Third Darpa Workshop on Speech and Natural Language, Hidden Valley, PA. ps

17.Kernighan, M., Church, K., Gale, W (1990) “A Spelling Correction Program Based on a Noisy Channel Model,” Coling, Helsinki, Finland. ps

18.Coker, C., Church, K., Liberman, M. (1990) “Morphology and Rhyming: Two Powerful Alternatives to Letter-to-Sound Rules for Speech Synthesis,” European Speech Communication Association, Conference on Speech Synthesis. ps

19.Gale, W. A. and K. W. Church (1990) “Estimation Procedures for Language Context: Poor Estimates are Worse than None,” Proceedings in Computational Statistics, 1990, p.69-74, Physica-Verlag, Heidelberg.

20.Gale, W., and Church, K. (1991) “Identifying Word Correspondences in Parallel Text,” Fourth Darpa Workshop on Speech and Natural Language, Asilomar, pp. 152-157. ps*

21.Gale, W., and Church, K. (1991) “A Program for Aligning Sentences in Bilingual Corpora,” Association for Computational Linguistics. pdf (direct from ACL)

22.Church, K. (1991) “Some Statistical Opportunities in Speech and Language,” 23rd Symposium on the Interface, Computing Science and Statistics, Seattle, Washington. ps

23.Church, K. (1991) “Concordances for Parallel Text,” Seventh Annual Conference of the UW Centre for the New OED and Text Research, Oxford, England. ps

24.Church, K. and Hovy, E. (1991) “Good Applications for Crummy Machine Translation,” in Neal, J. and Walter, S. Natural Language Processing Systems Evaluation Workshop, University of California, Berkeley, CA.

25.Church, K. and Gale, W. (1992) “One Sense per Discourse,” Proceedings of Fifth DARPA Workshop on Speech and Natural Language. ps

26.Gale, W., Church, K., Yarowsky, D. (1992) “Estimating Upper and Lower Bounds on the Performance of Word-Sense Disambiguation Programs,” Association for Computational Linguistics. ps

27.Gale, W., Church, K., Yarowsky, D. (1992) “Using Bilingual Materials to Develop Word Sense Disambiguation Methods, TMI, Montreal. pdf (direct from ACL) ps

28.Church, K. and Helfman, J. (1992) “Dotplot: a Program for Exploring Self-Similarity in Millions of Lines of Text and Code,” Interface.

29.Gale, W., Church, K. and Yarowsky, D. (1992) “Work on Statistical Methods for Word Sense Disambiguation,” Proceedings AAAI Fall Workshop on Statistics in Natural Language.

30.Church, K. (1993) “Char_align: A Program for Aligning Parallel Texts at the Character Level,” Association for Computational Linguistics, pp. 1-8. pdf (direct from ACL) ps

31.Dagan, I., Church, K., and Gale, W. (1993) “Robust Word Alignment for Machine Aided Translation,” Proceedings of the Workshop on Very Large Corpora: Academic and Industrial Perspectives, available from the Association for Computational Linguistics, pp. 1-8. ps

32.Church, K., Dagan, I., Gale, W., Fung, P., Helfman, J., Satish, B. (1993) “Aligning Parallel Texts: Do Methods Developed for English-French Generalize to Asian Languages?” Pacific Asia Conference on Formal and Computational Linguistics, pp. 1-12. ps

33.Church, K., Gale, W., Helfman, J., Lewis, D. (1994) “Fax: An Alternative to SGML,” Coling. ps

34.Fung, P. and Church, K. (1994) “K-vec: A New Approach for Aligning Parallel Texts,” Coling. ps

35.Dagan, I. and Church, K. (1994) “TERMIGHT: Identifying and Translating Technical Terminology” 4th Conference on Applied Natural Language Processing. pdf (direct from ACL)

36.Agazzi, O., Church, K., and Gale, W. (1994), “Using OCR and Equalization to Downsample Documents,” Proceedings of the 12th International Conference on Pattern Recognition, Jerusalem, Israel, October 10-13, pp. 305-309.

37.Church, K. (1995) “One Term or Two?” Proceedings of the 18th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 310-318. ps

38.Church, K. and Gale, W. (1995) “Inverse Document Frequency (IDF): A Measure of Deviation from Poisson,” Proceedings of the Third Workshop on Very Large Corpora, pp. 121-130. ps

39.Caldwell, D., Church, K. and Fowler, G. (1998), “Using Quantitative Methods to Compress Call Detail: From Data Warehousing to Data Publishing,” Symposium on Quantitative Analysis for Decision Making, pp. 71-79.

40.Yamamoto, M. and Church, K (1998) “Using Suffix Arrays to Compute Term Frequency and Document Frequency for All Substrings in a Corpus,” Proceedings of the Sixth Workshop on Very Large Corpora, pp. 28-37.

41.Belanger, D., Church, K. and Hume, A. (1999) Virtual Data Warehousing, Data Publishing, and Call Detail, pp. 106-117, W. Jonker (eds.) “Databases in Telecommunications 1999, International Workshop Co-located with VLDB-99,” Edinburgh, Scotland, Proceedings. Lecture Notes in Computer Science, Vol. 1819, Springer, 2000, ISBN 3-540-67667-9 http://www.informatik.uni-trier.de/~ley/db/conf/vldbw/vldbw1999.html. doc*

42.Ozawa, T., Yamamoto, M., Umemura, K., Church, K. (1999) Japanese word segmentation using similarity measure for IR. In Proceedings of the First NTCIR Workshop on Research in Japanese Text Retrieval and Term Recognition. Tokyo, Japan, August, 1999, p. 89-96. pdf

43.Buchsbaum, A., Caldwell, D., Church, K., Fowler, G. and Muthukrishnan, S. (2000), “Engineering the Compression of Massive Tables: An Experimental Approach,” in Proc. 11th ACM-SIAM Symp. on Discrete Algorithms (SODA), pp. 175-184. pdf related links

44.Church, K. (2000), “Empirical Estimates of Adaptation: The chance of Two Noriega's is closer to p/2 than p2,” Coling, pp. 173-179. pdf ps

45.Umemura, K. and Church, K. (2000), “Empirical Term Weighting and Expansion Frequency,” Empirical Methods in Natural Language Processing and Very Large Corpora, pp. 117-123. pdf

46.Yamamoto, E., Yamamoto, M., Umemura, K. and Church, K. (2000) “Dynamic Programming: A Method for Taking Advantage of Technical Terminology in Japanese Documents,” Information Retrieval with Asian Languages (IRAL), pp. 125-132.

47.Feng, L., Umemura, K., Yamamoto, M. and Church, K. (2000) “Using Variable Length Ngrams for Retrieving Technical Abstracts in Japanese,” Information Retrieval with Asian Languages (IRAL), pp. 213-214.

48.Sable, C. and Church, K. (2001) “Using Bins to Empirically Estimate Term Weights for Text Categorization,” 2001 Conference on Empirical Methods in Natural Language Processing (EMNLP 2001). ps alternate

49.Sable, C. McKeown, K., and Church, K. (2002) "NLP Found Helpful (at least for one Text Categorization Task)," in Proceedings of the 2002 Conference on Empirical Methods in Natural Language Processing (EMNLP), Philadelphia, PA. ps

50.Church, K. (2003) “Speech and Language Processing: Where have we been and where are we going,” Eurospeech, Geneva, Switzerland.  pdf slides

 

 

Books

 

1.    Church, K. (1988) Phonological Parsing in Speech Recognition, Kluwer.

2.    Armstrong, S., Church, K., Isabelle, P., Manzi, S., Tzoukermann, E. and Yarowksky, D. (eds.) (1999), Natural Language Processing Using Very Large Corpora, Kluwer.

 

Book Chapters

 

1.    Church, R., and Church, K. (1977) “Plans, Goals, and Search Strategies for the Seleciton of a Move in Chess,” in Frey, P. (ed), Chess Skill in Man and Machine, Springer-Verlag.

2.    Martin, W., Church, K., and Patil, R. (1987) “Preliminary Analysis of a Breadth-First Parsing Algorithm: Theoretical and Experimental Results,” L.Bolc (ed.), Natural Language Parsing Systems, Springer-Verlag.

3.    Church, K. (1987) “Phonological Parsing and Lexical Retrieval” in Frauenfelder, U. and Tyler, L., (ed) Spoken Word Recognition, MIT Press.

4.    Church, K, Gale, W., Hanks, P., Hindle, D. (1991) “Parsing, Word Associations and Typical Predicate-Argument Relations,” in Tomita, M.(ed.)  Current Issues in Parsing Technology, Kluwer Academic Publishers, Boston, MA. ps

5.    Church, K., Hanks, P., Hindle, D., Gale, W. (1991) “Using Statistics in Lexical Analysis,” in Zernik (ed), Lexical Acquisition: Using On-line Resources to Build a Lexicon, Lawrence Erlbaum, pp. 115-164. ps

6.    Liberman, M., and Church, K. (1991) “Text Analysis and Word Pronunciation in Text-to-Speech Synthesis,” in Furui, S., and Sondhi, M. (eds.), Advances in Speech Signal Processing. ps*

7.    Church, K., Hanks, P., Hindle, D., Gale, W., Moon, R., “Substitutablity,” in Atkins and Zampolli (ed.), (1994), Computational Approaches to the Lexicon Automating the Lexicon II Schema, Oxford University Press, pp. 153-180. ps

8.    Church, K, “Text Analysis,” in Mellish (ed.), (to appear) Encyclopedia of Language and Linguistics, Pergamon Press, Aberdeen University Press.

9.    Church, K., “Comments on Computational Learning Model for Metrical Phonology,” Levine, R. (ed.), Formal Grammar: Theory and Implementation, Vancouver Studies in Cognitive Science Series, UBC Press, Vol. 2 (1992). ps

10.Church, K. (1992) “Current Practice in Part of Speech Tagging and Suggestions for the Future,” in Simmons (ed.), Abornik praci: In Honor of Henry Kucera, Michigan Slavic Studies, pp. 13-48. ps

11.Gale, W. and Church, K. (1993) “Statistical Approaches to Aligning Sentences and Identifying Word Correspondences in Parallel Texts: a Report on Work in Progress,” in D. J. Hand (ed.) Artificial Intelligence Frontiers in Statistics, AI and Statistics III, pp. 281-294.

12.Gale, W. and Church, K. (1994) “What's Wrong with Adding One?”  in N. Oostdijk and P. de Haan (eds.), Corpus-Based Research into Languge: In honour of Jan Aarts, Rodopi, Amsterdam, pp. 189-200. ps

13.Gale, W. and Church, K. (1994) “Discrimination Decisions in 100,000 Dimensional Spaces,” in A. Zampolli, N. Calzolari and M. Palmer (eds.), Current Issues in Computational Linguisitcs: In honour of Don Walker, Kluwer Academic Publishers, The Netherlands, pp. 429-550. ps

 

Tutorials

 

1.    Ngrams, ACL 1995. ps

2.    NLP Techniques and Text Retrieval, Lugano Switzerland. ps

3.    Unix for Poets ps

4.    Applications of Text Analysis ps

5.    Empirical Methods in Natural Language Processing: What's Happened Since the First SIGDAT Meeting? NAACL-2000 abstract, slides & handout

 

Invited Talks

 

1.    EACL-1993

2.    What’s Happened Since the First SIGDAT Meeting?, SIGDAT-1999 ppt

3.    Empiricism form TMI-1992 to AMTA-2002 to AMTA-2012: Have IBM Models 1-5 failed to solve all the world's problems? AMTA-2002 ppt

 


home | people | projects | research areas | resources

Copyright and Privacy Notice