Here are some useful links on various technical topics related to NLP. See also nlp_links for general NLP links of various sorts.

Format of the following links:

Work to do on this page/suggestions:

General, Books

NOTE: An entry marked as [on-line] means that the whole book is available on-line. Even for the other books, there are often sample chapters and supplementary material of other sort available.

  • book [on-line] Jurafsky and Martin (2008) - Speech and Language Processing, draft 2nd edition: Most draft chapters are available online and are perhaps the best reference to start with for those who want a solid intro without excessive math. Beware: The first edition of Jurafsky and Martin is lacking in coverage of a large number of currently core NLP areas. Use the draft 2nd edition instead.
  • book Manning and Schuetze - Foundations of Statistical Natural Language Processing: Along with Jurafsky and Martin, this is the standard NLP reference. Beware: The text is often pitched too high to work well as a first introduction to the field.
  • book Chris Bishop (2007) - Pattern Recognition and Machine Learning: This book covers statistical machine learning rather than NLP. It is great as a more-advanced intro to many important modern-day topics (generalized linear models, generative vs. discriminative models, neural networks, kernel methods, graphical models, mixture models, EM, variational inference, sampling, HMM’s, ensembles, etc.). Beware: The book claims to be suitable as a first intro for those with little or no previous machine learning or probability background, but it in fact is pitched far, far above this. You should have a good understanding of probability and statistics and decent experience with machine learning before tackling this book.
  • book Russell and Norvig - Artificial Intelligence: A Modern Approach: This is the standard book on artificial intelligence, and is pitched towards those new to the field. It has chapters on probability (mostly Bayesian), probabilistic inference, directed graphical models (i.e. Bayes nets), probabilistic inference over time (e.g. HMM’s), learning, statistical learning, etc. These can serve as good overviews for those with no prior experience with these topics.
  • book [on-line] David MacKay - Information Theory, Inference, and Learning Algorithms: Covers much more than just information theory. Extensive coverage of many statistical topics.
  • book Tom Mitchell - Machine Learning: This is the classic book for machine learning. However, it covers very little in the way of statistical machine learning. It does have one chapter on Bayesian learning and one on computational learning theory.

The web site has a new chapter on Naive Bayes and logistic regression.

  • book Degroot and Schervish - Probability and Statistics (intermediate): There are a tremendous number of books on statistics. Most of them follow a pretty standardized set of topics, but differ somewhat in the level of difficulty. This book’s level is intermediate. It assumes at least passing familiarity with statistics and multi-variate calculus. The advantage of this book for an NLP‘er is that it takes a much more Bayesian outlook than most other books of this sort. It also has a good chapter on sampling (importance/rejection sampling, Gibbs sampling, MCMC, bootstrap, etc.).
  • book Casella and Berger - Statistical Inference (advanced): This is the classic book on statistics. It is well organized and extremely clearly written. It assumes previous statistics background. It has little Bayesian statistics, making it less than completely useful for NLP‘ers.

General, On-line and not in book form

Bayesian Modeling

Boosting (see "Ensemble Models")

Conditional Random Fields

Context-Free Grammars (CFG, PCFG) (see "Parsing")

Coreference

Dirichlet Processes (see "Nonparametric Bayes")

Discourse

Ensemble Models, Boosting

Expectation Maximization

Gaussian Processes

Gibbs Sampling (See "Sampling")

Grammar Learning/Inference/Induction

Lexicalized:

  • [Charniak, 1996]
  • [Collins, 1999]

Manual refinement:

  • [Johnson, 1998]
  • [Klein, Manning, 2003]

Automatic refinement:

  • [Matsuzaki, et al., 2005]
  • [Petrov, et al., 2006]

Misc:

  • Kenichi Kurihara and T. Sato. Variational Bayesian grammar induction for natural language. International Colloquium on Grammatical Inference, 2006.

Graphical Models

Hidden Markov Models

Information Extraction

Information Retrieval, Question Answering

Information Theory

Lagrange Multipliers

Language Modeling, N-Grams

Lexical Acquisition

Logistic Regression (see "Maximum Entropy")

Kernels (see "Support Vector Machines")

Machine Learning (General)

Machine Translation

Introduction:

Word-for-Word Translation:

Phrase-based Systems:

Syntax-Based Systems:

Language Modeling for MT:

Evaluation:

Discriminative Training:

Synchronous Tree Adjoining Grammar:

Interlingua:

Example-Based MT:

Multilingual NLP:

Tree Transducers:

Pages with links to related texts:

Markov Chain Monte Carlo (MCMC) (See "Sampling")

Markov Logic Networks

Maximum Entropy, Logistic Regression

Overviews and fundamental texts:

Other important texts:

Pages with links to related texts:

Mixture Models

N-Grams (See "Language Modeling")

Named Entity Recognition

Neural Networks

Nonparametric Bayes, Dirichlet Processes

Overviews:

Basic theory:

More complex models:

Approximate inference: MCMC (Sampling):

Approximate inference: Empirical Bayes:

Approximate inference: Variational Bayes:

Language modeling:

Other applications:

Pages with links to related texts:

Parsing

PCFG’s:

Part-of-Speech Tagging

Principal Components Analysis (PCA)

Reranking

Sampling, Gibbs Sampling, Markov Chain Monte Carlo (MCMC)

Scalability

Selectional Preferences (see "Lexical Acquisition")

Semantic Role Labeling (see "Shallow Semantics")

Semantics

Semi-Supervised Learning

Shallow Semantics

Speech Recognition, Speech Synthesis

Summarization

  • Hal Daumé III and Daniel Marcu. Bayesian query-focused summarization. COLING/ACL, 2006.

Support Vector Machines, Kernels

Topic Modeling

Variational Methods

Word Sense Disambiguation (WSD) (see "Shallow Semantics")

 
nlp_papers.txt · Last modified: 2008/07/18 19:40 by benwing
 
Recent changes RSS feed Creative Commons License Donate Powered by PHP Valid XHTML 1.0 Valid CSS Driven by DokuWiki