User:Jeblad/Natural language processing

Natural language processing Notes about stemming, lemmatization and morphology

This is a collection (termbase) of terms and notes about stemming, lemmatization and morphology that might be useful for automatic text synthesis within the Mediawiki environment. Perhaps the collection will be large enough to be useful for other uses too, but for the moment it is only a loosely knit collection of personal notes. It is also influenced by my own understanding of the problems, which may be skewed, flawed or even outright dangerously wrong.

Terms
Complete termbase
 * Turing test
 * /Conversational agent
 * Transitive
 * Ditransitive
 * Object
 * Action
 * Syntax
 * Morphology
 * /Generative grammar
 * Phonetics
 * Phonology
 * Synthesis
 * Machine synthesis
 * Inference
 * Definition
 * Factoid
 * Translation
 * Machine translation
 * /Coreference resolution
 * Discourse
 * Pragmatic
 * Dialogue
 * Dialogue system
 * Resolve
 * Part of speech
 * Tagging
 * Ambiguous
 * Disambiguate
 * Word sense disambiguation
 * Lexical disambiguation
 * Syntactic disambiguation
 * Probabilistic parsing
 * Speech act interpretation
 * State machine
 * States
 * Transitions
 * Inputs
 * Rule system
 * Logic
 * Probabilistic model
 * Vector-space model
 * State-space search
 * Dynamic programming
 * Machine learning
 * Sequence models
 * Classifier
 * Decision trees
 * Support vector machines
 * Gaussian mixture models
 * Logistic regression
 * Expectation-Maximization (EM)
 * Finite state automata
 * Finite state transducers
 * Deterministic
 * Non-deterministic
 * /Symbolic language processing
 * /Stochastic language processing
 * Regular grammars
 * Regular relations
 * Context-free grammars
 * Feature-augmented grammars
 * First-order logic
 * Predicate calculus
 * Lambda calculus
 * Feature structures
 * Semantic primitives
 * Semantics
 * Pragmatics
 * Non-logical lexical semantics
 * Weighted automaton (Markov model)
 * Hidden Markov model (HMM)
 * Maximum entropy Markov model
 * Conditional random fields
 * Word meaning
 * Depth-first search
 * Best-first search
 * A* search
 * Cross-validation
 * Entrophy