User:Jeblad/Natural language processing

Natural language processing: Notes about stemming, lemmatization and morphology

This is a collection of notes about stemming, lemmatization and morphology that might be useful for automatic text synthesis within the MediaWiki environment. The collection may eventually grow large enough to serve other purposes too, but for the moment it is only a loosely knit set of personal notes. It also reflects my own understanding of the problems, which may be skewed, flawed or even dangerously wrong.

The notes are structured as a kind of micro lexicon, as I have had some success reusing such notes in other projects.

Todo

 * Conversational agent
 * Transitive
 * Ditransitive
 * Object
 * Action
 * Syntax
 * Morphology
 * Phonetics
 * Phonology
 * Synthesis
 * Machine synthesis
 * Inference
 * Definition
 * Factoid
 * Translation
 * Machine translation
 * Discourse
 * Pragmatic
 * Dialogue
 * Dialogue system
 * Resolve
 * Part of speech
 * Tagging
 * Ambiguous
 * Disambiguate
 * Word sense disambiguation
 * Lexical disambiguation
 * Syntactic disambiguation
 * Probabilistic parsing
 * Speech act interpretation
 * State machine
 * States
 * Transitions
 * Inputs
 * Rule system
 * Logic
 * Probabilistic model
 * Vector-space model
 * State-space search
 * Dynamic programming
 * Machine learning
 * Classifier
 * Expectation-Maximization (EM)
 * Finite state automata
 * Finite state transducers
 * Deterministic
 * Non-deterministic
 * Regular grammars
 * Regular relations
 * Context-free grammars
 * Feature-augmented grammars
 * First-order logic
 * Predicate calculus
 * Lambda calculus
 * Feature structures
 * Semantic primitives
 * Semantics
 * Pragmatics
 * Non-logical lexical semantics
 * Weighted automaton (Markov model)
 * Hidden Markov model (HMM)
 * Word meaning