The '''factored language model''' ('''FLM''') is an extension of a conventional [[language model]] introduced by Jeff Bilmes and Katrin Kirchhoff in 2003. In an FLM, each word is viewed as a vector of ''k'' factors: <math>w_i = \{f_i^1, ..., f_i^k\}</math>. An FLM provides the probabilistic model <math>P(f|f_1, ..., f_N)</math>, where the prediction of a factor <math>f</math> is based on <math>N</math> parents <math>\{f_1, ..., f_N\}</math>. For example, if <math>w</math> represents a word token and <math>t</math> represents a [[part of speech]] tag for English, the expression <math>P(w_i|w_{i-2}, w_{i-1}, t_{i-1})</math> gives a model for predicting the current word token based on a traditional [[N-gram]] model as well as the [[part of speech]] tag of the previous word.
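The conditional model above can be illustrated with a minimal count-based sketch. The toy corpus, the factor pair (surface form, part-of-speech tag), and the function names are hypothetical; a real FLM would be trained on a large tagged corpus with smoothing:

```python
from collections import defaultdict

# Hypothetical toy data: each word token is a vector of two factors,
# (surface form, part-of-speech tag).
corpus = [
    ("the", "DET"), ("cat", "NOUN"), ("sat", "VERB"),
    ("the", "DET"), ("dog", "NOUN"), ("sat", "VERB"),
]

# Count contexts (w_{i-2}, w_{i-1}, t_{i-1}) and full events that
# additionally include the predicted word w_i.
context_counts = defaultdict(int)
event_counts = defaultdict(int)
for i in range(2, len(corpus)):
    w2, w1, t1 = corpus[i - 2][0], corpus[i - 1][0], corpus[i - 1][1]
    wi = corpus[i][0]
    context_counts[(w2, w1, t1)] += 1
    event_counts[(w2, w1, t1, wi)] += 1

def p(wi, w2, w1, t1):
    """Maximum-likelihood estimate of P(w_i | w_{i-2}, w_{i-1}, t_{i-1})."""
    c = context_counts[(w2, w1, t1)]
    return event_counts[(w2, w1, t1, wi)] / c if c else 0.0

print(p("sat", "the", "cat", "NOUN"))  # 1.0 in this toy corpus
```

The point of the factored form is that the context mixes factors of different types (here word tokens and a tag), rather than consisting only of previous words as in a plain n-gram model.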
A major advantage of factored language models is that they allow users to specify linguistic knowledge such as the relationship between word tokens and [[Part of speech]] in English, or morphological information (stems, root, etc.) in Arabic.
Like [[N-gram]] models, smoothing techniques are necessary in parameter estimation. In particular, generalized back-off is used in training an FLM.
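The idea of backing off can be sketched as follows. This is a simplified, hypothetical illustration with made-up counts: when the full context has been seen often enough, a discounted maximum-likelihood estimate is used; otherwise one parent is dropped and the procedure recurses. In generalized back-off proper, the choice of ''which'' parent to drop is governed by a back-off graph over the factors; here the most distant parent is simply dropped first:

```python
from collections import defaultdict

# Hypothetical event counts, keyed by (parents..., child).
counts = defaultdict(int)
counts[("the", "cat", "sat")] = 3  # c(w_{i-2}=the, w_{i-1}=cat, w_i=sat)
counts[("the", "cat")] = 3         # c(w_{i-2}=the, w_{i-1}=cat)
counts[("cat", "sat")] = 4         # c(w_{i-1}=cat, w_i=sat)
counts[("cat",)] = 5               # c(w_{i-1}=cat)
counts[("sat",)] = 4               # c(w_i=sat)
counts[()] = 20                    # total number of tokens

def backoff_prob(child, parents, threshold=2, discount=0.8):
    """Sketch of back-off: use a discounted ML estimate when the full
    context is frequent enough; otherwise drop a parent and recurse.
    The threshold and flat discount are illustrative only."""
    if not parents:
        return counts[(child,)] / counts[()]
    full = tuple(parents) + (child,)
    if counts[full] >= threshold:
        return discount * counts[full] / counts[tuple(parents)]
    return backoff_prob(child, parents[1:], threshold, discount)

print(backoff_prob("sat", ("the", "cat")))  # frequent context: 0.8
print(backoff_prob("sat", ("the", "dog")))  # unseen context, backs off: 0.2
```

A flat discount as used here does not yield a properly normalized distribution; practical FLM training uses principled discounting (e.g. Kneser-Ney-style methods) and may combine several back-off paths in parallel.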