Revision as of 21:37, 3 May 2014 edit Hermel (talk \| contribs) Extended confirmed users, Pending changes reviewers 2,052 edits m →Example: avoid confusion: "iff" -> "if and only if" ← Previous edit		Revision as of 14:26, 18 May 2014 edit undo Phil Boswell (talk \| contribs) Administrators 40,568 edits m convert dodgy URL to ID using AWB Next edit →
Line 78: ==Extensions== ===Generating a parse tree=== The above algorithm is a [[recognizer]] that will only determine if a sentence is in the language. It is simple to extend it into a [[parser]] that also construct a [[parse tree]], by storing parse tree nodes as elements of the array, instead of the boolean 1. The node is linked to the array elements that were used to produce it, so as to build the tree structure. Only one such node in each array element is needed if only one parse tree is to be produced. However, if all parse trees of an ambiguous sentence are to be kept, it is necessary to store in the array element a list of all the ways the corresponding node can be obtained in the parsing process. This is sometimes done with a second table B[n,n,r] of so-called ''backpointers''. Line 91 ⟶ 92: ===Valiant's algorithm=== The [[Analysis of algorithms\|worst case running time]] of CYK is <math>\Theta(n^3 \cdot \|G\|)</math>, where ''n'' is the length of the parsed string and ''\|G\|'' is the size of the CNF grammar ''G''. This makes it one of the most efficient algorithms for recognizing general context-free languages in practice. {{harvtxt\|Valiant\|1975}} gave an extension of the CYK algorithm. His algorithm computes the same parsing table as the CYK algorithm; yet he showed that [[Matrix multiplication#Algorithms for efficient matrix multiplication\|algorithms for efficient multiplication]] of [[Boolean matrix\|matrices with 0-1-entries]] can be utilized for performing this computation. Using the [[Coppersmith–Winograd algorithm]] for multiplying these matrices, this gives an asymptotic worst-case running time of <math>O(n^{2.38} \cdot \|G\|)</math>. However, the constant term hidden by the [[Big O Notation]] is so large that the Coppersmith–Winograd algorithm is only worthwhile for matrices that are too large to handle on present-day computers {{harv\|Knuth\|1997}}, and this approach requires subtraction and so is only suitable for recognition. The dependence on efficient matrix multiplication cannot be avoided altogether: {{harvtxt\|Lee\|2002}} has proved that any parser for context-free grammars working in time <math>O(n^{3-\varepsilon} \cdot \|G\|)</math> can be effectively converted into an algorithm computing the product of <math>(n \times n)</math>-matrices with 0-1-entries in time <math>O(n^{3 - \varepsilon/3})</math>. Line 102 ⟶ 103: ==References== {{Reflist}} * [[{{citation \| authorlink1 = John Cocke]] ~~and~~\| first1 = John \| last1 = Cocke \| first2 = Jacob T. \| last2 = Schwartz (\| year = 1970). \| title = Programming languages and their compilers: Preliminary notes. \| journal = Technical report, \| publisher = [[Courant Institute of Mathematical Sciences]], [[New York University]]. }} * [[{{citation \| authorlink1 = Tadao Kasami \| first1 = T. \| last1 = Kasami]] (\| year = 1965). \| title = An efficient recognition and syntax-analysis algorithm for context-free languages. \| journal = Scientific report AFCRL-65-758, \| publisher = Air Force Cambridge Research Lab, \| ___location = [[Bedford, MA]]. }} * {{citation \| first = Daniel H. \| last = Younger (\| year = 1967). \| title = Recognition and parsing of context-free languages in time ''n''<sup>3</sup>. ''\| journal = Information and Control'' \| volume = 10( \| issue = 2): ~~189–208.~~\| pages = 189–208 }} * {{citation \|last=Knuth \|first=Donald E. \|authorlink=Donald E. Knuth \|title=The Art of Computer Programming Volume 2: Seminumerical Algorithms \|publisher=Addison-Wesley Professional \|edition=3rd \|date=November 14, 1997 \|isbn=978-0-201-89684-8 \|pages=501 }} * {{Citation Line 116 ⟶ 117: \| volume=8 \| url=http://www.informatica-didactica.de/cmsmadesimple/index.php?page=LangeLeiss2009 ~~\| place=[http://www.informatica-didactica.de/cmsmadesimple/uploads/Artikel/LangeLeiss2009/LangeLeiss2009.pdf pdf]~~ }} {{Citation Line 141: }} {{citation \|last=Valiant \|first=Leslie G. \|authorlink=Leslie G. Valiant \|title=General context-free recognition in less than cubic time \|journal=Journal of Computer and System Sciences \|volume=10 \|issue=2 \|year=1975 \|pages=308–314 }} *{{citation \|last=Lang\|first=Bernard\|title=Recognition can be harder than parsing\|journal=Computational Intelligence\|~~date~~year=1994\|volume=10\|issue=4\|pages=486–494\|~~url~~id =~~http://~~ {{citeseerx~~.ist.psu.edu/viewdoc/download?rep=rep1&type=pdf&doi=~~\|10.1.1.50.6982}} }} ==External links==

CYK algorithm: Difference between revisions