Symbolic artificial intelligence: Difference between revisions

Revision as of 03:30, 11 July 2025 by Bender235 (m →See also)
Latest revision as of 10:56, 16 August 2025 by 95.90.198.16 (→Neuro-symbolic AI: integrating neural and symbolic approaches: added link to main article Neuro-symbolic AI)
(One intermediate revision by one other user not shown)
Line 4:
is the term for the collection of all methods in artificial intelligence research that are based on high-level [[physical symbol systems hypothesis\|symbolic]] (human-readable) representations of problems, [[Formal logic\|logic]] and [[search algorithm\|search]].<ref>{{Cite journal\|last1=Garnelo\|first1=Marta\|last2=Shanahan\|first2=Murray\|date=2019-10-01\|title=Reconciling deep learning with symbolic artificial intelligence: representing objects and relations\|journal=Current Opinion in Behavioral Sciences\|language=en\|volume=29\|pages=17–23\|doi=10.1016/j.cobeha.2018.12.010\|s2cid=72336067 \|doi-access=free\|hdl=10044/1/67796\|hdl-access=free}}</ref> Symbolic AI used tools such as [[logic programming]], [[production (computer science)\|production rules]], [[semantic nets]] and [[frame (artificial intelligence)\|frames]], and it developed applications such as [[knowledge-based systems]] (in particular, [[expert systems]]), [[symbolic mathematics]], [[automated theorem provers]], [[ontologies]], the [[semantic web]], and [[automated planning and scheduling]] systems. The Symbolic AI paradigm led to seminal ideas in [[Artificial intelligence#Search and optimization\|search]], [[symbolic programming]] languages, [[Intelligent agent\|agents]], [[multi-agent systems]], the [[semantic web]], and the strengths and limitations of formal knowledge and [[automated reasoning\|reasoning systems]]. Symbolic AI was the dominant [[paradigm]] of AI research from the mid-1950s until the mid-1990s.{{sfn\|Kolata\|1982}} Researchers in the 1960s and the 1970s were convinced that symbolic approaches would eventually succeed in creating a machine with [[artificial general intelligence]] and considered this the ultimate goal of their field.<ref>{{Cite journal \|~~last~~last1=Newell \|~~first~~first1=Allen \|last2=Simon \|first2=Herbert A. 
\|date=1976-03-01 \|title=Computer science as empirical inquiry: symbols and search \|url=https://dl.acm.org/doi/10.1145/360018.360022 \|journal=Commun. ACM \|volume=19 \|issue=3 \|pages=113–126 \|doi=10.1145/360018.360022 \|issn=0001-0782}}</ref> An early boom, with early successes such as the [[Logic Theorist]] and [[Arthur Samuel (computer scientist)\|Samuel]]'s [[Arthur Samuel (computer scientist)\|Checkers Playing Program]], led to unrealistic expectations and promises and was followed by the first [[AI winter\|AI Winter]] as funding dried up.{{sfn\|Kautz\|2022\|pp=107-109}}{{sfn\|Russell \|Norvig\|2021\|p=19}} A second boom (1969–1986) occurred with the rise of expert systems, their promise of capturing corporate expertise, and an enthusiastic corporate embrace.{{sfn\|Russell \|Norvig\|2021\|pp=22-23}}{{sfn\|Kautz\|2022\|pp=109-110}} That boom, and some early successes, e.g., with [[XCON]] at [[Digital Equipment Corporation\|DEC]], was followed again by later disappointment.{{sfn\|Kautz\|2022\|pp=109-110}} Difficulties arose with knowledge acquisition, maintaining large knowledge bases, and brittleness in handling out-of-domain problems. 
Another, second, AI Winter (1988–2011) followed.{{sfn\|Kautz\|2022\|p=110}} Subsequently, AI researchers focused on addressing underlying problems in handling uncertainty and in knowledge acquisition.{{sfn\|Kautz\|2022\|pp=110-111}} Uncertainty was addressed with formal methods such as [[hidden Markov model]]s, [[Bayesian reasoning]], and [[statistical relational learning]].{{sfn\|Russell \|Norvig\|2021\|p=25}}{{sfn\|Kautz\|2022\|p=111}} Symbolic machine learning addressed the knowledge acquisition problem with contributions including [[Version space learning\|Version Space]], [[Leslie Valiant\|Valiant]]'s [[Probably approximately correct learning\|PAC learning]], [[Ross Quinlan\|Quinlan]]'s [[ID3 algorithm\|ID3]] [[decision-tree]] learning, [[Case-based reasoning\|case-based learning]], and [[inductive logic programming]] to learn relations.{{sfn\|Kautz\|2020\|pp=110-111}} [[Artificial neural network\|Neural networks]], a subsymbolic approach, had been pursued from early days and reemerged strongly in 2012. Early examples are [[Frank Rosenblatt\|Rosenblatt]]'s [[perceptron]] learning work, the [[backpropagation]] work of Rumelhart, Hinton and Williams,<ref>{{cite journal\| doi = 10.1038/323533a0\| issn = 1476-4687\| volume = 323\| issue = 6088\| pages = 533–536\| last1 = Rumelhart\| first1 = David E.\| last2 = Hinton\| first2 = Geoffrey E.\| last3 = Williams\| first3 = Ronald J.\| title = Learning representations by back-propagating errors\| journal = Nature\| date = 1986 \| bibcode = 1986Natur.323..533R\| s2cid = 205001834}}</ref> and work in [[convolutional neural network]]s by LeCun et al. 
in 1989.<ref>{{Cite journal\| volume = 1\| issue = 4\| pages = 541–551\| last1 = LeCun\| first1 = Y.\| last2 = Boser\| first2 = B.\| last3 = Denker\| first3 = J.\| last4 = Henderson\| first4 = D.\| last5 = Howard\| first5 = R.\| last6 = Hubbard\| first6 = W.\| last7 = Jackel\| first7 = L.\| title = Backpropagation Applied to Handwritten Zip Code Recognition\| journal = Neural Computation\| date = 1989\| doi = 10.1162/neco.1989.1.4.541\| s2cid = 41312633}}</ref> However, neural networks were not viewed as successful until about 2012: "Until Big Data became commonplace, the general consensus in the AI community was that the so-called neural-network approach was hopeless. Systems just didn't work that well, compared to other methods. ... A revolution came in 2012, when a number of people, including a team of researchers working with Hinton, worked out a way to use the power of [[GPUs]] to enormously increase the power of neural networks."{{sfn\|Marcus \|Davis\|2019}} Over the next several years, [[deep learning]] had spectacular success in handling vision, [[speech recognition]], speech synthesis, image generation, and machine translation. However, since 2020, as inherent difficulties with bias, explanation, comprehensibility, and robustness became more apparent with deep learning approaches, an increasing number of AI researchers have called for [[Neuro-symbolic AI\|combining]] the best of both the symbolic and neural network approaches<ref name="Rossi">
Line 72:
to describe that high performance in a specific domain requires both general and highly domain-specific knowledge. 
Ed Feigenbaum and Doug Lenat called this The Knowledge Principle:
{{Blockquote \|text=(1) The Knowledge Principle: if a program is to perform a complex task well, it must know a great deal about the world in which it operates.<br/>(2) A plausible extension of that principle, called the Breadth Hypothesis: there are two additional abilities necessary for intelligent behavior in unexpected situations: falling back on increasingly general knowledge, and analogizing to specific but far-flung knowledge.<ref name="Knowledge Principle">{{Cite ~~journal~~book\| last1=Lenat\| first1=Douglas B\| last2=Feigenbaum\| first2=Edward A \| title~~=On the thresholds of knowledge\| journal~~=Proceedings of the International Workshop on Artificial Intelligence for Industrial Applications\| chapter=On the thresholds of knowledge\| date=1988\| pages=291–300\| doi=10.1109/AIIA.1988.13308\| s2cid=11778085}}</ref>}}

==== Success with expert systems ====
Line 164:
==== Neuro-symbolic AI: integrating neural and symbolic approaches ====
{{Main\|Neuro-symbolic AI}}
Neuro-symbolic AI attempts to integrate neural and symbolic architectures in a manner that addresses strengths and weaknesses of each, in a complementary fashion, in order to support robust AI capable of reasoning, learning, and cognitive modeling. As argued by [[Leslie Valiant\|Valiant]]{{sfn\|Valiant\|2008}} and many others,{{sfn\|Garcez\|Besold\|De Raedt\|Földiák\|2015}} the effective construction of rich computational [[cognitive model]]s demands the combination of sound symbolic reasoning and efficient (machine) learning models. 
[[Gary Marcus]], similarly, argues that: "We cannot construct rich cognitive models in an adequate, automated way without the triumvirate of hybrid architecture, rich prior knowledge, and sophisticated techniques for reasoning.",{{sfn\|Marcus\|2020\|p=44}} and in particular:
Line 334 ⟶ 335:
Part of these disputes may be due to unclear terminology:
<blockquote>Turing award winner [[Judea Pearl]] offers a critique of machine learning which, unfortunately, conflates the terms machine learning and deep learning. Similarly, when Geoffrey Hinton refers to symbolic AI, the connotation of the term tends to be that of expert systems dispossessed of any ability to learn. The use of the terminology is in need of clarification. Machine learning is not confined to [[Association rule learning\|association rule]] mining, c.f. the body of work on symbolic ML and relational learning (the differences to deep learning being the choice of representation, localist logical rather than distributed, and the non-use of [[gradient descent\|gradient-based learning algorithms]]). Equally, symbolic AI is not just about [[Production system (computer science)\|production rules]] written by hand. A proper definition of AI concerns [[knowledge representation and reasoning]], autonomous [[multi-agent system]]s, planning and [[Argumentation framework\|argumentation]], as well as learning.{{sfn\|Garcez\|Lamb\|2020\|p=8}}</blockquote>
It is worth noting that, from a theoretical perspective, the boundary of advantages between connectionist AI and symbolic AI may not be as clear-cut as it appears. 
For instance, Heng Zhang and his colleagues have proved that mainstream knowledge representation formalisms are recursively isomorphic, provided they are universal or have equivalent expressive power.<ref>{{Cite journal \|~~last~~last1=Zhang \|~~first~~first1=Heng \|last2=Jiang \|first2=Guifei \|last3=Quan \|first3=Donghui \|date=2025-04-11 \|title=A Theory of Formalisms for Representing Knowledge \|url=https://ojs.aaai.org/index.php/AAAI/article/view/33674 \|journal=Proceedings of the AAAI Conference on Artificial Intelligence \|language=en \|volume=39 \|issue=14 \|pages=15257–15264 \|doi=10.1609/aaai.v39i14.33674 \|issn=2374-3468\|arxiv=2412.11855 }}</ref> This finding implies that there is no fundamental distinction between using symbolic or connectionist knowledge representation formalisms for the realization of [[artificial general intelligence]] (AGI). Moreover, the existence of recursive isomorphisms suggests that different technical approaches can draw insights from one another. From this perspective, it seems unnecessary to overemphasize the advantages of any single technical school; instead, mutual learning and integration may offer the most promising path toward the realization of AGI.

=== Situated robotics: the world as a model ===
Line 492 ⟶ 493:
* {{Cite journal \|doi=10.1093/mind/LIX.236.433 \|issn=0026-4423 \|volume=LIX \|issue=236 \|pages=433–460 \|last=Turing \|first=A. M. \|title=I.—Computing Machinery and Intelligence \|journal=Mind \|accessdate=2022-09-14 \|date=1950 \|url=https://doi.org/10.1093/mind/LIX.236.433\|url-access=subscription }}
* {{Cite book\| pages = 415–422\| last = Valiant\| first = Leslie G\| chapter= Knowledge Infusion: In Pursuit of Robustness in Artificial Intelligence\| date = 2008 \|editor1=Hariharan, R. \|editor2=Mukund, M. \|editor3=Vinay, V. 
\|title=Foundations of Software Technology and Theoretical Computer Science (Bangalore)}}
* {{cite ~~conference~~book \| year=2017 \| author1=Xifan Yao \| author2=Jiajun Zhou \| author3=Jiangming Zhang \| author4=Claudio R. Boer \| title=2017 5th International Conference on Enterprise Systems (ES) \| chapter=From Intelligent Manufacturing to Smart Manufacturing for Industry 4.0 Driven by Next Generation Artificial Intelligence and Further On \| pages=311–318 \|publisher=IEEE ~~\|publisher=IEEE~~ ~~\|conference=2017 5th International Conference on Enterprise Systems (ES)~~ \|doi=10.1109/es.2017.58 \| isbn=978-1-5386-0936-1 }}
}}
[[Category:Artificial intelligence]]