→Neuro-symbolic AI: integrating neural and symbolic approaches: added link to main article Neuro-symbolic AI |
(One intermediate revision by one other user not shown)
Line 4:
is the term for the collection of all methods in artificial intelligence research that are based on high-level [[physical symbol systems hypothesis|symbolic]] (human-readable) representations of problems, [[Formal logic|logic]] and [[search algorithm|search]].<ref>{{Cite journal|last1=Garnelo|first1=Marta|last2=Shanahan|first2=Murray|date=2019-10-01|title=Reconciling deep learning with symbolic artificial intelligence: representing objects and relations|journal=Current Opinion in Behavioral Sciences|language=en|volume=29|pages=17–23|doi=10.1016/j.cobeha.2018.12.010|s2cid=72336067 |doi-access=free|hdl=10044/1/67796|hdl-access=free}}</ref> Symbolic AI used tools such as [[logic programming]], [[production (computer science)|production rules]], [[semantic nets]] and [[frame (artificial intelligence)|frames]], and it developed applications such as [[knowledge-based systems]] (in particular, [[expert systems]]), [[symbolic mathematics]], [[automated theorem provers]], [[ontologies]], the [[semantic web]], and [[automated planning and scheduling]] systems. The Symbolic AI paradigm led to seminal ideas in [[Artificial intelligence#Search and optimization|search]], [[symbolic programming]] languages, [[Intelligent agent|agents]], [[multi-agent systems]], the [[semantic web]], and the strengths and limitations of formal knowledge and [[automated reasoning|reasoning systems]].
Symbolic AI was the dominant [[paradigm]] of AI research from the mid-1950s until the mid-1990s.{{sfn|Kolata|1982}} Researchers in the 1960s and the 1970s were convinced that symbolic approaches would eventually succeed in creating a machine with [[artificial general intelligence]] and considered this the ultimate goal of their field.<ref>{{Cite journal |
[[Artificial neural network|Neural networks]], a subsymbolic approach, had been pursued from the early days and reemerged strongly in 2012. Early examples are [[Frank Rosenblatt|Rosenblatt]]'s [[perceptron]] learning work, the [[backpropagation]] work of Rumelhart, Hinton and Williams,<ref>{{cite journal| doi = 10.1038/323533a0| issn = 1476-4687| volume = 323| issue = 6088| pages = 533–536| last1 = Rumelhart| first1 = David E.| last2 = Hinton| first2 = Geoffrey E.| last3 = Williams| first3 = Ronald J.| title = Learning representations by back-propagating errors| journal = Nature| date = 1986 | bibcode = 1986Natur.323..533R| s2cid = 205001834}}</ref> and work in [[convolutional neural network]]s by LeCun et al. in 1989.<ref>{{Cite journal| volume = 1| issue = 4| pages = 541–551| last1 = LeCun| first1 = Y.| last2 = Boser| first2 = B.| last3 = Denker| first3 = J.| last4 = Henderson| first4 = D.| last5 = Howard| first5 = R.| last6 = Hubbard| first6 = W.| last7 = Jackel| first7 = L.| title = Backpropagation Applied to Handwritten Zip Code Recognition| journal = Neural Computation| date = 1989| doi = 10.1162/neco.1989.1.4.541| s2cid = 41312633}}</ref> However, neural networks were not viewed as successful until about 2012: "Until Big Data became commonplace, the general consensus in the AI community was that the so-called neural-network approach was hopeless. Systems just didn't work that well, compared to other methods. ... A revolution came in 2012, when a number of people, including a team of researchers working with Hinton, worked out a way to use the power of [[GPUs]] to enormously increase the power of neural networks."{{sfn|Marcus |Davis|2019}} Over the next several years, [[deep learning]] had spectacular success in handling vision, [[speech recognition]], speech synthesis, image generation, and machine translation.
However, since 2020, as inherent difficulties with bias, explanation, comprehensibility, and robustness became more apparent with deep learning approaches, an increasing number of AI researchers have called for [[Neuro-symbolic AI|combining]] the best of both the symbolic and neural network approaches<ref name="Rossi">
Line 72:
to describe that high performance in a specific domain requires both general and highly domain-specific knowledge. Ed Feigenbaum and Doug Lenat called this The Knowledge Principle:
{{Blockquote
|text=(1) The Knowledge Principle: if a program is to perform a complex task well, it must know a great deal about the world in which it operates.<br/>(2) A plausible extension of that principle, called the Breadth Hypothesis: there are two additional abilities necessary for intelligent behavior in unexpected situations: falling back on increasingly general knowledge, and analogizing to specific but far-flung knowledge.<ref name="Knowledge Principle">{{Cite
==== Success with expert systems ====
Line 164:
==== Neuro-symbolic AI: integrating neural and symbolic approaches ====
{{Main|Neuro-symbolic AI}}
Neuro-symbolic AI attempts to integrate neural and symbolic architectures in a complementary manner that addresses the strengths and weaknesses of each, in order to support robust AI capable of reasoning, learning, and cognitive modeling. As argued by [[Leslie Valiant|Valiant]]{{sfn|Valiant|2008}} and many others,{{sfn|Garcez|Besold|De Raedt|Földiák|2015}} the effective construction of rich computational [[cognitive model]]s demands the combination of sound symbolic reasoning and efficient (machine) learning models. [[Gary Marcus]] similarly argues that "We cannot construct rich cognitive models in an adequate, automated way without the triumvirate of hybrid architecture, rich prior knowledge, and sophisticated techniques for reasoning",{{sfn|Marcus|2020|p=44}} and in particular:
Line 334 ⟶ 335:
Part of these disputes may be due to unclear terminology:
<blockquote>Turing award winner [[Judea Pearl]] offers a critique of machine learning which, unfortunately, conflates the terms machine learning and deep learning. Similarly, when Geoffrey Hinton refers to symbolic AI, the connotation of the term tends to be that of expert systems dispossessed of any ability to learn. The use of the terminology is in need of clarification. Machine learning is not confined to [[Association rule learning|association rule]] mining, c.f. the body of work on symbolic ML and relational learning (the differences to deep learning being the choice of representation, localist logical rather than distributed, and the non-use of [[gradient descent|gradient-based learning algorithms]]). Equally, symbolic AI is not just about [[Production system (computer science)|production rules]] written by hand. A proper definition of AI concerns [[knowledge representation and reasoning]], autonomous [[multi-agent system]]s, planning and [[Argumentation framework|argumentation]], as well as learning.{{sfn|Garcez|Lamb|2020|p=8}}</blockquote>From a theoretical perspective, the boundary between the advantages of connectionist AI and those of symbolic AI may not be as clear-cut as it appears. For instance, Heng Zhang and his colleagues have proved that mainstream knowledge representation formalisms are recursively isomorphic, provided they are universal or have equivalent expressive power.<ref>{{Cite journal |
=== Situated robotics: the world as a model ===
Line 492 ⟶ 493:
* {{Cite journal |doi=10.1093/mind/LIX.236.433 |issn=0026-4423 |volume=LIX |issue=236 |pages=433–460 |last=Turing |first=A. M. |title=I.—Computing Machinery and Intelligence |journal=Mind |accessdate=2022-09-14 |date=1950 |url=https://doi.org/10.1093/mind/LIX.236.433|url-access=subscription }}
* {{Cite book| pages = 415–422| last = Valiant| first = Leslie G| chapter= Knowledge Infusion: In Pursuit of Robustness in Artificial Intelligence| date = 2008 |editor1=Hariharan, R. |editor2=Mukund, M. |editor3=Vinay, V. |title=Foundations of Software Technology and Theoretical Computer Science (Bangalore)}}
* {{cite
| year=2017
| author1=Xifan Yao | author2=Jiajun Zhou | author3=Jiangming Zhang | author4=Claudio R. Boer
| title=2017 5th International Conference on Enterprise Systems (ES) | chapter=From Intelligent Manufacturing to Smart Manufacturing for Industry 4.0 Driven by Next Generation Artificial Intelligence and Further
|doi=10.1109/es.2017.58
| isbn=978-1-5386-0936-1 }}
[[Category:Artificial intelligence]]