Revision as of 02:57, 27 June 2025 edit Fgnievinski (talk \| contribs) Autopatrolled, Extended confirmed users 71,081 edits No edit summary Tags: Mobile edit Mobile web edit Advanced mobile edit ← Previous edit		Revision as of 20:33, 30 June 2025 edit undo Alenoach (talk \| contribs) Extended confirmed users 5,805 edits m link Tag: Visual edit Next edit →
Line 55: Systems that are expert or knowledge based are software systems that are made by experts. This system consists of a knowledge based encoding for the ___domain knowledge. This system is usually modeled as production rules, and someone uses this knowledge base which the user can question the system for knowledge. In expert systems, the language and explanations are understood with an explanation for the reasoning or a problem solving activity.<ref name="auto"/> However, these techniques are not very suitable for [[Language model\|language models]] like [[Generative pre-trained transformer\|generative pretrained transformers]]. Since these models generate language, they can provide an explanation, but which may not be reliable. Other techniques include [[Attention (machine learning)\|attention]] analysis (examining how the model focuses on different parts of the input), probing methods (testing what information is captured in the model's representations), causal tracing (tracing the flow of information through the model) and circuit discovery (identifying specific subnetworks responsible for certain behaviors). Explainability research in this area overlaps significantly with interpretability and [[AI alignment\|alignment]] research.<ref>{{cite arXiv \|last1=Luo \|first1=Haoyan \|title=From Understanding to Utilization: A Survey on Explainability for Large Language Models \|date=2024-02-21 \|eprint=2401.12874 \|last2=Specia \|first2=Lucia\|class=cs.CL }}</ref> === Interpretability ===

Explainable artificial intelligence: Difference between revisions