Revision as of 06:14, 7 June 2025 edit 125.253.110.64 (talk) →External links ← Previous edit		Revision as of 14:24, 29 June 2025 edit undo 240e:47d:1af9:d214:ed15:eb2a:1a4c:dcd9 (talk) →Use: Intel Core Ultra and AMD Ryzen AI are included NPU. Next edit →
Line 7: Their purpose is either to efficiently execute already trained AI models (inference) or to train AI models. Their applications include [[algorithm]]s for [[robotics]], [[Internet of things]], and [[data (computing)\|data]]-intensive or sensor-driven tasks.<ref>{{cite web \|url=https://www.eetimes.com/google-designing-ai-processors/ \|title=Google Designing AI Processors\|date=May 18, 2016 }} Google using its own AI accelerators.</ref> They are often [[Manycore processor\|manycore]] designs and focus on [[precision (computer science)\|low-precision]] arithmetic, novel [[dataflow architecture]]s, or [[in-memory computing]] capability. {{As of\|2024}}, a typical AI [[integrated circuit]] chip [[transistor count\|contains tens of billions]] of [[MOSFET]]s.<ref>{{cite web\|url=https://www.datacenterdynamics.com/en/news/nvidia-reveals-new-hopper-h100-gpu-with-80-billion-transistors/\|title=Nvidia reveals new Hopper H100 GPU, with 80 billion transistors\|last=Moss\|first=Sebastian\|date=2022-03-23\|website=Data Center Dynamics\|access-date=2024-01-30}}</ref> AI accelerators are used in mobile devices such as Apple [[iPhone]]s and [[Huawei]] ~~cellphones,~~smartphones<ref>{{Cite web\|url=https://consumer.huawei.com/en/press/news/2017/ifa2017-kirin970\|title=HUAWEI Reveals the Future of Mobile AI at IFA}}</ref>, and ~~personal~~seen ~~computers~~in ~~such~~many as[[Qualcomm]] and [[~~Intel~~Samsung]] ~~laptops~~smartphone processors,<ref>https://docs.qualcomm.com/bundle/publicresource/87-71408-1_REV_B_Snapdragon_8_gen_3_Mobile_Platform_Product_Brief.pdf</ref> some [[Intel]] <ref>{{Cite web\|url=https://www.intel.com/content/www/us/en/newsroom/news/intels-lunar-lake-processors-arriving-q3-2024.html\|title=Intel's Lunar Lake Processors Arriving Q3 2024\|website=Intel\|date=May 20, 2024 }}</ref> and [[AMD]] ~~laptops~~computer processors<ref>{{cite web\|title=AMD XDNA Architecture\|url=https://www.amd.com/en/technologies/xdna.html}}</ref>, and [[Apple silicon]] [[Mac (computer)\|Macs]].<ref>{{Cite web \|title=Deploying Transformers on the Apple Neural Engine \|url=https://machinelearning.apple.com/research/neural-engine-transformers \|access-date=2023-08-24 \|website=Apple Machine Learning Research \|language=en-US}}</ref> Accelerators are used in [[cloud computing]] servers, including [[tensor processing unit]]s (TPU) in [[Google Cloud Platform]]<ref>{{Cite journal\|date=2017-06-24\|title=In-Datacenter Performance Analysis of a Tensor Processing Unit\|journal=ACM SIGARCH Computer Architecture News\|volume=45\|issue=2\|pages=1–12\|language=EN\|doi=10.1145/3140659.3080246\|doi-access=free \|last1=Jouppi \|first1=Norman P. \|last2=Young \|first2=Cliff \|last3=Patil \|first3=Nishant \|last4=Patterson \|first4=David \|last5=Agrawal \|first5=Gaurav \|last6=Bajwa \|first6=Raminder \|last7=Bates \|first7=Sarah \|last8=Bhatia \|first8=Suresh \|last9=Boden \|first9=Nan \|last10=Borchers \|first10=Al \|last11=Boyle \|first11=Rick \|last12=Cantin \|first12=Pierre-luc \|last13=Chao \|first13=Clifford \|last14=Clark \|first14=Chris \|last15=Coriell \|first15=Jeremy \|last16=Daley \|first16=Mike \|last17=Dau \|first17=Matt \|last18=Dean \|first18=Jeffrey \|last19=Gelb \|first19=Ben \|last20=Ghaemmaghami \|first20=Tara Vazir \|last21=Gottipati \|first21=Rajendra \|last22=Gulland \|first22=William \|last23=Hagmann \|first23=Robert \|last24=Ho \|first24=C. Richard \|last25=Hogberg \|first25=Doug \|last26=Hu \|first26=John \|last27=Hundt \|first27=Robert \|last28=Hurt \|first28=Dan \|last29=Ibarz \|first29=Julian \|last30=Jaffey \|first30=Aaron \|display-authors=1 \|arxiv=1704.04760 }}</ref> and [[Trainium]] and [[Inferentia]] chips in [[Amazon Web Services]].<ref>{{cite web \| title = How silicon innovation became the 'secret sauce' behind AWS's success\| website = Amazon Science\| date = July 27, 2022\| url = https://www.amazon.science/how-silicon-innovation-became-the-secret-sauce-behind-awss-success\| access-date = July 19, 2024}}</ref> Many vendor-specific terms exist for devices in this category, and it is an [[emerging technologies\|emerging technology]] without a [[dominant design]]. [[Graphics processing units]] designed by companies such as [[Nvidia]] and [[AMD]] often include AI-specific hardware, and are commonly used as AI accelerators, both for [[Machine learning\|training]] and [[Inference engine\|inference]].<ref>{{cite web\| last1 = Patel\| first1 = Dylan\| last2 = Nishball\| first2 = Daniel\| last3 = Xie\| first3 = Myron\| title = Nvidia's New China AI Chips Circumvent US Restrictions\| url=https://www.semianalysis.com/p/nvidias-new-china-ai-chips-circumvent\| website = SemiAnalysis\| date=2023-11-09\| access-date=2024-02-07}}</ref> All models of Intel [[Meteor Lake]] processors have a built-in ''versatile processor unit'' (''VPU'') for accelerating [[statistical inference\|inference]] for computer vision and deep learning.<ref>{{Cite web\|url=https://www.pcmag.com/news/intel-to-bring-a-vpu-processor-unit-to-14th-gen-meteor-lake-chips\|title=Intel to Bring a 'VPU' Processor Unit to 14th Gen Meteor Lake Chips\|website=PCMAG\|date=August 2022 }}</ref>

Neural processing unit: Difference between revisions