Revision as of 03:23, 28 January 2024 edit WikiCleanerBot (talk \| contribs) Bots 1,007,735 edits m v2.05b - Bot T20 CW#61 - Fix errors for CW project (Reference before punctuation) Tag: WPCleaner ← Previous edit		Revision as of 09:40, 22 March 2024 edit undo N7729697 (talk \| contribs) 2 edits m add applications in deep rl Tags: Reverted Visual edit Next edit →
Line 1: {{Short description\|Machine learning that combines deep learning and reinforcement learning}} {{Machine learning}} '''Deep reinforcement learning''' ('''deep RL''') is a subfield of [[machine learning]] that combines [[reinforcement learning]] (RL) and [[deep learning]]. RL considers the problem of a computational agent learning to make decisions by trial and error. Deep RL incorporates deep learning into the solution, allowing agents to make decisions from unstructured input data without manual engineering of the [[state space]]. Deep RL algorithms are able to take in very large inputs (e.g. every pixel rendered to the screen in a video game) and decide what actions to perform to optimize an objective (e.g. maximizing the game score). Deep reinforcement learning has been used for a diverse set of applications including but not limited to [[robotics]], [[video game]]s, [[natural language processing]], [[computer vision]],<ref>{{Cite journal \|last=Le \|first=Ngan \|last2=Rathour \|first2=Vidhiwar Singh \|last3=Yamazaki \|first3=Kashu \|last4=Luu \|first4=Khoa \|last5=Savvides \|first5=Marios \|date=2022-04-01 \|title=Deep reinforcement learning in computer vision: a comprehensive survey \|url=https://doi.org/10.1007/s10462-021-10061-9 \|journal=Artificial Intelligence Review \|language=en \|volume=55 \|issue=4 \|pages=2733–2819 \|doi=10.1007/s10462-021-10061-9 \|issn=1573-7462\|arxiv=2108.11510 }}</ref> education, transportation, finance and [[Health care\|healthcare]].<ref name="francoislavet2018"/> == Overview == Line 31: In a subsequent project in 2017, [[AlphaZero]] improved performance on Go while also demonstrating they could use the same algorithm to learn to play [[chess]] and [[shogi]] at a level competitive or superior to existing computer programs for those games, and again improved in 2019 with [[MuZero]].<ref name="muzero"/> Separately, another milestone was achieved by researchers from [[Carnegie Mellon University]] in 2019 developing [[Pluribus (poker bot)\|Pluribus]], a computer program to play [[poker]] that was the first to beat professionals at multiplayer games of no-limit [[Texas hold 'em]]. [[OpenAI Five]], a program for playing five-on-five [[Dota 2]] beat the previous world champions in a demonstration match in 2019. Deep reinforcement learning has also been applied to many domains beyond games. In robotics, it has been used to let robots perform simple household tasks<ref name="levine2016"/> and solve a Rubik's cube with a robot hand.<ref name="openaihand"/><ref name="openaihandarxiv"/> Deep RL has also found sustainability applications, used to reduce energy consumption at data centers.<ref name="deepmindcooling" /> and Parallel [[Hybrid electric vehicle\|Hybrid Electric Vehicle]]<ref name=":0" />. Deep RL for [[autonomous driving]] is an active area of research in academia and industry.<ref name="neurips2021ml4ad" /> [[Loon_LLC\|Loon]] explored deep RL for autonomously navigating their high-altitude balloons.<ref name="loonrl" /> == Algorithms ==

Deep reinforcement learning: Difference between revisions