Decision tree learning

'''Decision tree learning''' is a supervised learning approach used in [[statistics]], [[data mining]] and [[machine learning]]. In this formalism, a classification or regression [[decision tree]] is used as a [[predictive model]] to draw conclusions about a set of observations.
 
Tree models where the target variable can take a discrete set of values are called '''[[classification]] [[decision tree|trees]]'''; in these tree structures, [[leaf node|leaves]] represent class labels and branches represent [[Logical conjunction|conjunction]]s of features that lead to those class labels. Decision trees where the target variable can take continuous values (typically [[real numbers]]) are called '''[[regression analysis|regression]] [[decision tree|trees]]'''. More generally, the concept of a regression tree can be extended to any kind of object equipped with pairwise dissimilarities, such as categorical sequences.<ref name=":1">{{Cite journal |last1=Studer |first1=Matthias |last2=Ritschard |first2=Gilbert |last3=Gabadinho |first3=Alexis |last4=Müller |first4=Nicolas S. |date=2011 |title=Discrepancy Analysis of State Sequences |url=http://journals.sagepub.com/doi/10.1177/0049124111415372 |journal=Sociological Methods & Research |language=en |volume=40 |issue=3 |pages=471–510 |doi=10.1177/0049124111415372 |s2cid=13307797 |issn=0049-1241}}</ref>
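The structure described above, where each leaf holds a class label and each root-to-leaf path is a conjunction of feature tests, can be sketched as a small hand-coded classifier. This is a minimal illustration only; the feature names (<code>outlook</code>, <code>humidity</code>) and the tree itself are hypothetical, not taken from any learned model.

```python
# A tiny hand-coded classification tree. Each nested "if" is one feature
# test; a path of tests is a conjunction, and each "return" is a leaf
# carrying a class label. (Illustrative features only.)

def classify(sample: dict) -> str:
    """Predict a class label from the features 'outlook' and 'humidity'."""
    if sample["outlook"] == "sunny":      # first test on this branch
        if sample["humidity"] > 70:       # conjunction: sunny AND humid
            return "stay_in"              # leaf: class label
        return "play"                     # leaf: class label
    return "play"                         # leaf for all other outlooks

print(classify({"outlook": "sunny", "humidity": 85}))  # prints "stay_in"
```

A regression tree has the same shape, but its leaves return real-valued predictions instead of class labels.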
 
Decision trees are among the most popular machine learning algorithms given their intelligibility and simplicity.<ref>{{Cite journal|last1=Wu|first1=Xindong|last2=Kumar|first2=Vipin|last3=Ross Quinlan|first3=J.|last4=Ghosh|first4=Joydeep|last5=Yang|first5=Qiang|last6=Motoda|first6=Hiroshi|last7=McLachlan|first7=Geoffrey J.|last8=Ng|first8=Angus|last9=Liu|first9=Bing|last10=Yu|first10=Philip S.|last11=Zhou|first11=Zhi-Hua|date=2008-01-01|title=Top 10 algorithms in data mining|journal=Knowledge and Information Systems|language=en|volume=14|issue=1|pages=1–37|doi=10.1007/s10115-007-0114-2|s2cid=2367747|issn=0219-3116|hdl=10983/15329|hdl-access=free}}</ref>
* '''Non-parametric.''' Makes no assumptions about the training data or prediction residuals; e.g., no distributional, independence, or constant-variance assumptions.
* '''Performs well with large datasets.''' Large amounts of data can be analyzed using standard computing resources in reasonable time.
* '''Accuracy with flexible modeling.''' These methods have been applied to healthcare research, where their flexibility can yield increased predictive accuracy.<ref>{{Cite journal |last1=Hu |first1=Liangyuan |last2=Li |first2=Lihua |date=2022-12-01 |title=Using Tree-Based Machine Learning for Health Studies: Literature Review and Case Series |url=https://www.mdpi.com/1660-4601/19/23/16080 |journal=International Journal of Environmental Research and Public Health |language=en |volume=19 |issue=23 |pages=16080 |doi=10.3390/ijerph192316080 |issn=1660-4601 |pmc=9736500 |pmid=36498153 |doi-access=free}}</ref>
* '''Mirrors human decision making more closely than other approaches.'''<ref name=":0" /> This could be useful when modeling human decisions/behavior.
* '''Robust against collinearity''', particularly when boosting is used.
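The non-parametric claim above can be made concrete: a classification tree is grown by greedily choosing splits from class counts alone, with no distributional model of the data. The following is a minimal sketch of that split search using Gini impurity on a single numeric feature; it is an illustration, not any particular library's implementation.

```python
# Sketch of the greedy split search behind classification trees.
# Only class counts are used, so no distributional assumptions are needed.
from collections import Counter

def gini(labels):
    """Gini impurity of a collection of class labels."""
    n = len(labels)
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())

def best_threshold(xs, ys):
    """Threshold on one numeric feature minimising the weighted Gini
    impurity of the two child nodes it induces."""
    best = (float("inf"), None)
    for t in sorted(set(xs)):
        left = [y for x, y in zip(xs, ys) if x <= t]
        right = [y for x, y in zip(xs, ys) if x > t]
        if not left or not right:
            continue  # skip splits that leave a child empty
        score = (len(left) * gini(left) + len(right) * gini(right)) / len(ys)
        best = min(best, (score, t))
    return best[1]

xs = [1.0, 2.0, 3.0, 10.0, 11.0, 12.0]
ys = ["a", "a", "a", "b", "b", "b"]
print(best_threshold(xs, ys))  # prints 3.0 (a pure split of the classes)
```

A full tree learner applies this search recursively to each child node over all features; correlated features simply compete for the same splits, which is one intuition behind the robustness to collinearity noted above.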