Exploratory data analysis: Difference between revisions

Content deleted Content added
Alexbubuu (talk | contribs)
I added a figure showing exploratory data analysis
Line 80:
== Example ==
 
Findings from EDA are orthogonal to the primary analysis task. To illustrate, consider an example from Cook et al. where the analysis task is to find the variables which best predict the tip that a dining party will give to the waiter.<ref>[[Dianne Cook (statistician)|Cook, D.]] and [[Deborah F. Swayne|Swayne, D.F.]] (with A. Buja, D. Temple Lang, H. Hofmann, H. Wickham, M. Lawrence) (2007) ″Interactive"Interactive and Dynamic Graphics for Data Analysis: With R and GGobi″GGobi" Springer, 978-0387717616</ref> The variables available in the data collected for this task are: the tip amount, total bill, payer gender, smoking/non-smoking section, time of day, day of the week, and size of the party. The primary analysis task is approached by fitting a regression model where the tip rate is the response variable. The fitted model is
 
: ([[tip rate]]) = 0.18 - 0.01 × (party size)