Don't watch this! (A t test example where nearly everything I say is wrong)

26.01.2018
(Recorded in 2013, but misplaced and not released until now.) I work through an example of a one-sample t test on a mean, and (intentionally) make many false statements. Some of them might sound pretty reasonable. The lesson is: Get your statistics help from a reputable source!

Wrong statements, with explanations:

0:25: “Is the mean weight of cucumbers in my garden equal to 200 grams?” Not wrong, but it’s a bad example on a number of fronts:
-It’s not a question anybody would ask.
-The mean weight of cucumbers in the garden is not going to equal exactly 200 grams, and we know that going in.
-A t test might provide evidence on whether the true mean weight differs from 200 grams, but it’s not going to tell us that the true mean equals 200 grams.
-If I really wanted to know the true mean weight (of the cucumbers that currently exist in my garden), I could probably pick them all and find out the true value. Or perhaps measure them on the vine with minimal error.

0:55: “H_0: X bar = 200 grams.” Hypotheses never involve statistics or the values of statistics. They are statements about parameters (here, the true mean mu).

1:25: “H_a: X bar greater than 200 grams.” The choice of alternative must never be based on the current sample’s data. It’s cheating to pick the alternative based on the observed data. And again, hypotheses never involve statistics or the values of statistics.

1:40: “…we just pick alpha, our significance level, to be 0.05.” While this is fairly common practice, it usually doesn’t make any sense and definitely doesn’t make any sense here. Here, no decision needs to be made, and we would simply assess the strength of the evidence against H_0 using the p-value. (Some will disagree with me on this front.)

2:00: “I could have picked any cucumbers in my garden, so every cucumber had the same chance of being picked.” While it is true that I could have picked any set of 4 cucumbers in the garden, that doesn’t imply they all had the same chance of being selected.
2:04: “Yes, this was a simple random sample.” Even though there was likely some randomness involved, that doesn’t make it a simple random sample. A simple random sample (SRS) has a very specific meaning, one that almost surely does not apply here.

2:23: “Knowing what the first cucumber weighed tells me nothing about what the second cucumber weighs. They were weighed independently, and the observations are therefore independent.” This is just silly. Weighing the cucumbers separately has nothing to do with whether the observations are independent; that depends on how the cucumbers were selected.

2:37: “This is not an important question for a t test.” One of the assumptions of the one-sample t test is normality, and that assumption is very important for small sample sizes.

2:51: “The t statistic has a t distribution, and we won’t need to concern ourselves with normality.” The t statistic has a t distribution only if H_0 is true and the assumptions (including normality) hold.

3:00: “The t statistic is X bar - mu” The hypothesized mean (mu_0) is subtracted in the numerator of the test statistic, not the true mean mu (we don’t know mu, and if we did know it we wouldn’t be carrying out th
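
To make the corrections in the notes above concrete, here is a minimal Python sketch of how the calculation might look when set up properly. Several things in it are assumptions and not part of the video: the four weights are made up for illustration, the function names `one_sample_t` and `two_sided_p_value` are hypothetical, and the two-sided alternative is simply one choice that could be fixed before looking at the data. The hypotheses are stated about the true mean mu (H_0: mu = 200), the hypothesized value mu_0 is what gets subtracted in the numerator, and the p-value is read as a measure of evidence against H_0, not compared with 0.05.

```python
import math
import statistics


def one_sample_t(sample, mu_0):
    """Return (t statistic, degrees of freedom) for H_0: mu = mu_0."""
    n = len(sample)
    x_bar = statistics.mean(sample)      # sample mean (a statistic)
    s = statistics.stdev(sample)         # sample standard deviation (n - 1 divisor)
    standard_error = s / math.sqrt(n)
    # The hypothesized mean mu_0 is subtracted, not the unknown true mean mu.
    return (x_bar - mu_0) / standard_error, n - 1


def t_pdf(x, df):
    """Density of the t distribution with df degrees of freedom."""
    c = math.gamma((df + 1) / 2) / (math.sqrt(df * math.pi) * math.gamma(df / 2))
    return c * (1 + x * x / df) ** (-(df + 1) / 2)


def two_sided_p_value(t, df, steps=10_000):
    """Two-sided p-value, integrating the t density with Simpson's rule.

    steps must be even. The area between 0 and |t| is computed numerically,
    then doubled and subtracted from 1.
    """
    upper = abs(t)
    if upper == 0.0:
        return 1.0
    h = upper / steps
    total = t_pdf(0.0, df) + t_pdf(upper, df)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * t_pdf(i * h, df)
    area = total * h / 3
    return max(0.0, 1.0 - 2.0 * area)


# Hypothetical weights in grams; NOT the data from the video.
weights = [210.0, 190.0, 220.0, 200.0]
mu_0 = 200.0  # hypothesized value of the true mean mu under H_0

t_stat, df = one_sample_t(weights, mu_0)
p_value = two_sided_p_value(t_stat, df)
print(f"t = {t_stat:.3f} on {df} df, two-sided p-value = {p_value:.3f}")
```

The p-value from this sketch is only meaningful if the assumptions are reasonable: with a sample of 4, that means a simple random sample and an approximately normal population, neither of which the code can check.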
