Difference between revisions of "Talk:1347: t Distribution"

Explain xkcd: It's 'cause you're dumb.
Jump to: navigation, search
Line 19: Line 19:
 
The teacher's t-distribution looks like multiple spikier curves with different centres added together
 
The teacher's t-distribution looks like multiple spikier curves with different centres added together
 
and it doesn't fit the table. [[User:Wwt|Wwt]] ([[User talk:Wwt|talk]]) 13:17, 26 March 2014 (UTC)
 
and it doesn't fit the table. [[User:Wwt|Wwt]] ([[User talk:Wwt|talk]]) 13:17, 26 March 2014 (UTC)
 +
 +
 +
I took from it that the Students Distribution was too perfect, and real data would rarely yield those idealized results in a small sample size. That the teacher's distribution used actual numbers, with the occasional spikes. I took from the title text, the tendency of students, or anyone with pre-conceived notions, to keep redoing the test until they get the results they expect, in this case, the textbook result. [[Special:Contributions/173.245.55.71|173.245.55.71]] 13:25, 26 March 2014 (UTC)

Revision as of 13:25, 26 March 2014

http://en.m.wikipedia.org/wiki/Student%27s_t-test


173.245.50.73 05:20, 26 March 2014 (UTC)Adam


I think this is a comment of the quality of education today - it is difficult to grade students on a distribution curve and even more so when you take into account the distribution curve of the teachers ability. 108.162.249.205 (talk) (please sign your comments with ~~~~)

I noticed the teacher's curve is symmetrical, and after further inspection it could be interpreted as an edge detection: high values show where an edge occurs. The two highest peaks would nicely align with the edges of the paper, the next highest peaks fit the edges of the table, and the rest could be approximation artefacts, as they're equidistant and rather insignificant compared to those four. I'm not statistics pro, but maybe that rings someone's bells? 108.162.210.239 07:56, 26 March 2014 (UTC)

Interesting observation. It may play into an age-long legend told and re-told among the students that some teachers grade papers by tossing the whole pile in the air; those sheets that land on the teacher's desk get a pass, those falling to the floor get a fail. Sometimes the story gets modified in such a way that papers falling on the teacher's book (or other object) laying on the desk will get a higher marking than those simply hitting the desk. The latter version would explain the higher sheet-size-apart peaks. 108.162.210.111 08:57, 26 March 2014 (UTC)

To be more explicit, I think the sheet of paper represents some data. Cueball is not happy with the results of applying Student's t test, so ze is trying more complex tools in the hope of getting significance. -- TimMc / 173.245.52.27 11:51, 26 March 2014 (UTC)

Man, normally these explanations clear the comic right up for me, but I've read this one thrice now and I still can't figure out what a t-distribution is, much less a joke based on one. The only definition being a Wikipedia quote written in legalese doesn't help. So a t-distribution estimates...the probability of a population's average when there's unknown information?108.162.216.48 12:17, 26 March 2014 (UTC)

The unknown information is the sample size (class size, for example) and standard distribution (by how much, on average, is something going to vary from the mean). The unknown information is not "in the data".Jarod997 (talk) 12:28, 26 March 2014 (UTC)
Basically, if you have an underlying process that would produce samples with a Gaussian distribution with mean of 0, and stddev of 1, and then you pull a finite number of samples out of it, and do the usual "average" operation on those samples (i.e. sum them and divide by the number of samples) you would expect that that computed average would be close to zero. But it might not be! By chance the samples you pulled might mostly have been from the far right or left side of distribution and the average you got would be way off. Student's T distribution (for a certain number of samples, n) is basically "given that the underlying process a Gaussian with mean zero and stddev of 1, if I repeatedly take n samples from that distribution and compute the average of those samples to get an "estimated mean", this is how I expect that estimated mean to be distributed". Naturally, this is important in questions like "I took 100 samples and got an average of 0.02 -- does this mean that it is sensible to think that the mean of the underlying distribution is actually zero?"
Of course, most of the joke is that the distribution is named "Student's", which is not strongly dependent on the nature of the statistics. Vyzen (talk) 12:42, 26 March 2014 (UTC)

The teacher's t-distribution looks like multiple spikier curves with different centres added together and it doesn't fit the table. Wwt (talk) 13:17, 26 March 2014 (UTC)


I took from it that the Students Distribution was too perfect, and real data would rarely yield those idealized results in a small sample size. That the teacher's distribution used actual numbers, with the occasional spikes. I took from the title text, the tendency of students, or anyone with pre-conceived notions, to keep redoing the test until they get the results they expect, in this case, the textbook result. 173.245.55.71 13:25, 26 March 2014 (UTC)