So, you need a statistically significant sample?(technology.stitchfix.com) |
So, you need a statistically significant sample?(technology.stitchfix.com) |
If you're designing a drug, you'd better be very careful not to accidentally approve something that is useless. It would cost a ton of money and lives in the long run if it was no better than placebo. False rejection of the null is very bad. False acceptance of the null is not so bad.
By contrast, if you're doing an A/B test on a website, you're actually not in bad shape if you accidentally think that a red button is a bit better than a blue button, assuming that they're pretty close. False rejection of the null is okay.
However you are screwed if you miss out on the chance that a red button gives you 50% more conversion. With websites, false acceptance of the null is very bad. It's okay to mistakenly think your button is effective but it's very bad to mistakenly think that the button is ineffective.
Websites have the opposite cost benefit calculation to science generally and shouldn't use the same parameters.
Is this necessarily true if you can sample from the population in a fair and unbiased way?
Yes, it's necessarily true. If your sample is small you are necessarily subject to large sampling error.
In essence: the individuals you happened to pick (even fairly) are overrepresented, and the rest are underrepresented.
For example, if you sample 1% of the population in a fair and unbiased way, that would tell you something with a much higher degree of confidence than if you sampled even 10% of the population in a biased way (or in a way such that you don't know whether you are biased or not).
For a sample of 100, it's 1%. For 1000, its 0.1%. The more opinions you can collect, the less they individually mean.
Whereas, if you instead take a sample of 1000, you typically get results between 61% and 69%.
Source: http://www.wolframalpha.com/input/?i=binomial%2810%2C0.65%29...
You can't take an "unbiased" sample in the sense that you mean here.
There's only one way you can take a completely unbiased sample: if you know exactly how everyone will vote in advance and select them carefully on that basis. But if you already know how everyone will vote in advance, then sampling is a fruitless exercise.