-
Notifications
You must be signed in to change notification settings - Fork 16
Open
Labels
enhancementNew feature or requestNew feature or request
Description
For the statistical refresher part:
What is a good p-value histogram? It should have a high peak on the left suggesting that you are comparing two conditions that have different distributions.
If distribution is uniform then no difference between your experimental conditions being tested.
Perhaps also split episode 02 into "statistical refresher" and a new episode termed 03 "statistics applied to RNA-seq"
For episode 02:
- population and sample notions
- simulate two populations from two countries with different heights.
- draw a sample + increase size of sample and make average + sd estimations.
- Case 1 = identical populations (= same country)
- draw N samples of similar size. Say N = 5, N = 10 groups or N = 10,000 groups.
- perform a t test for each of these three group sizes.
- draw a p-value histogram for these 3 group sizes.
- FDR procedure to control for type I error = false positives.
- Case 2 = different populations
- draw N samples of similar size. Say N = 5, N = 10 groups or N = 10,000 groups.
- perform a t test for each of these three group sizes.
- draw a p-value histogram for these 3 group sizes.
- FDR procedure to control for type I error = false positives.
- Type I error and type II error.
- FDR procedure
- Power
For episode 03 = application to RNA-seq
- maximise biological replicate sample numbers to increase statistical power.
- talk about sample sequencing depth = rarefaction curve.
- p-values histogram profiles and what to do about it.
Useful links
https://www.bioconductor.org/packages/release/bioc/html/qvalue.html
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
enhancementNew feature or requestNew feature or request