Editing 2400: Statistics

Jump to: navigation, search

Warning: You are not logged in. Your IP address will be publicly visible if you make any edits. If you log in or create an account, your edits will be attributed to your username, along with other benefits.

The edit can be undone. Please check the comparison below to verify that this is what you want to do, and then save the changes below to finish undoing the edit.
Latest revision Your text
Line 13: Line 13:
 
The main focus of the comic is a graph showing cases of COVID-19 versus time for two groups: one group was vaccinated and the other group was not. Graphs are ways to visualize data, and for real data indicate specific values. This graph seems to be based on [https://www.nejm.org/doi/full/10.1056/nejmoa2034577 the Pfizer vaccine's results]. The higher line ("placebo group") rises in a steep curve. The lower line ("vaccine group") follows the first for a bit but then levels out to a much slower rate of climb. Officially, a scientific assessment of the effectiveness of anything requires rigorous statistical analysis. This is particularly true in medical studies, where impacts of biology can be highly complex and subject to many factors, meaning that careful review of the data is necessary to confirm that an intervention was effective. The joke of this comic is that the intervention presented here is so ''obviously'' effective that it's obvious even to a layman with little understanding of the math. A few days after the vaccine was administered, cases in the vaccinated group essentially flatline, while cases in the placebo group continue to rise as a significant rate. The data is so "good", meaning that numbers for the treatment and control groups diverge so dramatically, that actual analysis becomes almost a formality: a glance at the chart would convince most people that the treatment is effective.  
 
The main focus of the comic is a graph showing cases of COVID-19 versus time for two groups: one group was vaccinated and the other group was not. Graphs are ways to visualize data, and for real data indicate specific values. This graph seems to be based on [https://www.nejm.org/doi/full/10.1056/nejmoa2034577 the Pfizer vaccine's results]. The higher line ("placebo group") rises in a steep curve. The lower line ("vaccine group") follows the first for a bit but then levels out to a much slower rate of climb. Officially, a scientific assessment of the effectiveness of anything requires rigorous statistical analysis. This is particularly true in medical studies, where impacts of biology can be highly complex and subject to many factors, meaning that careful review of the data is necessary to confirm that an intervention was effective. The joke of this comic is that the intervention presented here is so ''obviously'' effective that it's obvious even to a layman with little understanding of the math. A few days after the vaccine was administered, cases in the vaccinated group essentially flatline, while cases in the placebo group continue to rise as a significant rate. The data is so "good", meaning that numbers for the treatment and control groups diverge so dramatically, that actual analysis becomes almost a formality: a glance at the chart would convince most people that the treatment is effective.  
  
βˆ’
This comic was released one week after the FDA granted an emergency use authorization for the {{w|BNT162b2|Pfizer COVID-19 vaccine}}, and 8 days after results of its Phase 3 clinical trial were published in [https://www.nejm.org/doi/full/10.1056/nejmoa2034577 the ''New England Journal of Medicine'']. The document includes the following [https://www.nejm.org/na101/home/literatum/publisher/mms/journals/content/nejm/2020/nejm_2020.383.issue-27/nejmoa2034577/20210819/images/img_xlarge/nejmoa2034577_f3.jpeg chart].  The charts draw the integral of the incidence data rather than the data itself ("cumulative" rather than "rate"): this results in changes in disease rate towards the left side of the chart, being added into the data on the right side, amplifying their difference.  This technique for emphasizing the data is valid: the spread between the lines only continues to increase if the effect continues happening, such that the total spread at the right is proportional to the total effect the vaccine had.  The charts do not show any information on other possible variables.  Randall has described previously in his webcomics how very clear charts can be made to hide misleading data.  The linked graph does not leave the numbers out, and the numbers indicate the vaccine is 91% effective at preventing the disease (and a 95% chance of being between 85 and 95% efficient).  
+
This comic was released one week after the FDA granted an emergency use authorization for the {{w|BNT162b2|Pfizer COVID-19 vaccine}} was released, and 8 days after results of its Phase 3 clinical trial were published in [https://www.nejm.org/doi/full/10.1056/nejmoa2034577 the ''New England Journal of Medicine'']. The document includes the following [https://www.nejm.org/na101/home/literatum/publisher/mms/journals/content/nejm/2020/nejm_2020.383.issue-27/nejmoa2034577/20210819/images/img_xlarge/nejmoa2034577_f3.jpeg chart].  The charts draw the integral of the incidence data rather than the data itself ("cumulative" rather than "rate"): this results in changes in disease rate towards the left side of the chart, being added into the data on the right side, amplifying their difference.  This technique for emphasizing the data is valid: the spread between the lines only continues to increase if the effect continues happening, such that the total spread at the right is proportional to the total effect the vaccine had.  The charts do not show any information on other possible variables.  Randall has described previously in his webcomics how very clear charts can be made to hide misleading data.  The linked graph does not leave the numbers out, and the numbers indicate the vaccine is 91% effective at preventing the disease (and a 95% chance of being between 85 and 95% efficient).  
  
 
The advice here could be seen as the inverse of the "science tip" in [[2311: Confidence Interval]], in which the data was so ''bad'' that its error bars fell outside of the graph and were not shown. Also there's some association with [[1725: Linear Regression]] where the data is not so good that you don't need to perform linear analysis.
 
The advice here could be seen as the inverse of the "science tip" in [[2311: Confidence Interval]], in which the data was so ''bad'' that its error bars fell outside of the graph and were not shown. Also there's some association with [[1725: Linear Regression]] where the data is not so good that you don't need to perform linear analysis.

Please note that all contributions to explain xkcd may be edited, altered, or removed by other contributors. If you do not want your writing to be edited mercilessly, then do not submit it here.
You are also promising us that you wrote this yourself, or copied it from a public domain or similar free resource (see explain xkcd:Copyrights for details). Do not submit copyrighted work without permission!

To protect the wiki against automated edit spam, we kindly ask you to solve the following CAPTCHA:

Cancel | Editing help (opens in new window)