How to defend yourself against misleading statistics in the news | Sanne Blauw | TEDxMaastricht

TEDx Talks
3 Nov 201616:20
EducationalLearning
32 Likes 10 Comments

TLDRThe speaker, a self-proclaimed 'number nerd,' delves into the world of statistics and their impact on our perception of reality. They highlight the importance of accurate data, as numbers featured in news stories can dictate our choices regarding health, economy, and politics. The talk exposes five common statistical deceptions found in the media: misleading graphs with dual scales, biased or exaggerated poll results, overconfident presentation of precise numbers, the misinterpretation of spectacular statistics without context, and the dangerous confusion between correlation and causation. By understanding these pitfalls, the speaker encourages statistical literacy as a form of self-defense to prevent the misuse of data from harming individuals and society.

Takeaways
  • πŸ“ˆ **Misleading Graphs**: Graphs can be manipulated to present a misleading picture, such as using two different scales to exaggerate differences.
  • πŸ€’ **Health Statistics**: Be cautious with health-related statistics; a study might present shocking numbers but could be based on a small or non-representative sample.
  • πŸ“Š **Pollution in Polls**: Polls can be skewed by low response rates and broad definitions of terms, leading to potentially exaggerated or misleading results.
  • πŸ”’ **Decimal Deception**: Numbers with many decimal places can appear precise but often hide the complexity and uncertainty in data collection and analysis.
  • πŸ“‰ **Economic Fluctuations**: Economic indicators like GDP growth can fluctuate significantly due to the complexity of their calculation and the time it takes to finalize figures.
  • πŸ– **Risk Representation**: The representation of risk statistics needs context; a 20% increase in risk might not be as significant as it sounds without understanding the baseline risk.
  • 🧠 **Correlation vs. Causation**: Just because two phenomena occur together does not mean one causes the other; this is a common mistake that can lead to incorrect conclusions.
  • πŸ“Š **Graphs in Context**: Graphs and statistics are more meaningful when presented with the right context; without it, they can be misinterpreted.
  • 🚨 **Actionable Statistics**: Be wary of statistics that prompt action without proper understanding, as they can lead to harmful decisions based on incomplete or misleading data.
  • 🌟 **Spectacular Statistics**: Spectacular or shocking statistics should be taken with a grain of salt, as they might be designed to attract attention rather than inform.
  • βš–οΈ **Statistical Self-Defense**: It's important to be critical of the statistics we encounter in the news and to understand the methods and margins of error behind them to avoid being misled.
Q & A
  • What is the significance of numbers in shaping our worldview according to the speaker?

    -Numbers are crucial because they inform our decisions on various aspects of life, including diet, healthcare, economic conditions, and political choices. They can also misinform if not accurate or presented correctly.

  • What is the first statistical lie discussed in the transcript?

    -The first statistical lie is the 'good-looking graph,' where graphs may appear too good to be true due to the use of different scales to exaggerate trends or effects.

  • What was the issue with the Planned Parenthood graph presented in Congress?

    -The issue was that the graph used two different scales for the two lines it depicted, making it seem as though abortions were increasing dramatically while cancer screenings were decreasing, which was misleading when viewed on a single scale.

  • What are the problems with the sexual assault poll mentioned in the New York Times?

    -The problems included a very small sample of universities, a low response rate from the women surveyed, and a broad definition of sexual assault that could include minor incidents.

  • Why is it important to be cautious about precise numbers with many decimal points?

    -Precise numbers can create a false sense of accuracy and objectivity. For example, GDP growth figures are often revised as better data becomes available, which can significantly alter economic interpretations and policy decisions.

  • What is the potential issue with spectacular statistics that seem impressive out of context?

    -Spectacular statistics can be misleading if taken out of context. For instance, a 20% increased risk may sound significant, but it might only represent a small increase in actual probability, especially if the baseline risk is low.

  • What is the danger of assuming correlation implies causation?

    -The danger is that one might take action based on a false cause-and-effect relationship. This can lead to misguided policies, social biases, and ineffective or harmful interventions.

  • Why did the speaker say that the mid-80s saw an increase in reported brain tumors?

    -The increase was likely due to the wider availability of MRI scanners, which made it possible to detect tumors that previously went unnoticed, rather than a real increase caused by a specific artificial sweetener as some suggested.

  • What is the speaker's profession and how does it relate to the topic of the talk?

    -The speaker is a journalist who writes about the value of data and statistics. This background gives the speaker authority to discuss the misuse of statistics in the media and the importance of statistical literacy.

  • What is the main message the speaker wants the audience to take away from the talk?

    -The speaker wants the audience to be aware of common statistical lies and to practice statistical self-defense to ensure that they interpret numerical data correctly and make informed decisions.

  • Why did the speaker mention the example of drug-related arrests in the United States?

    -The example was used to illustrate the potential harm of assuming correlation implies causation. The higher rate of drug-related arrests among black Americans does not mean that being black causes criminal behavior; it could be a result of systemic racism in policing practices.

  • What is the importance of understanding the five common statistical lies?

    -Understanding these lies is important to prevent misinformed decision-making, societal harm, and to ensure that statistical data is used responsibly and ethically in shaping public opinion and policy.

Outlines
00:00
πŸ“Š The Misrepresentation of Data in Graphs

The first paragraph introduces the topic of statistical misinformation in the media. The speaker, Song, a self-proclaimed 'number nerd,' begins by conducting a small survey to gauge the audience's familiarity with TED and their recent encounter with numbers in the news. The main focus is on how graphs can be manipulated to present misleading information. An example is given where a graph presented in Congress concerning Planned Parenthood's services is criticized for using two different scales to exaggerate the decline in cancer screenings and the rise in abortions. The paragraph emphasizes the importance of accurate data and the potential for numbers to be 'tortured' to support various narratives.

05:02
πŸ“‰ The Distortion of Polls and Precise Numbers

The second paragraph discusses the issues with polls and the overconfidence in precise numbers. It highlights how small sample sizes, low response rates, and broad definitions can skew poll results, as illustrated by a New York Times article that overstated the prevalence of sexual assault on campus. The paragraph also addresses the problem of decimal points giving a false sense of accuracy, using GDP growth rates as an example. It shows how initial estimates can change drastically over time as more accurate data becomes available, which can significantly impact economic decisions and policies.

10:06
πŸ– The Spectacle of Statistics and Correlation Fallacies

The third paragraph cautions against the dramatic presentation of statistics, or 'Tom Cruise of Statistics,' where context is key to understanding the real implications of data. It points out that sensational headlines, such as a 20% higher risk of bowel cancer from eating processed meat, may not be as significant as they seem when considering the actual increase in percentage terms. The paragraph also warns against mistaking correlation for causation, using the example of brain tumor rates increasing alongside the introduction of NutraSweet; however, the more plausible explanation is the increased use of MRI scanners that led to more tumors being detected.

15:06
βš–οΈ The Danger of Conflating Correlation with Causation

The fourth and final paragraph emphasizes the importance of not equating correlation with causation, which can lead to misguided actions and societal harm. It challenges the assumption that because black Americans are more likely to be arrested for drug-related offenses, they are inherently more criminal. The paragraph suggests that the disparity is likely due to systemic racism in policing rather than differences in drug use or sales among racial groups. It concludes with a call to be vigilant against statistical lies to prevent their negative impact on individuals and society.

Mindmap
Keywords
πŸ’‘Statistical Lies
Statistical lies refer to the misuse or misrepresentation of statistical data to deceive or mislead an audience. In the video, the speaker discusses five common types of statistical lies found in the media, emphasizing the importance of critical thinking when interpreting numerical data. These lies can distort reality and lead to incorrect conclusions.
πŸ’‘Graph Manipulation
Graph manipulation is a method of statistical lie where graphs are designed to appear more convincing by using different scales for different data sets, making trends seem more significant than they are. The speaker uses an example from the American Congress to illustrate how this can lead to misleading conclusions about the services provided by Planned Parenthood.
πŸ’‘Poll Bias
Poll bias occurs when a survey or poll is not representative of the population it claims to study, leading to inaccurate or exaggerated results. The video cites a study that was reported in the New York Times, which may have overestimated the prevalence of sexual assault on campus due to a small sample size and a broad definition of assault.
πŸ’‘Decimal Points and Precision
Decimal points and precision refer to the level of detail provided by a numerical value, often indicated by the number of digits beyond the decimal point. The video points out that an excessive number of decimal places can create a false sense of accuracy, as seen with the frequent revisions to GDP growth figures, which are difficult to calculate and subject to change as more data becomes available.
πŸ’‘Spectacular Statistics
Spectacular statistics are numbers that appear impressive due to their size or the percentage change they represent, but may not be as significant when context is considered. An example from the video is the claim that daily processed meat consumption increases the risk of bowel cancer by 20%, which, when contextualized, represents a small absolute risk increase.
πŸ’‘Correlation vs. Causality
Correlation vs. causality is a fundamental statistical concept that highlights the difference between two events occurring together (correlation) and one event causing the other (causality). The video warns against assuming causality from correlation, using the example of brain tumor rates increasing with the introduction of NutraSweet, which was later debunked in favor of increased MRI usage allowing for more tumor detections.
πŸ’‘Data Patterns
Data patterns refer to the trends or regularities that can be observed within a set of data. The speaker, as a 'number nerd,' is fascinated by these patterns and how they can reveal insights about the world. The video emphasizes that understanding these patterns is crucial for interpreting statistical information accurately.
πŸ’‘Torturing Numbers
The phrase 'torturing numbers' is used to describe the manipulation of statistical data to force a particular conclusion or narrative. The video speaker mentions a quote, 'torture numbers and they'll confess to anything,' to highlight how easily statistics can be twisted to support a misleading argument.
πŸ’‘Statistical Self-Defense
Statistical self-defense is the idea of equipping oneself with the knowledge and skills to critically evaluate statistical claims and avoid being misled by bad statistics. The video encourages viewers to learn how to defend against statistical lies to protect themselves and society from misinformation.
πŸ’‘Media Representation of Numbers
Media representation of numbers involves how numerical data is presented in news outlets and can significantly influence public perception. The video criticizes the media for often presenting numbers in a way that is not only misleading but also shapes people's views on various topics, such as health, economy, and politics.
πŸ’‘Addiction to Numbers
The speaker describes an 'addiction to numbers' as a fascination with numerical patterns and data analysis from a young age. This personal anecdote sets the stage for discussing the importance of numbers in understanding the world and the speaker's journey from childhood hobbies to a professional career in data and journalism.
Highlights

The speaker emphasizes the importance of understanding and interpreting statistical data accurately, as it shapes our view of the world and informs our decisions.

A study is mentioned where 93.4% of participants have heard of TED, indicating its widespread recognition.

No participants reported having the flu, suggesting that the audience is likely in good health.

The speaker, Song, introduces himself as a 'number nerd' with a lifelong passion for data and patterns.

Song discusses his career evolution from PhD researcher to journalist, focusing on the value of data and statistics.

The news is full of numbers that influence our lives, but the accuracy and trustworthiness of these numbers are crucial.

The speaker identifies 'tortured numbers' in the news that can misinform the public, quoting the phrase 'torture numbers and they'll confess to anything'.

Five common statistical lies in newspapers are introduced, aiming to educate the audience on 'statistical self-defense'.

The first lie discussed is the 'good-looking graph', which can be deceptive due to the use of different scales.

A misleading graph from the American Congress about Planned Parenthood's services is critiqued as an example of the first lie.

The second lie, 'the polluted pole', is exemplified by a potentially exaggerated statistic on sexual assault on campus.

The third lie, 'overconfidence in decimal points', is illustrated by the frequent revisions in GDP growth figures.

The fourth lie, 'the spectacular statistic', is debunked by providing context to seemingly impressive numbers, such as the risk of bowel cancer from processed meat.

The fifth and most dangerous lie, 'the cocky correlation', is explained through the example of brain tumor incidence and the introduction of NutraSweet.

The importance of not confusing correlation with causation is stressed, using the example of drug-related arrests among black Americans.

The speaker calls for vigilance against bad statistics to prevent harm to individuals and society.

Song concludes with a call to action for statistical literacy to protect oneself and others from misleading data.

Transcripts
Rate This

5.0 / 5 (0 votes)

Thanks for rating: