COVID swab test

Understanding COVID-19 data: Comparing data across time


This piece is part of a series, presented by our partner SAS, that explores the role of data in understanding the COVID-19 pandemic. SAS is a pioneer in the data management and analytics field. (Check out other posts in the series on our Get Smart About COVID-19 Misinformation page.)

In addition to comparisons of COVID-19 data across countries, other information that can keep us glued to our seats is seeing how the number of cases and deaths are changing over time. While these insights can help us draw some conclusions, it’s important to consider the numbers critically. Because this disease and our responses to it are changing daily, so are the factors that go into each number.

For example, consider the two graphs in this Business Insider article, showing the case fatality rate by country.

It’s interesting to see that just a few weeks ago, the U.S. had the worst case fatality rate of any country. Was that because we were woefully unprepared? Or was it because our medical system was failing? Or was it because of the lack of tests we had available at the time?

The final factor is likely a big part of the answer. We can infer this because the fatality rate appears to have declined significantly since that time. We’re either getting better at treating COVID-19, or we are improving our detection of less severe cases through increased testing. An increase in testing results in a rise in the number of cases reported, thus lowering the case fatality rate. Either way, the fundamental sources driving the data are changing over time.

You can see a very clear example of this when you look at almost any graph of cases in China (including the one below). Most show a significant jump in cases on or around Feb. 12, when China made a decision to reclassify thousands of people as positive for COVID-19 without confirming those cases through diagnostic testing. This additional context is critical in being able to accurately interpret the data.

2020 coronavirus cases by date of report.svg
By Phoenix7777Own work
Data source: Coronavirus disease (COVID-2019) situation reports, World Health Organization (WHO), CC BY-SA 4.0, Link

Unfortunately, most of the changes in underlying factors don’t have such a clear demarcation. In many countries, testing has slowly ramped up, making it possible to detect more cases over time but difficult to determine if an increase in cases is due to wider testing or simply the spread of the disease.

Similarly, changes to policy — in various degrees and with various impacts — have rolled out in many countries. This includes differences in social isolation requirements or whether people with mild symptoms should be tested. Each decision impacts the data we see, so we can’t interpret the resulting changes in cases or deaths over time as concrete evidence of how the disease spreads. It also makes it challenging to predict what will happen in the future.

Other articles in this series:

SAS logoAbout SAS: Through innovative analytics software and services, SAS helps customers around the world transform data into intelligence.

More Updates

Washington Post’s Valerie Strauss showcases The Sift

Washington Post education reporter Valerie Strauss features content from The Sift®,  NLP’s free weekly newsletter for educators, in her blog throughout the school year. Twitter chaos, LeBron James, Kyrie Irving and more news literacy lessons (Nov. 20, 2022) Katy Perry’s eye, zombie rumors and other news literacy lessons (Nov. 3, 2022) No, Eagles fans didn’t…

NLP in the News

NewsLitCamp Philadelphia

Please join us on Friday, Dec. 2 from 8 a.m. to 3 p.m. ET for NewsLitCamp® Philadelphia, a day of immersive, in-person professional learning built exclusively for educators and focusing on key news literacy concepts.