How to build a data-driven company

10 analytics best practices (including 3 you can’t ignore)

15 MIT startups to watch

Credit: Rob Dobi

Ideas Made to Matter


MIT Sloan research about the coronavirus pandemic

Below is a list of working papers — preliminary versions of academic research –— by MIT Sloan faculty and PhD students related to the coronavirus/COVID-19 pandemic. This page will be updated with new papers as they become available. A list of projects and other resources led by MIT Sloan faculty is at the end of this page.

Effects of a universal basic income during the pandemic

Abhijit Banerjee, Michael Faye, Alan Krueger, Paul Niehaus, Tavneet Suri

We examine some effects of Universal Basic Income during the COVID-19 pandemic using a large-scale experiment in rural Kenya. Transfers significantly improved well-being on common measures such as hunger, sickness, and depression in spite of the pandemic, but with modest effect sizes. They may have had public health benefits, as they reduced hospital visits and decreased social (but not commercial) interactions that influence contagion rates. During the pandemic (and contemporaneous agricultural lean season) recipients lost the income gains from starting new non-agricultural enterprises that they had initially obtained, but also suffered smaller increases in hunger. This pattern is consistent with the idea that UBI induced recipients to take on more income risk in part by mitigating the most harmful consequences of adverse shocks.

Download the research

Predicting COVID-19 infection risk and related risk drivers in nursing homes: A machine learning approach

Christopher L.F. Sun, Eugenio Zuccarelli, El Ghali A. Zerhouni, Jason Lee, James Muller, Karen M. Scott, Alida M. Lujan, Retsef Levi

A machine-learning model trained on COVID-19 outcomes from 1,146 nursing homes identified a nursing home’s county’s infection rate, number of units, historical health deficiencies from Centers of Medicare and Medicaid Services inspections, percent of non-Hispanic White residents, and density as predictive of infection risk.

Read the research

COVID-19 risk among airline passengers: Should the middle seat stay empty?   

Arnold Barnett

We use recent data and research results to approximate the probability that an air traveler in coach will contract COVID-19 on a U.S. domestic flight two hours long, both when all coach seats are full and when all but middle seats are full. The point estimates we reach based on data from late June 2020 are 1 in 4,300 for full flights and 1 in 7,700 when middle seats are kept empty. These estimates are subject to both quantifiable and nonquantifiable sources of uncertainty, and sustain known margins of error of a factor about 2.5. However, because uncertainties in key parameters affect both risk estimates the same way, they leave the relative risk ratio for fill all seats compared to middle seat open close to 1.8 (i.e., close to 1/4,300)/(1/7,700). We estimate the mortality risks caused by COVID-19 infections contracted on airplanes, taking into account that infected passengers can in turn infect others. The point estimates, which use 2019 data about the percentage of seats actually occupied on U.S. flights, range from one death per 400,000 passengers to one death per 600,000. These death-risk levels are considerably higher than those associated with plane crashes but comparable to those arising from two hours of everyday activities during the pandemic.

Read the research

Estimating the global spread of COVID-19

Hazhir Rahmandad, TY Lim, John Sterman

Limited and inconsistent testing and differences in age distribution, health care resources, social distancing, and policies have caused large variations in the extent and dynamics of the COVID-19 pandemic across nations, complicating the estimation of prevalence, the infection fatality rate (IFR), and other factors important to care providers and policymakers. Using data for all 84 countries with reliable testing data (spanning 4.75 billion people) we develop a dynamic epidemiological model integrating data on cases, deaths, excess mortality and other factors to estimate how asymptomatic transmission, disease acuity, hospitalization, and behavioral and policy responses to risk condition prevalence and IFR across nations and over time. For these nations we estimate IFR averages 0.68% (0.64%-0.7%). Cases and deaths through June 18, 2020 are estimated to be 11.8 and 1.48 times official reports, respectively, at 88.5 (85-95.3) million and 600 (586-622) thousand. Prevalence and IFR vary substantially, e.g., Ecuador (18%; 0.61%), Chile (15.5%; 0.57%), Mexico (8.8%; 0.69%), Iran (7.9%; 0.44%), USA (5.3%; 0.99%), UK (5.2%; 1.59%), Iceland (1.65%, 0.56%), New Zealand (0.1%, 0.64%), but all nations remain well below the level needed for herd immunity. By alerting the public earlier and reducing contacts, extensive testing when the pandemic was declared could have averted 35.3 (32.7-42.7) million cases and 197 (171-232) thousand deaths. However, future outcomes are less dependent on testing and more contingent on the willingness of communities and governments to reduce transmission. Absent breakthroughs in treatment or vaccination and with mildly improved responses we project 249 (186-586) million cases and 1.75 (1.40-3.67) million deaths in the 84 countries by Spring 2021.

Read the research

The short-run and long-run effects of COVID-19 on energy and the environment

Kenneth T. Gillingham, Christopher R. Knittel, Jing Li, Martin Ovaere, Mar Reguant

We explore how the short-run effects of Covid-19 in reducing CO2 and local air pollutant emissions can easily be outweighed by the long-run effects of a slowing of clean energy innovation. Focusing on the United States, we show that in the short run, COVID-19 has reduced consumption for jet fuel and gasoline dramatically, by 50% and 30% respectively, while electricity demand has declined by less than 10%. CO2 emissions have declined by 15%, while local air pollutants have also declined, saving about 200 lives per month. However, there could be a deep impact on long-run innovation in clean energy, leading to an additional 2,500 MMT CO2 and 40 deaths per month on average to 2035. Even pushing back renewable electricity generation investments by one year would outweigh the emission reductions and avoided deaths from March-June 2020. The policy response will determine how COVID-19 ultimately influences the future path of emissions.

Read the research  

The fiscal cost of COVID-19: Evidence from states

Christos Makridis, Robert McNab

How has the COVID-19 pandemic affected state budgets? Using a quarterly panel of states between 1994 and 2019, we estimate how changes in employment affect tax revenues. We find that a one percentage point (pp) rise in employment is associated with a 1.56pp rise in total tax revenue, which is concentrated among sales taxes (a 1.19pp increase), individual income taxes (a 1.63pp increase), and corporate income taxes (a 4.13pp increase). These results are robust to a wide array of controls, such as state composition and housing price growth, and instrumental variable specifications. After estimating state-specific elasticities, and forecasting counterfactual employment within states using continued unemployment claims, we find that the average state will experience a 20% decline in their tax revenues (or $4.9 billion) and $254.8 billion nationally in 2012 prices, assuming a recovery that follows trends up to April. We are incorporating additional employment to account for the economic recovery.

Read the research

Transitioning out of the coronavirus lockdown: A framework for evaluating zone-based social distancing

Eric Friedman, John Friedman, Simon Johnson, Adam Landsberg

In the face of elevated pandemic risk, canonical epidemiological models imply the need for extreme social distancing over a prolonged period. Alternatively, people could be organized into zones, with more interactions inside their zone than across zones. Zones can deliver significantly lower infection rates, with less social distancing, particularly if combined with simple quarantine rules and contact tracing. This paper provides a framework for understanding and evaluating the implications of zones, quarantines, and other complementary policies.

Read the research

Working remotely and the supply-side impact of COVID-19

Dimitris Papanikolaou, Lawrence Schmidt

We analyze the supply-side disruptions associated with COVID-19 across firms and workers. To do so, we exploit differences in the ability of workers across industries to work remotely using data from the American Time Use Survey. We find that sectors in which a higher fraction of the workforce is not able to work remotely experienced significantly greater declines in employment, significantly more reductions in expected revenue growth, worse stock market performance, and higher expected likelihood of default. In terms of individual employment outcomes, lower-paid workers, especially female workers with young children, were significantly more affected by these disruptions. Last, we combine these ex-ante heterogeneous industry exposures with daily financial market data to create a stock return portfolio that most closely replicates the supply-side disruptions resulting from the pandemic.

Read the research

The impact of COVID-19 on the U.S. child care market: Evidence from stay-at-home orders

Umair Ali, Chris M. Herbst, Christos Makridis

Stay-at-home orders (SAHOs) have been implemented in most U.S. states to mitigate the spread of COVID-19. This paper quantifies the short-run impact of these containment policies on the supply of and demand for child care. The child care market may be particularly vulnerable to a SAHO-type policy shock, given that many providers are liquidity-constrained. Using plausibly exogenous variation from the staggered adoption of SAHOs across states, we find that online job postings for early care and education teachers declined by 13% after enactment. This effect is driven exclusively by private-sector services. Indeed, hiring by public programs like Head Start and pre-kindergarten has not been influenced by SAHOs. In addition, we find little evidence that child care search behavior among households has been altered. Because forced supply-side changes appear to be at play, our results suggest that households may not be well-equipped to insure against the rapid transition to the production of child care. We discuss the implications of these results for child development and parental employment decisions.

Read the research

Analysis of misinformation during the COVID-19 outbreak in China: cultural, social and political entanglements

Yan Leng, Yujia Zhai, Shaojing Sun, Yifei Wu, Jordan Selzer, Sharon Strover, Julia Fensel, Alex Pentland, Ying Ding

COVID-19 resulted in an infodemic, which could erode public trust, impede virus containment, and outlive the pandemic itself. The evolving and fragmented media landscape is a key driver of the spread of misinformation. Using misinformation identified by the fact-checking platform by Tencent and posts on Weibo, our results showed that the evolution of misinformation follows an issue-attention cycle, pertaining to topics such as city lockdown, cures, and preventions, and school reopening. Sources of authority weigh in on these topics, but their influence is complicated by peoples' pre-existing beliefs and cultural practices. Finally, social media has a complicated relationship with established or legacy media systems. Sometimes they reinforce each other, but in general, social media may have a topic cycle of its own making. Our findings shed light on the distinct characteristics of misinformation during the COVID-19 and offer insights into combating misinformation in China and across the world at large.

Read the research

Interdependence and the cost of uncoordinated responses to COVID-19

David Holtz, Michael Zhao, Seth G. Benzell, Cathy Y. Cao, M. Amin Rahimian, Jeremy Yang, Jennifer Allen, Avinash Collis, Alex Moehring, Tara Sowrirajan, Dipayan Ghosh, Yunhao Zhang, Paramveer S. Dhillon, Christos Nicolaides, Dean Eckles, Sinan Aral

Social distancing is the core policy response to COVID-19. But as federal, state and local governments begin opening businesses and relaxing shelter-in-place orders worldwide, we lack quantitative evidence on how policies in one region affect mobility and social distancing in other regions and the consequences of uncoordinated regional policies adopted in the presence of such spillovers. We therefore combined daily, county-level data on shelter-in-place and business closure policies with movement data from over 27 million mobile devices, social network connections among over 220 million of Facebook users, daily temperature and precipitation data from 62,000 weather stations and county-level census data on population demographics to estimate the geographic and social network spillovers created by regional policies across the United States. Our analysis showed the contact patterns of people in a given region are significantly influenced by the policies and behaviors of people in other, sometimes distant, regions. When just one third of a state’s social and geographic peer states adopt shelter in place policies, it creates a reduction in mobility equal to the state’s own policy decisions. These spillovers are mediated by peer travel and distancing behaviors in those states. A simple analytical model calibrated with our empirical estimates demonstrated that the “loss from anarchy” in uncoordinated state policies is increasing in the number of non-cooperating states and the size of social and geographic spillovers. These results suggest a substantial cost of uncoordinated government responses to COVID-19 when people, ideas, and media move across borders.

Download the research

Learning from friends in a pandemic: Social networks and the macroeconomic response of consumption

Christos Makridis, Tao Wang

How do individuals adjust their consumption in response to information disseminated through peers and the social network? Using new micro-data on consumption, coupled with geographic friendship ties to measure social connectivity, this paper quantifies the role of social networks as a propagation mechanism for understanding aggregate fluctuations in consumption. Using the COVID-19 pandemic as a source of variation, we find that a 10% rise in cases and deaths in counties connected through the social network is associated with a 0.64% and 0.33% decline in consumption expenditures--roughly three to seven times as large as the direct effects of local cases or deaths. Counties more socially connected to epicenter countries of the pandemic also saw a bigger drop in consumption. These effects are concentrated among consumer goods and services that rely more on social-contact, suggesting that individuals incorporate the experiences from their social network to inform their own consumption choices. We are working on incorporating this micro-economic evidence into a heterogeneous agent model and social interaction to study the aggregate demand implications.

Read the research

Beat COVID-19 through innovation

Pierre Azoulay, Benjamin Jones

As coronavirus disease 2019 (COVID-19) has spread, public health and economic well-being are increasingly in conflict. Governments are prioritizing public health, but the current solution—social isolation—is costly as commerce remains shut down. Restarting economies could rekindle the pandemic and cause even worse human suffering. Innovation can help societies escape the untenable choice between public and economic health. The world needs effective vaccines, therapies, or other solutions. But how do we achieve these solutions, and achieve them quickly?

Read the editorial in Science

Weather conditions and COVID-19 transmission: Estimates and projections

Ran Xu, Hazhir Rahmandad, Marichi Gupta, Catherine DiGennaro, Navid Ghaffarzadegan, Mohammad Jalali

Background: Understanding and projecting the spread of COVID-19 requires reliable estimates of how weather components are associated with the transmission of the virus. Prior research on this topic has been inconclusive. Identifying key challenges to reliable estimation of weather impact on transmission we study this question using one of the largest assembled databases of COVID-19 infections and weather.

Methods: We assemble a dataset that includes virus transmission and weather data across 3,739 locations from December 12, 2019 to April 22, 2020. Using simulation, we identify key challenges to reliable estimation of weather impacts on transmission, design a statistical method to overcome these challenges, and validate it in a blinded simulation study. Using this method and controlling for location-specific response trends we estimate how different weather variables are associated with the reproduction number for COVID-19. We then use the estimates to project the relative weather-related risk of COVID-19 transmission across the world and in large cities.

Results: We show that the delay between exposure and detection of infection complicates the estimation of weather impact on COVID-19 transmission, potentially explaining significant variability in results to-date. Correcting for that distributed delay and offering conservative estimates, we find a negative relationship between temperatures above 25 degrees Celsius and estimated reproduction number (R ̂), with each degree Celsius associated with a 3.1% (95% CI, 1.5% to 4.8%) reduction in R ̂. Higher levels of relative humidity strengthen the negative effect of temperature above 25 degrees. Moreover, one millibar of additional pressure increases R ̂ by approximately 0.8% (95% CI, 0.6% to 1%) at the median pressure (1016 millibars) in our sample. We also find significant positive effects for wind speed, precipitation, and diurnal temperature on R ̂. Sensitivity analysis and simulations show that results are robust to multiple assumptions. Despite conservative estimates, weather effects are associated with a 43% change in R ̂ between the 5th and 95th percentile of weather conditions in our sample.

Conclusions: These results provide evidence for the relationship between several weather variables and the spread of COVID-19. However, the (conservatively) estimated relationships are not strong enough to seasonally control the epidemic in most locations.

Read the research

A multi-risk SIR model with optimally targeted lockdown

Daron Acemoglu, Victor Chernozhukov, Iván Werning, Michael D. Whinston

We develop a multi-risk SIR model (MR-SIR) where infection, hospitalization and fatality rates vary between groups — in particular between the “young,” the “middle-aged” and the “old.” Our MR-SIR model enables a tractable quantitative analysis of optimal policy similar to those already developed in the context of the homogeneous-agent SIR models. For baseline parameter values for the COVID-19 pandemic applied to the U.S., we find that optimal policies differentially targeting risk/age groups significantly outperform optimal uniform policies and most of the gains can be realized by having stricter lockdown policies on the oldest group. For example, for the same economic cost (24.3% decline in GDP), optimal semi–targeted or fully-targeted policies reduce mortality from 1.83% to 0.71% (thus, saving 2.7 million lives) relative to optimal uniform policies. Intuitively, a strict and long lockdown for the most vulnerable group both reduces infections and enables less strict lockdowns for the lower-risk groups. We also study the impacts of social distancing, the matching technology, the expected arrival time of a vaccine, and testing with or without tracing on optimal policies. Overall, targeted policies that are combined with measures that reduce interactions between groups and increase testing and isolation of the infected can minimize both economic losses and deaths in our model.

Read the research

COVID-19 and the welfare effects of reducing contagion

Robert S. Pindyck

I use a simple SIR model, augmented to include deaths, to elucidate how pandemic progression is affected by the control of contagion, and examine the key trade-offs that underlie policy design. I illustrate how the cost of reducing the “reproduction number” R0 depends on how it changes the infection rate, the total and incremental number of deaths, the duration of the pandemic, and the possibility and impact of a second wave. Reducing R0 reduces the number of deaths, but extends the duration (and hence economic cost) of the pandemic, and it increases the fraction of the population still susceptible at the end, raising the possibility of a second wave. The benefit of reducing R0 is largely lives saved, and the incremental number of lives saved rises as R0 is reduced. But using a VSL estimate to value those lives is problematic.

Download the research

Earnings expectations in the COVID crisis

Augustin Landier, David Thesmar

We analyze firm-level analyst forecasts during the COVID crisis. First, we describe expectations dynamics about future corporate earnings. Downward revisions have been sharp, especially for 2020 and 2021, but much less drastic than the lower bound estimated by Koijen et al. (2020). Analysts' consensus forecast does not exhibit evidence of over-reaction: Forecasts over 2020 earnings have slowly decreased by 10.2% over the course of March and April 2020 before stabilizing. Long-run forecasts, as well as expected "Long-Term Growth'' have reacted less than short-run forecasts, and feature less disagreement. However, even the 2024 forecasts are revised down. Second, we ask how much forecast revisions explain market dynamics. Without change in discount rates, mean forecast-implied cumulative returns from mid-February to mid-April should be around -9%, while they were actually -20%. The difference between forecast-implied returns and actual returns implies a rise in the average discount rate of about 1%. This rise can entirely be accounted for by the increase in market leverage that firms have faced during the crisis. In other words, analyst forecast revisions explain most of the downward revision in equity values.

Read the research

Did the paycheck protection program hit the target?

Joao Granja, Christos Makridis, Constantine Yannelis, Eric Zwick

This paper takes an early look at the Paycheck Protection Program (PPP), a large and novel small business support program that was part of the initial policy response to the COVID-19 pandemic. We use new data on the distribution of PPP loans and high-frequency micro-level employment data to consider two dimensions of program targeting. First, we do not find evidence that funds flowed to areas more adversely affected by the economic effects of the pandemic, as measured by declines in hours worked or business shutdowns. If anything, funds flowed to areas less hard hit. Second, we find significant heterogeneity across banks in terms of disbursing PPP funds, which does not only reflect differences in underlying loan demand. The top-4 banks alone account for 36% of total pre-policy small business loans, but disbursed less than 3% of all PPP loans. Areas that were significantly more exposed to low-PPP banks received much lower loan allocations. As data become available, we will study employment and establishment responses to the program and the impact of PPP support on the economic recovery. Measuring these responses is critical for evaluating the social insurance value of the PPP and similar policies.

Read the research

Ties that bind (and social distance): How social capital helps communities weather the COVID-19 pandemic

Christos Makridis, Cary Wu

The COVID-19 pandemic represents the largest worldwide shock in at least a decade. Moreover, the spread of the virus has been highly heterogeneous. This paper investigates the role of social capital as a potential mediating factor for the spread of the COVID-19 virus. On one hand, higher social capital could imply greater in-person interaction and risk of contagion. On the other hand, because social capital is associated with greater trust and relationships within a community, it could endow individuals with a greater concern for others, thereby leading to more hygienic practices and social distancing. Our results suggest that moving a county from the 25th to the 75th percentile of the distribution of social capital would lead to a 20% decline in the number of infections, as well as a 0.28 percentage point decline in the growth rate of the virus (nearly 20% of the median growth rate). Moreover, our results are robust to many demographic characteristics, controls, and alternative measures of social capital.

Read the research

Internal and external effects of social distancing in a pandemic

Maryam Farboodi, Gregor Jarosch, Robert Shimer

We use a conventional dynamic economic model to integrate individual optimization, equilibrium interactions, and policy analysis into the canonical epidemiological model. Our tractable framework allows us to represent both equilibrium and optimal allocations as a set of differential equations that can jointly be solved with the epidemiological model in a unified fashion. Quantitatively, the laissez-faire equilibrium accounts for the decline in social activity we measure in U.S. micro-data from SafeGraph. Relative to that, we highlight three key features of the optimal policy: it imposes immediate, discontinuous social distancing; it keeps social distancing in place for a long time or until treatment is found; and it is never extremely restrictive, keeping the effective reproduction number mildly above the share of the population susceptible to the disease.

Read the research

Rationing social contact during the COVID-19 pandemic: Transmission risk and social benefits of U.S. locations

Seth Benzell, Avinash Collis, Christos Nicolaides

To prevent the spread of COVID-19, some types of stores and gathering places have been shut down while others remain open. The decision to shut down one type of location and leave another open constitutes a judgement about the relative danger and benefits of those locations. Using location data from a large sample of smartphones, nationally representative consumer preference surveys, and government statistics, we measure the relative transmission risk benefit and social cost of closing 30 different location categories in the U.S. Our categories include types of shops, schools, entertainments, and public spaces. We rank categories by those which should face stricter regulation via dominance across eight dimensions of risk and importance and through composite indexes. We find that from February to March, there were larger declines in visits to locations that our measures imply should be closed first. We hope this analysis will help policymakers decide how to reopen their economies.

Read the research

Transitioning out of the coronavirus lockdown: A framework for zone-based social distancing

Eric Friedman, John Friedman, Simon Johnson, Adam Landsberg

In the face of elevated pandemic risk, is it necessary to completely lock down the population, imposing extreme social distancing? Canonical epidemiological models suggest this may be unavoidable for months at a time, despite the heavy social and human cost of physically isolating people. Alternatively, people could retreat into socially or economically defined defensive zones, with more interactions inside their zone than across zones. Starting from a complete lockdown, zones could facilitate responsible reopening of education, government, and firms, as a well-implemented structure can dramatically slow the diffusion of the disease. This paper provides a framework for understanding and evaluating the effectiveness of zones for social distancing.

Read the research

Bayesian adaptive clinical trials for anti-infective therapeutics during epidemic outbreaks

Qingyang Xu, Shomesh Chaudhuri, Danying Xiao, Andrew W. Lo

In the midst of epidemics such as COVID-19, therapeutic candidates are unlikely to be able to complete the usual multi-year clinical trial and regulatory approval process within the course of an outbreak. We apply a Bayesian adaptive patient-centered model---which minimizes the expected harm of false positives and false negatives---to optimize the clinical trial development path during such outbreaks. When the epidemic is more infectious and fatal, the Bayesian-optimal sample size in the clinical trial is lower and the optimal statistical significance level is higher. For COVID-19 (assuming a static R0=2 and initial infection percentage of 0.1%), the optimal significance level is 7.1% for a clinical trial of a non-vaccine anti-infective therapeutic clinical trial and 13.6% for that of a vaccine. For a dynamic R0 ranging from 2 to 4, the corresponding values are 14.4% and 26.4%, respectively. Our results illustrate the importance of adapting the clinical trial design and the regulatory approval process to the specific parameters and stage of the epidemic.

Read the research

Estimating probabilities of success of clinical trials for vaccines and other anti-infective therapeutics

Chi Heem Wong, Kien Wei Siah, Andrew W. Lo

A key driver in biopharmaceutical investment decisions is the probability of success of a drug development program. We estimate the probabilities of success (PoS) of clinical trials for vaccines and other anti-infective therapeutics using 43,414 unique triplets of clinical trial, drug, and disease between January 1, 2000, and January 7, 2020, yielding 2,544 vaccine programs and 6,829 non-vaccine programs targeting infectious diseases. The overall estimated PoS for an industry-sponsored vaccine program is 39.6%, and 16.3% for an industry-sponsored anti-infective therapeutic. Among industry-sponsored vaccines programs, only 12 out of 27 disease categories have seen at least one approval, with the most successful being against monkeypox (100%), rotavirus (78.7%), and Japanese encephalitis (67.6%). The three infectious diseases with the highest PoS for industry-sponsored non-vaccine therapeutics are smallpox (100%), CMV (31.8%), and onychomycosis (29.8%). Non-industry-sponsored vaccine and non-vaccine development programs have lower overall PoSs: 6.8% and 8.2%, respectively. Viruses involved in recent outbreaks---MERS, SARS, Ebola, Zika---have had a combined total of only 45 non-vaccine development programs initiated over the past two decades, and no approved therapy to date (Note: our data was obtained just before the COVID-19 outbreak and do not contain information about the programs targeting this disease.) These estimates offer guidance both to biopharma investors as well as to policymakers seeking to identify areas most likely to be undeserved by private-sector engagement and in need of public-sector support.

Read the research

A checklist for prosocial messaging campaigns such as COVID-19 prevention appeals

Erez Yeoli, David Rand

Addressing public good problems typical requires people to adopt behaviors that are personally burdensome but beneficial for society. For instance, during the COVID-19 pandemic, people have been asked to stay home except in extenuating circumstances, maintain social distance, and wash their hands frequently. What is the best way to phrase one’s requests to ensure that people are maximally motivated to adhere? We distill three key insights from the behavioral science literature on social norms to create a simple messaging checklist: communicate the benefit to the community; make the ask unambiguous, categorical, and concise; and generate the impression that others expect compliance. We justify this guidance and illustrate it using practical examples, with a focus on COVID-19 prevention behaviors.

Read the research

Which retail outlets generate the most physical interactions?

Avi Goldfarb, Catherine E. Tucker

This paper seeks to answer the simple question of what category of retail outlets generates the most physical interactions in the regular course of life. In this way, we aim to bring a marketing perspective to discussions about which businesses may be most risky from the standpoint of spreading contagious disease. We use detailed data from people's mobile devices prior to the implementation of social distancing measures in the United States. With this data, we examine a number of potential indicators of risk of contagion: The absolute number of visits and visitors, how many of the visits are generated by the same people, the median average distance traveled by the visitor to the retailer, and the number of customers from Canada and Mexico. We find that retailers with a single outlet tend to attract relatively few visitors, fewer one-off visitors, and have fewer international customers. For retailers that have multiple stores the patterns are non-linear. Retailers that have such a large number of stores that they are ubiquitous tend to exhibit fewer visits and visitors and attract customers from a smaller distance. However, retailers that have a large enough footprint to be well known, but not large enough to be ubiquitous tend to attract a large number of visitors who make one-off visits, travel a long distance, and are disproportionately international.

Read the research


Social distancing and school closures: Documenting disparity in internet access among school children

Ananya Sen, Catherine E. Tucker

Social distancing directives across the United States have led to school closures. Some districts are moving towards online instruction, but this requires internet access at home. We examine the factors that determine whether school children have access to the internet at home. We document that poor and non-white children still have lower access to the internet. Moreover, in areas where poor and non-white children have relatively lower test scores, such children are more likely to not have access to the internet. However, there is some evidence of positive spillovers from the historic presence of [information and communication technologies] industries in the local area in improving the access of disadvantaged children to the internet. The empirical insights highlight how the digital divide might exacerbate existing educational inequalities in the face of school closures due to social distancing.

Read the research

COVID-19 and remote work: An early look at U.S. data

Erik Brynjolfsson, John Horton, Adam Ozimek, Daniel Rock, Garima Sharma, Hong Yi Tu Ye

We report the results of a nationally representative sample of the U.S. population on how they are adapting to the COVID-19 pandemic. The survey ran from April 1 - 5, 2020. Of those employed four weeks earlier, 34.1% report they were commuting and are now working from home. In addition, 11.8% report being laid off or furloughed in the last four weeks. There is a strong negative relationship between the fraction in a state still commuting to work and the fraction working from home which suggests that many workers currently commuting could be converted to remote workers. We find that the share of people switching to remote work can be predicted by the incidence of COVID-19 and that younger people were more likely to switch to remote work. Furthermore, using data on state unemployment insurance claims, we find that states with higher fractions of remote workers have higher than expected UI claims.

Download the research

Don’t get it or don’t spread it? Comparing self-interested versus pro-socially framed COVID-19 prevention messaging

Jillian Jordan, Erez Yoeli, David G. Rand

The COVID-19 pandemic threatens millions of lives, and an effective response will require individuals to take costly and difficult measures to slow the rate of transmission. Yet it is unclear how to best motivate preventative actions, which can be conceptualized as either self-interested or cooperative efforts. Should public health messaging focus on the benefits of prevention to individuals, society, or both? We shed light on this question across two studies conducted online via Amazon Mechanical Turk (total n = 2176 Americans) during the early days of the COVID-19 pandemic reaching the United States. We investigated the effects of three treatments, consisting of a written appeal and a flier, on intentions to engage in coronavirus prevention behaviors. We presented identical information across treatments, but varied our framing to emphasize the personal, public, or both personal and public benefits of prevention behaviors. We found evidence for the power of prosocial framing: the public treatment was more effective than the personal treatment, and the personal + public treatment was no more effective than the pure public treatment. Our results thus suggest that emphasizing the public benefits of prevention efforts may be an effective pandemic response strategy.

Read the research

The early effects of coronavirus-related social distancing restrictions on brands

Catherine E. Tucker, Shuyi Yu

This paper presents some of the first evidence on the effect of the spread of coronavirus (COVID-19) in the U.S. on retail footfall traffic. The paper uses granular visit data from cell-phone tracking to estimate the shift in visits to different types of restaurants as coronavirus spread in the U.S. across the first three weeks of March 2020. The descriptive empirical work provides three useful insights. First, the precise level of coronavirus spread in the state or the timing of any in-person dining ban in the state has had far smaller effects than the pronounced nationwide overall collapse in demand. Second, there is little evidence of substitution towards restaurants focused on delivery as a result of the bans. Though dine-in restaurants suffered the largest drop in customers as a result of state-imposed restaurant bans, quick-service restaurants experienced a steep decline. Last, the biggest individual effects of these state-specific bans appears to have been [on] top-ranked brands focusing on full-service dining. Both top and non-top-ranked brands suffered drops for restaurants not focused on dining in, with top brands suffering a slightly smaller decline.

Read the research

Social distancing, internet access, and inequality

Lesley Chiou, Catherine E. Tucker

This paper measures the role of the diffusion of high-speed internet on an individual's ability to self-isolate during a global pandemic. We use data that tracks 20 million mobile devices and their movements across physical locations, and whether the mobile devices leave their homes that day. We show that while income is correlated with differences in the ability to stay at home, the unequal diffusion of high-speed internet in homes across regions drives much of this observed income effect. We examine compliance with state-level directives to avoid leaving your home. Devices in regions with either high-income or high-speed internet are less likely to leave their homes after such a directive. However, the combination of having both high income and high-speed internet appears to be the biggest driver of propensity to stay at home. Our results suggest that the digital divide — or the fact that income and home internet access are correlated — appears to explain much inequality we observe in people's ability to self-isolate.

Read the research

Pandemics depress the economy, public health interventions do not: Evidence from the 1918 flu

Sergio Correia, Stephan Luck, Emil Verner

What are the economic consequences of an influenza pandemic? And given the pandemic, what are the economic costs and benefits of non-pharmaceutical interventions (NPI)? Using geographic variation in mortality during the 1918 flu pandemic in the U.S., we find that more exposed areas experience a sharp and persistent decline in economic activity. The estimates imply that the pandemic reduced manufacturing output by 18%. The downturn is driven by both supply- and demand-side channels. Further, building on findings from the epidemiology literature establishing that NPIs decrease influenza mortality, we use variation in the timing and intensity of NPIs across U.S. cities to study their economic effects. We find that cities that intervened earlier and more aggressively do not perform worse and, if anything, grow faster after the pandemic is over. Our findings thus indicate that NPIs not only lower mortality; they may also mitigate the adverse economic consequences of a pandemic.

Read the research

Financing vaccines for global health security

Jonathan Vu, Benjamin Kaplan, Shomesh Chaudhuri, Monique Mansoura, Andrew Lo

Recent outbreaks of infectious pathogens such as Zika, Ebola, and COVID-19 have underscored the need for the dependable availability of vaccines against emerging infectious diseases (EIDs). The cost and risk of R&D programs and uniquely unpredictable demand for EID vaccines have discouraged vaccine developers, and government and nonprofit agencies have been unable to provide timely or sufficient incentives for their development and sustained supply. We analyze the economic returns of a portfolio of EID vaccine assets, and find that under realistic financing assumptions, the expected returns are significantly negative, implying that the private sector is unlikely to address this need without public-sector intervention. We have sized the financing deficit for this portfolio and propose several potential solutions, including price increases, enhanced public-private partnerships, and subscription models through which individuals would pay annual fees to obtain access to a portfolio of vaccines in the event of an outbreak.

Read the research

The cost of COVID-19: A rough estimate of the 2020 U.S. GDP

Christos Makridis, Jonathan Hartley

This paper provides an early estimate of the economic effects of the national quarantine policy associated with the COVID-19 pandemic in the United States. Using a measure of digital intensity from Gallipoli and Makridis (2018), we exploit counties' exposure to industries that vary in their digital intensity as a proxy for the adverse effects of the decline in physical activity and in-person shutdown. Our baseline estimates suggest that the average county experiences a 5% decline in real GDP growth for two months of economic shutdown, which amounts to $2.14 trillion. Our results also suggest that the effects are highly heterogeneous across countries: for example, the most adversely affected counties may experience as much as a 15% decline in real GDP growth. Counties with higher shares of college graduates, digital workers, and higher median household income are less adversely affected, whereas those with higher shares of non-tradables employment are more affected.

Read the research

Simulation-based estimation of the spread of COVID-19 in Iran

Navid Ghaffarzadegan, Hazhir Rahmandad

Background: The 2019 Coronavirus (COVID-19) has turned into a global pandemic with unprecedented challenges for the global community. Understanding the state of the disease and planning for future trajectories relies heavily on data on the spread and mortality. Yet official data coming from various countries are highly unreliable: Symptoms similar to common cold in majority of cases and limited screening resources and delayed testing procedures may contribute to under-estimation of the burden of disease. Anecdotal and more limited data are available, but few have systematically combined those with official statistics into a coherent view of the epidemic. This study is a modeling-in-real-time of the emerging outbreak for understanding the state of the disease. Our focus is on the case of the spread of disease in Iran, as one of the epicenters of the disease in the first months of 2020. Method: We develop a simple dynamic model of the epidemic to provide a more reliable picture of the state of the disease based on existing data. Building on the generic SEIR (Susceptible, Exposed, Infected, and Recovered) framework we incorporate two behavioral and logistical considerations. First we capture the endogenous changes in contact rate (average contact per person) as more deaths are reported. As a result the reproduction number changes endogenously in the model. Second we differentiate reported and true cases by including simple formulations for how only a fraction of cases might be diagnosed, and how that fraction changes in response to epidemic's progression. In estimating the model we use both the official data as well as the discovered infected travelers and unofficial medical community estimates and triangulate these sources to build a more complete picture. Calibration is completed by forming a likelihood function for observing the actual time series data conditional on model parameters, and conducting a Markov Chain Monte Carlo simulations. The model is used to estimate current "true" cases of infection and death. We analyze the future trajectory of the disease under six conditions related to the seasonal effects and policy measures targeting social distancing. Findings: The model closely replicates the past data but also shows the true number of cases is likely far larger. We estimate about 493,000 current infected cases (90% CI: 271K-810K) as of March 20, 2020. Our estimate for cumulative cases of infection until that date is 916,000 (90% CI: 508K, 1.5M), and for total death is 15,485 (90% CI: 8.4K, 25.8K). These numbers are significantly (more than one order of magnitude) higher than official statistics. The trajectory of the epidemic until the end of June could take various paths depending on the impact of seasonality and policies targeting social distancing. In the most optimistic scenario for seasonal effects, depending on policy measures, 1.6 million Iranians (90% CI: 0.9M-2.6M) are likely to get infected, and death toll will reach about 58,000 cases (90% CI: 32K-97K), while in the more pessimistic scenarios, death toll may exceed 103,000 cases (90% CI: 56K-172K). Implication: Our results suggest that the number of cases and deaths may be over an order of magnitude larger than official statistics in Iran. Absent extended testing capacity other countries may face a significant under-count of existing cases and thus be caught off guard about the actual toll of the epidemic.

Read the research

Welfare costs of catastrophes: lost consumption and lost lives

Ian W. R. Martin, Robert S. Pindyck

Most of the literature on the economics of catastrophes assumes that such events cause a reduction in the stream of consumption, as opposed to widespread fatalities. Here we show how to incorporate death in a model of catastrophe avoidance, and how a catastrophic loss of life can be expressed as a welfare-equivalent drop in consumption. We examine how potential fatalities affect the policy interdependence of catastrophic events and “willingness to pay” (WTP) to avoid them. Using estimates of the “value of a statistical life” (VSL), we find the WTP to avoid major pandemics, and show it is large (10% or more of annual consumption) and partly driven by the risk of macroeconomic contractions. Likewise, the risk of pandemics significantly increases the WTP to reduce consumption risk. Our work links the VSL and consumption disaster literatures.

Download the research


Fighting COVID-19 misinformation on social media: Experimental evidence for a scalable accuracy nudge intervention

Gordon Pennycook, Jonathon, McPhetres, Yunhao Zhang, Jackson Lu, David G. Rand

Misinformation can amplify humanity's greatest challenges. A salient recent example of this is the COVID-19 pandemic, which has bred a multitude of falsehoods even as truth has increasingly become a matter of life and death. Here we investigate why people believe and spread false (and true) news content about COVID-19, and test an intervention intended to increase the truthfulness of the content people share on social media. Across two studies with over 1,600 participants (quota-matched to the American public on age, gender, ethnicity, and geographic region), we find support for the idea that people share false claims about COVID-19 in part because they simply fail to think sufficiently about whether or not content is accurate when deciding what to share. In Study 1, participants were far worse at discerning between true and false content when deciding what they would share on social media relative to when they are asked directly about accuracy. Furthermore, participants who engaged in more analytic thinking and had greater science knowledge were more discerning in their belief and sharing. In Study 2, we found that a simple accuracy reminder at the beginning of the study — i.e., asking people to judge the accuracy of a non-COVID-19-related headline — more than doubled the level of truth discernment in participants’ sharing intentions. In the control, participants were equally like to say they would share false versus true headlines at COVID-19 whereas, in the treatment, sharing of true headlines was significantly higher than false headlines. Our results — which mirror those found previously for political fake news — suggest that nudging people to think about accuracy is a simple way to improve choices about what to share on social media. Accuracy nudges are straightforward for social media platforms to implement on top of the other approaches they are currently employing, and could have an immediate positive impact on stemming the tide of misinformation about the COVID-19 outbreak.

Read the research

Projects and resources led by MIT Sloan faculty

COVID-19 Policy Alliance

The COVID-19 Policy Alliance rapidly generates actionable data intelligence and operational recommendations that can help government entities and organizations make better policy decisions regarding the health care system and economy.

COVID-19 Rapid Innovation Dashboard

The MIT community has been activated to fight the pandemic. From ways to combat feelings of isolation to open-source ventilators — students, faculty, staff, and alumni from academic departments and labs across campus are living the MIT mission to, “…work wisely, creatively, and effectively for the betterment of humankind.” We’ve aggregated the many MIT efforts here for you to explore, learn more, and get activated.


We are a group of researchers from the MIT Operations Research Center, led by Professor Dimitris Bertsimas. We aim to quickly develop and deliver tools for hospitals and policymakers in the U.S. to combat the spread of COVID-19. This work represents a collaborative effort with Hartford HealthCare and ASST Cremona, which have been providing us with data and support through the model creation process.

MIT Laboratory for Financial Engineering COVID-19 and Infectious Diseases Resources

Despite the threat that infectious diseases pose to global health and security – exemplified by the current COVID-19 pandemic – there are few incentives for biotech and pharmaceutical companies to develop vaccines and other treatments for such diseases due to the high costs of R&D and the uncertain future demand. Furthermore, although policymakers and global actors seek to diminish the danger these diseases pose to the wellbeing of nations, regions, and the world, they are often forced to choose between priorities based on limited information – there is large range of potential biological threats that are unpredictable in nature, and limited resources available to address them. Given these challenges, it is clear that new business models and financing structures are urgently needed. Do you have a working paper that should be included here? Faculty members and PhD students can submit papers for inclusion to Zach Church, editorial director, at