Work from Home During the COVID-19 Pandemic: An Observational Study Based on A Large Geo-Tagged COVID-19 Twitter Dataset (UsaGeoCov19) [Paper]


As COVID-19 swept over the world, people discussed facts, expressed opinions, and shared sentiments about the pandemic on social media. Since policies such as travel restriction and lockdown in reaction to COVID-19 were made at different levels of the society (e.g., schools and employers) and the government, we build a large geo-tagged Twitter dataset titled UsaGeoCov19 and perform an exploratory analysis by geographic location. Specifically, we collect 650,563 unique geo-tagged tweets across the United States covering the date range from January 25 to May 10, 2020. Tweet locations enable us to conduct region-specific studies such as tweeting volumes and sentiment, sometimes in response to local regulations and reported COVID-19 cases. During this period, many people started working from home. The gap between workdays and weekends in hourly tweet volumes inspire us to propose algorithms to estimate work engagement during the COVID-19 crisis. This paper also summarizes themes and topics of tweets in our dataset using both social media exclusive tools (i.e., #hashtags, @mentions) and the latent Dirichlet allocation model. We welcome requests for data sharing and conversations for more insights.

Dataset Links

State-wise Tweets (50 States and D.C.)

All Tweets in the United States

Known Data Gaps

Python Script to Retrieve Raw Tweet Data Using Tweet IDs

How to Cite This Dataset

      title={Work from home during the COVID-19 pandemic: An observational study based on a large geo-tagged COVID-19 Twitter dataset (UsaGeoCov19)},
      author={Feng, Yunhe and Zhou, Wenjun},
      journal={Information processing \& management},