This project analyzes global activity patterns using Twitter timestamps as a proxy for daily sleep–wake rhythms. The objective was to examine how geographic location (time zone) and occupation relate to shifts in online activity that may indicate circadian rhythm disruption.
Dataset: jobs_sleepwalk (2020)
Scale: 4.5M+ records
Key fields:
characteristic — occupation / roleutc_timestamp — posting time (UTC)user_hash — anonymized user identifierlocation — user-reported locationThe full data preprocessing, exploratory analysis, and visualization workflow was implemented in Python using Pandas and visualization libraries. The notebook includes timestamp transformations, location filtering, and multiple plots used in the final report.
Twitter activity is a behavioral proxy and does not directly measure sleep duration or sleep quality.