Analyzing Time Series Data

Topics

This week’s assignments will guide you through the following topics:

  • Know the essential components of time series analysis
  • Analyze time series data from Wikipedia

Reading

Please watch these short YouTube videos:

Please read these two papers:

  • Jennifer Pan and Margaret E. Roberts. “Censorship’s Effect on Incidental Exposure to Information: Evidence From Wikipedia” [Link]
  • Brian C. Keegan and Chenhao Tan. “A Quantitative Portrait of Wikipedia’s High-Tempo Collaborations during the 2020 Coronavirus Pandemic” [Link]
    • It’s a bit long, but it demonstrates many uses of several kinds of Wikipedia data.

If you have time and are interested in real-world applications:

Tasks

Complete the following tasks:

  • Continue to compute/optimize the M-Statistic for all English Wikipedia articles.
  • Plot time series of edit history of some highly-edited pages.
    • Do you notice some patterns?
    • Do the patterns differ between human vs. bot edits?
  • Think about and describe some potential ways you can quantify the effect of bot policies on the quality of collaboration on Wikipedia using concepts from the readings.
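
As a starting point for the plotting task, here is a minimal sketch of turning a page's revision history into daily edit counts, split into human and bot edits. It assumes you already have the revisions as (timestamp, username) pairs with ISO 8601 timestamps such as "2020-03-01T10:00:00Z". The function names, the sample usernames, and the rule that treats any username ending in "bot" as a bot are all illustrative assumptions, not part of the assignment. A more reliable classification would use the bot flag or user group information in your own data.

```python
from collections import Counter
from datetime import datetime, timedelta


def is_probable_bot(username):
    # Crude heuristic (an assumption): treat names ending in "bot" as bots.
    return username.lower().endswith("bot")


def daily_edit_counts(revisions):
    """Count edits per calendar day for humans and bots.

    `revisions` is an iterable of (timestamp, username) pairs.
    Days with no edits are filled with zero so the series has no gaps.
    """
    counts = {"human": Counter(), "bot": Counter()}
    for timestamp, username in revisions:
        day = datetime.strptime(timestamp, "%Y-%m-%dT%H:%M:%SZ").date()
        kind = "bot" if is_probable_bot(username) else "human"
        counts[kind][day] += 1

    all_days = set(counts["human"]) | set(counts["bot"])
    if not all_days:
        return {"human": {}, "bot": {}}

    # Build a continuous calendar from the first to the last observed day.
    start, end = min(all_days), max(all_days)
    calendar = [start + timedelta(days=i) for i in range((end - start).days + 1)]
    return {kind: {day: c[day] for day in calendar} for kind, c in counts.items()}


# Hypothetical sample data, only to show the expected input shape.
sample_revisions = [
    ("2020-03-01T10:00:00Z", "Alice"),
    ("2020-03-01T12:30:00Z", "ExampleBot"),
    ("2020-03-03T09:15:00Z", "Bob"),
    ("2020-03-03T09:20:00Z", "Alice"),
]
series = daily_edit_counts(sample_revisions)
```

Each of the two resulting series maps a date to an edit count, so the dates and counts can be passed directly to the plotting library of your choice as x and y values. Filling empty days with zero matters here: without it, a line plot would connect distant days and hide quiet periods.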

Weekly Questions

Answer the standard participation questions