yilin@iu-luddy: ~ home cv
$ whoami

Yilin Gong

PhD Student · Information Science · Luddy School, Indiana University Bloomington
advisor: Prof. Siqi Wu · previously: MS Data Science, University of Chicago
$ cat about.md
I study crowdsourced fact-checking and the governance of online information.
Current focus: a large-scale audit of X's Community Notes · contributor labor, topic coverage, and the effectiveness-fairness paradox.
Tools: Python · R · topic modeling · network analysis · mixed-effects models · LLMs.
Venues: ICWSM · AAAI · IC2S2.
$ ls papers/ --long
[2026]

The Effects of Request Alerts on the Diversity and Visibility of Community Notes

Request Alerts redirect existing contributor labor rather than generating new effort, with a quality-distance trade-off when writers venture outside their topic expertise.
Yilin Gong, Siqi Wu · in submission · [arxiv] [pdf] [code]
$ contact
yg24@iu.edu · [cv] · [scholar] · [github] · [linkedin]
yilin@iu-luddy: ~/cv home cv
$ cat cv.txt

Yilin Gong

PhD Student · Information Science · Luddy School, Indiana University Bloomington

education

2025 – present
PhD, Information Science
Luddy School of Informatics, Computing, and Engineering · Indiana University Bloomington
Advisor: Prof. Siqi Wu. Focus: crowdsourced fact-checking, Community Notes, platform governance.
2022 – 2023
MS, Data Science
University of Chicago · GPA 3.87
Coursework: time-series forecasting, data mining, big-data platforms, marketing analytics, credit and insurance risk analytics, data engineering, data visualization.
2018 – 2021
BS, Information Science
University of Colorado Boulder · GPA 3.9 / 4.0
Minor in Creative Technology Design.

publications

2026
The Effects of Request Alerts on the Diversity and Visibility of Community Notes
Yilin Gong, Siqi Wu · in submission · [arxiv]
Request Alerts redirect existing contributor labor rather than generating new effort, with a quality-distance trade-off when writers venture outside their topic expertise.

research projects

May – Sep 2024
Dissemination and Online Influence of Films in the Advancement of Disability Rights
Northwestern University · with Hongjin Chen
Cross-cultural mixed-methods study across US and China (N = 770+). SPSS reliability testing (Cronbach's α, KMO) and factor analysis; LDA topic modeling on reviews from IMDb, Rotten Tomatoes, and Douban; Granger causality on Google Trends signals (0.8725 Pearson).
Feb – Dec 2023
Non-EDI Account Usage Prediction
University of Chicago · capstone with Power Kiosk
Random Forest + AdaBoost ensemble reaching 95% accuracy (30% gain over baseline). Built a Google Search API + Beautiful Soup pipeline that scraped ~10,000 company profiles for enterprise classification.
Sep – Dec 2022
Twitter Users Education Analysis
University of Chicago · big-data course project
Analyzed ~100M tweets (~500 GB) with PySpark on GCP. MinHash-based similarity to surface duplicate content across government, health, and media accounts.
Sep – Nov 2021
Data-Driven Investment Analysis
University of Colorado Boulder
Financial data on 1,000+ public companies; KMV credit-risk modeling to estimate default probabilities and shortlist potential targets.

experience

Feb 2024 – May 2025
Data Strategy & Operations Lead
Dope Film · Chicago
Built KPI dashboards, ran competitive analysis, and designed data-driven retention strategies for a platform connecting filmmakers and small businesses.
Jul – Sep 2023
Economic Researcher / Data Analyst
People's Bank of China · Wuhan, Hubei
Designed a mixed-methods study of interest-rate transmission; built a seasonally adjusted transmission model in R that informed end-of-year fiscal review. Contributed to a ~15% improvement in the bank's trend forecasts.
May – Jul 2023
Special Education Technology Volunteer
Aite Special Children's Inclusive Education · Wuhan
Participatory research with special-education practitioners. Led a team building an AI tool that converts teacher observations into parent-friendly progress reports.
Mar – Aug 2022
Litigation Support Analyst
Zenith American Solutions · Remote
Managed document-review databases (1 TB+) for 15+ legal teams; designed search strategies across 500K+ documents, cutting research time by ~35%.

awards

2021
Reilly News Ed Scholarship
University of Colorado Boulder
2020
Honor of Outstanding Performance
DIAN Insurance

skills

methods
Mixed-methods research, experimental design, survey design, statistical analysis, time-series analysis, mixed-effects models, Granger causality, factor analysis (Cronbach's α, KMO), topic modeling (BERTopic, LDA), network analysis, sentiment analysis.
programming
Python (Pandas, NumPy, scikit-learn, NLTK, statsmodels, Beautiful Soup), R, SQL, PySpark, Google Cloud Platform, Plotly, Matplotlib, Seaborn, SPSS.
ml / nlp
Random Forest, AdaBoost, cross-validation and hyperparameter tuning, LLM fine-tuning (BERT, TinyLlama), embedding-based retrieval and clustering (UMAP + HDBSCAN).
languages
English (professional), Mandarin Chinese (native).
© 2026 Yilin Gong · hosted on GitHub Pages last updated