README

This repository contains the code and data for Zhao et al. "Mapping the Spiral of Silence: Surveying Unspoken Opinions in Online Communities" to appear at CHI 2026.

Repository Structure

spiral-of-silence/
├── generate_viewpoints.py    # Script to generate topics and viewpoints using LLMs
├── utils.py                  # Utility functions for Reddit API and OpenAI API
├── paper_analyses.ipynb      # Python notebook for data analysis and visualizations
├── Regression.Rmd            # R markdown file for statistical modeling
├── data.csv                  # Survey data (participant responses)
├── prompts/                  # LLM prompts for topic/viewpoint generation
│   ├── topic_1.json          # Prompt for generating controversial topics
│   ├── viewpoint_1.json      # Prompt for generating viewpoints on topics
│   └── summarize_1.json     # Prompt for shortening viewpoints
└── README.md                 # This file

Setup

Prerequisites

= Python 3.10
R (for statistical analyses)
Reddit API credentials
OpenAI API key

Installation

Clone this repository:

git clone <repository-url>
cd spiral-of-silence

Install Python dependencies:

pip install praw openai python-dotenv pandas tqdm numpy matplotlib scipy jupyter

Create a .env file in the root directory with your API credentials:

REDDIT_CLIENT=your_reddit_client_id
REDDIT_SECRET=your_reddit_secret
OPENAI_API=your_openai_api_key

Usage

Generating Topics and Viewpoints

The generate_viewpoints.py script generates controversial topics and corresponding viewpoints for specified Reddit subreddits:

from generate_viewpoints import main

# Generate topics and viewpoints for subreddits
subreddits = ['politics', 'technology', 'science']
filepath = 'output_{}.json'  # Output file pattern
main(subreddits, filepath, model='gpt-4o')

The script performs three steps:

Topic Generation: Generates controversial topics that would lead to disagreement within each subreddit
Viewpoint Generation: Creates viewpoints representing different sides of each topic
Viewpoint Shortening: Condenses viewpoints to shorter statements

Data

The data.csv file contains survey responses with the following key variables:

Participant characteristics: Age, Gender, Race, Political, Karma, Usage, Posting, WTSC
Community characteristics: Subreddit, Diversity, Inclusion, Safety, percent (content removal rate), modratio
Topic/Viewpoint: Topic, Viewpoint, Agreement, Sub_Agree, Incongruency
Outcomes: Comment_Likelihood, Upvote_Likelihood

Citation

@inproceedings{zhao2026spiral,
   author = {Dora Zhao and Diyi Yang and Michael S. Bernstein},
   title = {Mapping the Spiral of Silence: Surveying Unspoken Opinions in Online Communities},
   booktitle = {Proceedings of the 2026 CHI Conference on Human Factors in Computing Systems},
   year = {2026}
}

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

README

Repository Structure

Setup

Prerequisites

Installation

Usage

Generating Topics and Viewpoints

Data

Citation

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
prompts		prompts
LICENSE		LICENSE
README.md		README.md
Regression.Rmd		Regression.Rmd
data.csv		data.csv
generate_viewpoints.py		generate_viewpoints.py
paper_analyses.ipynb		paper_analyses.ipynb
utils.py		utils.py

Folders and files

Latest commit

History

Repository files navigation

README

Repository Structure

Setup

Prerequisites

Installation

Usage

Generating Topics and Viewpoints

Data

Citation

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages