How collective bargaining for information, public AI, and HCI research all fit together

Published

October 11, 2025

Source: Data Leverage Substack
Date Published: October 11, 2025

Photo by Kelly Sikkema on Unsplash

This is a recap post (a big round-up of links to content I’ve written recently). It will likely be updated once or twice, with a change log up top.

Change log:

I’ve written quite a few newsletters in the past months. One of my reasons for working on all these newsletters is to write, and thus think, in public. (I’ve also been trying to populate more content on several sites that provide “externalized notes”, e.g. on data licenses and data napkin math). To contextualize these numerous posts, I’m going to summarize the various positions I’ve taken. I’ll also try to pull out a few resolvable predictions from my “positions”. Also: nothing in this post is meant to reflect the opinions of my co-authors, i.e., opinions expressed here are my own, do not reflect my employer or colleagues, etc.

At a high level, (I think) my “core positions” consist of two distinct ideas:

Data leverage: Data flow can, and should, be used as a governance lever. [2020 FAccT paper: ACM DL | arxiv] [2022 Dissertation]

Public AI (pAI): we should build AI systems that are publicly accessible and accountable [public AI network website] [publicai.co inference utility product]

While not positions per se, in my writing and research I also promote a more general “we should bring empirical human-computer interaction and computational social science to AI” attitude. This involves writing about interfaces for data-dependent technologies, evaluating new AI models (e.g., auditing and analyzing LLM behavior in high-stakes contexts), studying online platforms (e.g., continuing to study knowledge gaps in Wikipedia, studying governance and responsible AI practices on HuggingFace), and thinking about “AI literacy”.

To recap chronologically, here is a list of blogs, summarized in one or two sentences, starting from November 2023:

During this time, various research projects I’ve been involved with also intersect with these various positions (some are mentioned above):

Some topics that I’ve micro-blogged about (I sometimes microblog directly to a “blogs” GitHub repo), and hope to write some longer thoughts on: