Thomas Steinke
@stein.ke
4.1K followers 680 following 350 posts
Computer science, math, machine learning, (differential) privacy Researcher at Google DeepMind Kiwi🇳🇿 in California🇺🇸 http://stein.ke/
Pinned
stein.ke
I'm going to slowly repost my math notes from the other site🐦 here🦋; they're the only thing I posted over there that I think may have some long-term value & be worth not deleting.

These started out as notes for myself, but people seem to appreciate them. 😅

I'll keep track of all of them in this thread.
Reposted by Thomas Steinke
aaroth.bsky.social
The FORC 2026 call for papers is out! responsiblecomputing.org/forc-2026-ca... Two reviewing cycles with two deadlines: Nov 11 and Feb 17. If you haven't been, FORC is a great venue for theoretical work in "responsible AI" --- fairness, privacy, social choice, CS&Law, explainability, etc.
FORC 2026: Call for Papers
The 7th annual Symposium on Foundations of Responsible Computing (FORC) will be held on June 3-5, 2026 at Harvard University. Brief summary for those who are familiar with past editions (prior to 2…
responsiblecomputing.org
stein.ke
IMHO, the best analog to the AI bubble is the dotcom bubble. Yes, the internet proved to be economically transformative, but there was still a bubble. Companies made a lot of money in the end, but not necessarily the ones people expected -- e.g., see Cisco:
[Plot of Cisco's stock price from 1990-2025: huge growth around 2000, then a crash, followed by slow regrowth.]
stein.ke
The final piece of the puzzle is how do you choose the subsamples? Here's where having a relative who knows combinatorics was helpful. 😁 Basically, the subsets should form a covering design. And the minimal size of a covering design parameterizes the tradeoff.
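To make the covering-design condition concrete, here's a tiny brute-force check. This is my own illustration of the property as I read it from the thread (not code from the paper): every set of t indices must be avoided by at least one subsample, i.e., the subsample complements cover all t-subsets.

```python
from itertools import combinations

def avoids_every_t_subset(n, t, subsamples):
    # True iff, for every t-subset of indices, some subsample is
    # disjoint from it -- equivalently, the complements of the
    # subsamples form a covering design over t-subsets.
    return all(
        any(not set(ts) & set(s) for s in subsamples)
        for ts in combinations(range(n), t)
    )

n, t = 6, 2
exhaustive = list(combinations(range(n), n - t))  # all (n-t)-subsets
partition = [(0, 1, 2), (3, 4, 5)]                # disjoint parts
print(avoids_every_t_subset(n, t, exhaustive))    # True, but C(6,4)=15 sets
print(avoids_every_t_subset(n, t, partition))     # False: {0, 3} hits both parts
```

The smaller a covering design you can find, the fewer oracle calls you need -- which is the tradeoff the post describes.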
stein.ke
OK, so can we get the best of both worlds? or at least trade off between the cost of privacy in terms of accuracy/data and the number of subsamples we need to evaluate on?

That's what we address in our new paper. The answer is yes, but the tradeoff is quite steep (and we have a lower bound).
differentialprivacy.org
Privately Estimating Black-Box Statistics

Günter F. Steinke, Thomas Steinke

http://arxiv.org/abs/2510.00322

Standard techniques for differentially private estimation, such as Laplace or Gaussian noise addition, require guaranteed bounds on the sensitivity of the estimator in question. But such sensitivity bounds are often large or simply unknown. Thus we seek differentially private methods that can be applied to arbitrary black-box functions. A handful of such techniques exist, but all are either inefficient in their use of data or require evaluating the function on exponentially many inputs. In this work we present a scheme that trades off between statistical efficiency (i.e., how much data is needed) and oracle efficiency (i.e., the number of evaluations). We also present lower bounds showing the near-optimality of our scheme.
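For reference, the "standard technique" the abstract contrasts against is noise addition calibrated to a known sensitivity bound. A minimal sketch of the classic Laplace mechanism (standard textbook construction, not code from the paper):

```python
import numpy as np

def laplace_mechanism(value, sensitivity, epsilon,
                      rng=np.random.default_rng()):
    # Classic eps-DP release: Laplace noise with
    # scale = (global sensitivity) / eps.
    return value + rng.laplace(0.0, sensitivity / epsilon)

# A mean of n values known to lie in [0, 1] has sensitivity 1/n:
data = np.array([0.2, 0.7, 0.4, 0.9])
print(laplace_mechanism(data.mean(), sensitivity=1 / len(data), epsilon=1.0))
```

The paper's starting point is precisely that such a sensitivity bound is often unavailable, which is what forces you into black-box methods.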
stein.ke
In this paper we showed that we can do this kind of versatile black-box estimation with only an *additive* cost of privacy, where sample-and-aggregate suffers a *multiplicative* cost. But this uses exponentially many subsamples - i.e., all subsets of size n-t instead of a partition into t parts.
differentialprivacy.org
Privately Evaluating Untrusted Black-Box Functions
Ephraim Linder, Sofya Raskhodnikova, Adam Smith, Thomas Steinke
http://arxiv.org/abs/2503.19268
We provide tools for sharing sensitive data when the data curator doesn't know in advance what questions an (untrusted) analyst might ask about the data. The analyst can specify a program that they want the curator to run on the dataset. We model the program as a black-box function $f$. We study differentially private algorithms, called privacy wrappers, that, given black-box access to a real-valued function $f$ and a sensitive dataset $x$, output an accurate approximation to $f(x)$. The dataset $x$ is modeled as a finite subset of a possibly infinite set $U$, in which each entry represents data of one individual. A privacy wrapper calls $f$ on the dataset $x$ and on some subsets of $x$ and returns either an approximation to $f(x)$ or a nonresponse symbol $\perp$. The wrapper may also use additional information (that is, parameters) provided by the analyst, but differential privacy is required for all values of these parameters. Correct setting of these parameters will ensure better accuracy of the wrapper. The bottleneck in the running time of our wrappers is the number of calls to $f$, which we refer to as queries. Our goal is to design wrappers with high accuracy and low query complexity. We introduce a novel setting, the automated sensitivity detection setting, where the analyst supplies the black-box function $f$ and the intended (finite) range of $f$. In the previously considered setting, the claimed sensitivity bound setting, the analyst supplies additional parameters that describe the sensitivity of $f$. We design privacy wrappers for both settings and show that our wrappers are nearly optimal in terms of accuracy, locality (i.e., the depth of the local neighborhood of the dataset $x$ they explore), and query complexity. In the claimed sensitivity bound setting, we provide the first accuracy guarantees that have n…
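The "exponentially many subsamples" in the post above is easy to quantify: evaluating on all subsets of size n−t means one oracle call per t-subset removed, i.e., C(n, t) calls. A quick back-of-envelope comparison (my numbers, for illustration):

```python
from math import comb

n, t = 1000, 10
print(f"all (n-t)-subsets:      {comb(n, t):.3e} evaluations")  # ~2.6e23
print(f"partition into t parts: {t} evaluations")
```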
stein.ke
So the obvious question is: *can sample-and-aggregate be made more data-efficient?* 🤔
E.g., instead of partitioning the dataset, can we use overlapping subsamples?
Unfortunately, using standard aggregation methods, this doesn't work (because overlapping subsamples means higher sensitivity).
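A back-of-envelope way to see the sensitivity problem (my illustration, not a claim from the thread):

```python
# With k disjoint parts and per-part estimates clipped to [0, 1], the mean
# of the estimates has sensitivity 1/k: one datapoint touches one part.
# If every datapoint instead appears in r overlapping subsamples, changing
# it can move r of the estimates, so the sensitivity becomes r/k and the
# Laplace noise scale grows by a factor of r -- wiping out the gain.
k, r, eps = 10, 5, 1.0
print("noise scale, partition:", (1 / k) / eps)   # 0.1
print("noise scale, overlap:  ", (r / k) / eps)   # 0.5
```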
stein.ke
...it's inefficient in terms of data. You only get accuracy from evaluating on a subsample & each subsample gets treated like one datapoint for the privacy analysis. In terms of sample complexity, the cost of privacy is multiplicative, when it _should_ be additive.

But it can be practical e.g.:
Semi-supervised Knowledge Transfer for Deep Learning from Private Training Data
Some machine learning applications involve training data that is sensitive, such as the medical histories of patients in a clinical trial. A model may inadvertently and implicitly store some of its tr...
arxiv.org
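To illustrate the multiplicative-vs-additive point with toy numbers (mine, not the paper's):

```python
# Suppose a non-private estimator needs n0 samples for the target accuracy.
# Sample-and-aggregate hands each evaluation only an eps fraction of the
# data, so you need n ~ n0/eps samples overall: a multiplicative cost.
# An additive cost of privacy would instead look like n ~ n0 + c/eps.
n0, eps, c = 10_000, 0.1, 100
print("multiplicative cost:", n0 / eps)      # 100,000 samples
print("additive cost:      ", n0 + c / eps)  # 11,000 samples
```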
stein.ke
First, what is sample-and-aggregate?

It's a generic method for differentially private estimation. Partition your data into 1/ε subsamples; evaluate your function on each subsample; then combine the values in an ε-differentially private manner.

It's flexible - it works for *any* function - but...
cs-people.bu.edu
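To make the recipe concrete, here's a minimal sketch; it's my own illustration of the generic scheme (the clipped noisy mean is a crude stand-in for the aggregators typically used, such as a DP median):

```python
import numpy as np
from math import ceil

def sample_and_aggregate(data, f, epsilon, lo, hi, k=None,
                         rng=np.random.default_rng()):
    # Each datapoint lands in exactly one part, so an eps-DP
    # aggregation of the k per-part estimates is eps-DP overall.
    k = k or ceil(1 / epsilon)
    parts = np.array_split(np.asarray(data), k)
    vals = np.clip([f(p) for p in parts], lo, hi)
    # Changing one datapoint moves one clipped value, so the mean
    # of the k values has sensitivity (hi - lo) / k.
    return vals.mean() + rng.laplace(0.0, (hi - lo) / (k * epsilon))

# e.g. privately estimate a mean known to lie in [0, 1]:
rng = np.random.default_rng(0)
data = rng.uniform(size=1000)
print(sample_and_aggregate(data, np.mean, epsilon=0.5, lo=0.0, hi=1.0))
```

Note the flexibility: f can be *any* function of a subsample, which is exactly the "works for any function" point above.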
stein.ke
I'm excited to share this paper.

It answers a question that has bugged me for a long time: Can sample-and-aggregate be made more data-efficient? The answer is yes, but at a steep price in computational efficiency. See 🧵 for more details.

Also, it was a fun opportunity to add a new coauthor. 😁
differentialprivacy.org
Privately Estimating Black-Box Statistics

Günter F. Steinke, Thomas Steinke

http://arxiv.org/abs/2510.00322
stein.ke
The fourth deadliest war in history killed 20-30 million people (more than WW1) and was a rebellion led by a man claiming to be the brother of Jesus Christ, and you've never even heard of it. 🤯
stein.ke
Apps need to ask for permission to access the camera/microphone. The same should apply to the speaker. Some apps should never make a sound. 🔇
stein.ke
Yeah, why not?
I was taught "non-decreasing" and mostly never questioned it, but really "non-strictly increasing" seems better.
stein.ke
Sure, of course it can matter, but do we need totally different terms? Wouldn't it make more sense to add a qualifier (strictly increasing vs. non-strictly increasing)?
stein.ke
Why do we use "non-decreasing" as a synonym for increasing? The negation is confusing. E.g., the sine function is not decreasing. Is the distinction between strictly and non-strictly increasing meaningful?
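For readers missing the context, the notions being conflated, written out under one common convention:

```latex
\begin{align*}
\text{non-decreasing (weakly increasing):} &\quad \forall x \le y,\ f(x) \le f(y)\\
\text{strictly increasing:} &\quad \forall x < y,\ f(x) < f(y)\\
\text{``not decreasing'' (mere negation):} &\quad \exists\, x < y,\ f(x) < f(y)
\end{align*}
```

Sine satisfies the third but not the first, which is exactly the confusion the post is pointing at.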
stein.ke
On my way to work I saw a billboard that just said "caffeine" – as if that shit needs to advertise.
stein.ke
I just had to enter my birthdate using an interface that required me to click once for every month of my life. This is offensive on multiple levels.
stein.ke
🤦🤦🤦 this is not how two-factor authentication works 🤦🤦🤦🤦
[Screenshot of an app]
Code Verification 
Call customer service at 1-866-422-0306 (outside the U.S. call 1-210-677-0065) to retrieve your One-time identification Code.
Reposted by Thomas Steinke
focs2025.bsky.social
To reduce barriers to attendance, #FOCS2025 will try to facilitate childcare for attendees during the conference.

Information, how to register your interest, and how to apply for financial support: focs.computer.org/2025/childca...

Deadline (for the latter): ⏰ September 19th (AoE)
Childcare Support – FOCS 2025
focs.computer.org
stein.ke
Not _good_ code, evidently.
stein.ke
Today I wrote some code in which print("failed successfully") made perfect sense. 🙃
stein.ke
It's probably just laziness. Email-based authentication is easy to implement, cheaper than SMS, & more reliable than passwords etc. It offloads security to your email provider.
stein.ke
The year is 2100. The verb "to coldplay" means to inadvertently confess by acting guilty when caught doing something otherwise innocuous. No one remembers the etymology.
stein.ke
Ban in-app browsers.
stein.ke
Amazon informs me that Prime day is expected to occur between July 8 and July 11. Does anyone know what the confidence level for that interval is? Is it 95%? When will we get more precise measurements?