We read everything.
So our clients don't have to.

Staying ahead of the curve is part of the job. Every week we read frontier research from the AI labs, primary papers from academia, and the major firms' annual outlooks — and translate them into something an operator can actually use.

This page collects the work that has shaped how we think — annual reports we re-read every year, frontier research notes from the labs, and the historical case studies we still cite in client engagements.

Operator-grade decisions need primary research, not a recap.

Our edge is that we read the source material — directly, regularly, and with the question every operator actually has in mind: what does this change about how the work gets done?

Frontier

AI labs, end to end

OpenAI, Anthropic, and the rest — read across their product announcements, interpretability work, and formal research. The leading indicator for what's about to be deployable — and what's about to be unsafe — in client workflows.

Annual

Bain, McKinsey, Deloitte — every year, end to end

Read with an operator's eye, not a strategist's. We look past the trend lists for the operating-model gap that determines who actually captures the gain.

Historical

Case studies we keep returning to

Some operating-model lessons travel decades. We catalog the historical scale events — Tesla 2018, others — that still inform how we design for clients today.

Notes on the primary research shaping the next eighteen months.

Working notes on papers from OpenAI, Anthropic, and Stanford's CGRI — read for what they imply about deploying agents and rebuilding workflows in real organizations.

2026
Frontier
Frontier Research — Anthropic

Cross-Architecture Model Diffing with Crosscoders

Anthropic on a method for surfacing — without telling it where to look — what's actually different between two LLMs. The operating-model implication for every team about to swap a model in production.

7 min read1 source analyzedAnthropic
Read
2026
Frontier
Frontier Research — Anthropic / Transformer Circuits

Emotions in Models — Interpretability Research

Anthropic's interpretability team on emotional patterns inside frontier models — and why that matters when one is sitting inside a client workflow.

7 min read1 source analyzedAnthropic / Transformer Circuits
Read
2025
Frontier
Frontier Research — OpenAI

Monitoring Internal Coding Agents for Misalignment

OpenAI's own playbook for catching their internal coding agents going off-task — and what it tells every operator about deploying agents inside a real workflow.

7 min read1 source analyzedOpenAI
Read
2025
Frontier
Frontier Research — OpenAI

GPT as a Measurement Tool

Treating an LLM as a measurement instrument — and what it changes about how you instrument a workflow.

8 min read1 source analyzedOpenAI
Read
2025
Frontier
Frontier Research — OpenAI

Graviton

Reading OpenAI's Graviton paper for what it implies about the cost curve of running agents in production — not just training models.

6 min read1 source analyzedOpenAI
Read
2025
Frontier
Frontier Research — Anthropic

Long-Running Claude

Anthropic on what changes when an agent runs for hours instead of seconds — and why most workflows haven't caught up to it.

7 min read1 source analyzedAnthropic
Read
2025
Frontier
Frontier Research — Anthropic

Anthropic's 81,000 Interviews

What it means when a frontier AI lab grounds its product decisions in 81,000 user interviews — and what it tells operators about disciplined research practice.

6 min read1 source analyzedAnthropic
Read
2024
Frontier
Frontier Research — Stanford GSB

Trust and Its Consequences

Stanford's Corporate Governance Research Initiative on what trust does — and what its absence costs — inside an organization.

6 min read1 source analyzedStanford GSB — Corporate Governance Research Initiative
Read

One reading per year, since the year we started.

We read the major firms' tech outlooks the same way an operator does — looking past the trend lists for the operating-model gap that determines who actually captures the gain. One synthesis per year, 2021 through 2025.

2025
Annual
Annual Research — 2025

Agents Arrived. Most Operating Models Aren't Ready.

McKinsey, Bain, and Deloitte's 2025 outlooks tell three versions of the same story: the unit of automation moved from a task to a workflow to an agent — and most companies are about to find out their workflows weren't built for software that acts.

13 min read3 sources analyzedMcKinsey & Company · Bain & Company · Deloitte
Read
2024
Annual
Annual Research — 2024

From Pilot Theater to P&L Impact

McKinsey and Deloitte's 2024 outlooks both ratified what operators had already learned the hard way: the AI dividend was real, but it lived inside the operating model — not the model.

11 min read2 sources analyzedMcKinsey & Company · Deloitte
Read
2023
Annual
Annual Research — 2023

Generative AI Was the Headline. Operating Discipline Was the Story.

Three of the year's defining reports converged on the same uncomfortable truth: most companies didn't have the operational shape to absorb the technology they were buying.

12 min read3 sources analyzedMcKinsey & Company · Bain & Company · Deloitte
Read
2022
Annual
Annual Research — 2022

The Year the Music Slowed

Two reports, one operator-grade conclusion: the firms that survive a downturn aren't the ones that cut hardest — they're the ones whose systems were already lean.

11 min read2 sources analyzedBain & Company · McKinsey & Company
Read
2021
Annual
Annual Research — 2021

The Pandemic Pull-Forward

Tech got two years of demand in twelve months — and most operating models didn't catch up.

9 min read1 source analyzedBain & Company
Read

The historical scale events we still cite in client work.

Some operating-model lessons travel decades. These are the ones we return to most often when a leadership team is about to take their organization through the next jump in scale.

2024
Case Study
Case Study

Anthropic, 2024: When the Research Lab Became a Product Company

How Anthropic restructured to ship enterprise product at scale without losing the research identity that differentiated it — and the operating-model lessons for any company trying to be two things at once.

13 min read1 source analyzedAnthropic
Read
2024
Case Study
Case Study

Replit, 2024: The Stack They'd Built for Eight Years Was the Pivot

How Replit recognized that the unglamorous integrated infrastructure they'd been compounding since 2016 — Repls, Nix environments, hosting, deploys, integrated database, object storage, auth — was the asset that made AI-agent app generation actually shippable, and how they restructured the company around that recognition without abandoning the existing 30M+ user base.

16 min read4 sources analyzedReplit · Replit · Replit / Amjad Masad · Replit / Amjad Masad
Read
2022
Case Study
Case Study

Netflix, 2022: When the Subscriber Story Stopped Telling Itself

An operator's reading of Netflix's 2022 — the first subscriber loss in over a decade, the layoffs, the ad-tier reversal, the password-sharing crackdown, and the operating-model rebuild that produced the largest sub adds in years within eighteen months.

13 min read1 source analyzedNetflix, Inc.
Read
2020
Case Study
Case Study

Airbnb, 2020: The Eight Weeks That Rebuilt the Company

An operator's reading of Airbnb's 2020 — when ~80% of revenue evaporated in eight weeks, and the leaner, more focused company that emerged IPO'd at a higher valuation than the pre-pandemic one.

15 min read1 source analyzedAirbnb, Inc.
Read
2018
Case Study
Case Study

Tesla, 2018: When the System Almost Killed the Company That Built It

An operator's reading of Tesla's 2018 — the production crisis, the financial near-miss, and the operating-model lessons that apply to every team scaling faster than its systems.

16 min read1 source analyzedTesla, Inc.
Read

If any of this matches what you're carrying — let's talk.

Most of these reports describe a gap. We close gaps. The first conversation is short, specific, and free.