top of page

Dario Just Got His General



Audio cover
Dario Just Got His General

Why Andrej Karpathy choosing Anthropic is the most significant talent move in AI history.


If you've been offline this week, here's what you missed.

Andrej Karpathy — co-founder of OpenAI, the man who built the brain behind every Tesla on the road, the person who coined the term "vibe coding," and one of the most trusted voices in all of AI — announced he's joining Anthropic. One sentence on X. Understated. Precise. Classic Karpathy. "I've joined Anthropic. I think the next few years at the frontier of LLMs will be especially formative."


The AI world promptly lost its mind. And they were right to.


You Need to Understand Who This Person Is

Andrej Karpathy isn't famous the way tech executives are famous — through press releases and conference keynotes and carefully managed personal brands.


He's famous the way scientists are famous. Because the work is undeniable.

In 2015, he was in the founding room at OpenAI — the original group that included Sam Altman, Elon Musk, Ilya Sutskever, and Greg Brockman. He was there when the mission was set. In 2017, Elon Musk personally pulled him out to run Tesla's Autopilot and Full Self-Driving programs. He spent five years building the visual intelligence system that drives — literally — millions of vehicles. When you hear Tesla talk about its neural network approach to autonomy, that architecture has Karpathy's fingerprints all over it. He left Tesla in 2022. Came back to OpenAI in 2023 for about a year. Then left again in 2024 to found Eureka Labs, an AI education startup, because one of his other roles — the one his followers care about as much as anything — is teacher. His YouTube channel and his Neural Networks: Zero to Hero course have probably done more to build the next generation of AI researchers than any university program running right now. He explains the hardest concepts in the field with a clarity that is genuinely rare. Millions of people understand transformers and backpropagation because of him. He coined "vibe coding" — the practice of describing what you want to build in natural language and letting AI generate the implementation. Not a joke. Not a meme. A description of how serious builders are now actually working.


He had been independent for over a year. He could have gone anywhere.

He chose Anthropic.


What He's Actually Going to Do There

This is the part that separates signal from hype. Karpathy is joining Anthropic's pre-training team, working under Nick Joseph. His specific mandate: build and lead a team focused on using Claude to accelerate pre-training research. Let that land.


He's not going to Anthropic to manage. He's going as an individual contributor who will build a team. And that team's job is to use Claude — the model — to make Claude better.


Pre-training is where the foundational intelligence of a model is baked in. It's the most expensive, compute-intensive, technically demanding phase of building a frontier model. It's where the real capability ceiling gets set. Everything else — fine-tuning, alignment, safety work — operates within the space that pre-training defines.


Karpathy is one of a very small number of researchers on the planet who can bridge the gap between LLM theory and the realities of large-scale training at frontier scale. He has done this before. At Tesla, he did it for perception. At OpenAI, he did it for language models in their earliest form.

Now he's going to try to do it recursively — using the model to improve the model. That's not incremental. That's structural.


Why This Matters More Than a Headline

There's a version of this story that reads as another big tech talent announcement. Researcher moves from one lab to another. Labs compete for talent. Film at eleven. That's not what this is.


The AI field is in a specific phase right now where pre-training improvements are becoming harder to achieve through pure compute scaling. The easy gains from throwing more GPUs at bigger datasets are narrowing. The next frontier is research efficiency: finding better ways to train, better architectures, better approaches to what data means and how models learn from it. Karpathy is exactly the kind of researcher who operates at that frontier. Not a manager who supervises people who do the work. Someone who does the work and understands it deeply enough to see what others miss. Anthropic has consistently positioned itself as the lab most serious about research rigor. Constitutional AI, interpretability work, the mechanistic understanding of how models actually function — these aren't marketing initiatives. They're genuine research programs that the rest of the field reads and responds to. Adding Karpathy to that environment, specifically to lead a team applying Claude to its own pre-training, signals something about where Anthropic thinks the real leverage is in the next phase of this race. They think the answer isn't more compute. It's smarter research. And they just hired the person who might be best positioned to figure out what that actually means in practice.


The Signal in the Choice

Here's what I keep thinking about. Karpathy had been independent for over a year. He'd built his own thing. He had credibility, visibility, and options that most people in any field never see. He could have stayed independent. He could have gone back to OpenAI. He could have started something new.

He looked at every lab operating at the frontier and chose Anthropic.

That's a signal.


When someone who doesn't need the job, doesn't need the visibility, and has seen the inside of multiple frontier organizations chooses one over the others — that tells you something about what they see when they look at the work, the culture, and where the genuinely hard problems are getting attacked. Karpathy's announcement said: "I think the next few years at the frontier of LLMs will be especially formative."


He's going to where he thinks the most consequential work will happen.

The labs are kingdoms in competition right now. Each one is betting on different research directions, different scaling strategies, different theories about what intelligence actually is and how to build more of it.

Dario Amodei just brought in one of the best technical minds in the history of the field. That's not aura. That's a structural advantage.


What Comes Next

The obvious question is: what does this mean for the competitive landscape?


The honest answer is that it takes time. Karpathy joining Anthropic in May doesn't change what Claude can do in June. Research cycles are long. Pre-training runs take months. The payoff from building a team focused on AI-accelerated research won't show up in a benchmark next quarter.

But frontier AI is a long game being played in short cycles. The organizations that attract researchers who think at this level are the ones that define where the ceiling moves.


OpenAI built its early dominance in part because of who was in the room when it started. Karpathy was one of them.


Now he's in a different room. And based on everything he's said publicly about what he believes matters — research rigor, understanding the models deeply rather than just scaling them, building education infrastructure for the next generation — Anthropic is a coherent choice. Not a surprising one, when you understand what he actually cares about.

"The next few years at the frontier of LLMs will be especially formative."


He said it himself. He's now at the lab where he thinks the most important formative work will happen. Pay attention to what comes out of Anthropic's pre-training team in the next 18 months.


Rich Washburn is a technologist and strategist working at the intersection of AI, infrastructure, and capital. He is Managing Partner and Chief AI Officer at Eliakim Capital and CIO of Data Power Supply.

Animated coffee.gif
cup2 trans.fw.png

© 2018 Rich Washburn

bottom of page