Video Summary

Andrej Karpathy: From Vibe Coding to Agentic Engineering

Sequoia Capital

Main takeaways

vibe coding: trusting LLMs to produce working code has changed programming workflows since late 2025.

software 3.0 reframes programming as prompting—your context window is the lever over an LLM interpreter.

agentic engineering is emerging as a discipline to coordinate powerful but stochastic agents safely and productively.

LLMs show 'jagged' capabilities: great on some tasks, unreliable on others, so human oversight remains essential.

automation advances fastest in verifiable domains where outputs can be checked or RL environments built for fine-tuning.

Key moments

Questions answered

What does Karpathy mean by 'vibe coding'?

Vibe coding describes a flow state where developers trust modern LLMs to generate functioning code with minimal correction, shifting the role of the programmer toward directing and composing prompts.

How does Karpathy define Software 3.0?

Software 3.0 is a paradigm where prompting an LLM is the primary programming act—the context window becomes the lever over an interpreter-like model that performs computations and actions.

What is agentic engineering?

Agentic engineering is an emerging discipline focused on designing, coordinating, and accelerating stochastic agents (LLM-based agents) while maintaining quality, safety, and human oversight.

Why does Karpathy call LLMs 'ghosts' rather than animals?

He argues LLMs are jagged, statistical, summoned entities—not biological minds—so they require new kinds of taste, judgment, and direction rather than anthropomorphic expectations.

What practical areas should founders prioritize now?

Founders should target verifiable tasks where outputs can be measured or RL environments can be created for fine-tuning, and build agent-native infrastructure that enables agents to act with context and permissions.

Feeling Behind as a Programmer 00:50

"I've never felt more behind as a programmer."

Andrej Karpathy expresses a unique mix of exhilaration and unease regarding his programming skills in the current landscape. He notes that while he has been utilizing agentic tools like low-code platforms, the evolution of AI has drastically changed expectations.
In December, Karpathy discovered that the latest AI models were producing functioning code with little need for correction on his part. This realization led him to trust the AI systems more and engage in "vibe coding," where he felt a sense of flow with the technology.
He emphasizes that this shift in AI capabilities requires a reevaluation of programming practices, suggesting that many people experienced AI in a limited way last year and didn't fully grasp the profound changes happening from December onwards.

The Evolution of Software Paradigms 02:28

"Software 3.0 is about prompting, and what's in the context window is your lever over the interpreter."

Karpathy outlines a new paradigm in software development, transitioning from Software 1.0, characterized by explicit rules, to Software 2.0, which relied on machine learning and dataset training, and now to Software 3.0, which centers around prompting language models and leveraging their capabilities intelligently.
He illustrates this change with the example of installing software, moving from complex shell scripts in Software 1.0 to simply copying and pasting commands into an AI agent in Software 3.0. This evolution allows for more flexible and powerful interactions with software by reducing the need for detailed technical knowledge.

New Opportunities with AI and Information Processing 08:00

"This is not just about programming becoming faster; this is a more general information processing that is automatable now."

Karpathy emphasizes that modern AI tools enable the automation of tasks that were previously impossible due to the need for structured data and coding, allowing new types of applications to flourish.
He discusses projects like knowledge base creation with LLMs, which automatically organize and present information without requiring traditional programming methods. This new approach opens doors to innovative solutions and business opportunities that were not feasible before.
He encourages viewers to not just focus on enhancing existing technologies but to recognize and leverage the radical shifts that AI introduces in how we process and interact with information.

Anticipating Future Developments in Technology 08:08

"A lot of this code shouldn't exist, and it's just the neural network doing most of the work."

Karpathy speculates on the future of technology, suggesting that as AI advances, many traditional coding practices will become obsolete. He predicts a future where creating applications and services will increasingly rely on the capabilities of AI to perform the bulk of the work.
He invites speculation on what will become obvious and ubiquitous in the tech landscape of 2026, hinting that a profound transformation is on the horizon and that the current state of development merely scratches the surface of possibilities yet to unfold.

The Evolution of Computing Towards Neural Networks 08:19

"In the early days of computing, it was unclear if computers would resemble calculators or neural networks. We ultimately advanced down the calculator path and built classical computing."

The transition from classical computing, which resembles calculators, to the potential emergence of neural networks as the dominant form of computing is a crucial point of discussion.
Today, neural networks operate on top of existing computers, but there's speculation about a future where neural networks will function as the primary computational host, with traditional CPUs playing a supporting role.
The rapid advancements in neural network capabilities indicate they may soon handle the majority of computing tasks, fundamentally changing how we perceive and utilize intelligence in technology.

The Role of Verifiability in AI Automation 09:41

"AI will automate faster and more efficiently in domains where the output is verifiable."

Verifiability is critical in determining the speed at which various domains can be automated by AI technologies. This concept suggests that AI is more effective in areas where the quality of output can be confidently assessed.
Traditional computers excel at automating specified tasks, while the latest advancements in large language models (LLMs) allow for more effective automation of verifiable outputs due to their training processes which emphasize reinforcement learning.
The uneven progress in LLM capabilities, particularly in tasks requiring strict verification like mathematics and coding, suggests that while some areas are advancing rapidly, others may lag behind due to differing focuses in training data and emphasis from AI laboratories.

The Mystery of Jagged Capabilities in AI Models 10:03

"These models often seem jagged. For example, they can rewrite a large codebase but may give nonsensical advice for simple tasks like walking to a nearby car wash."

The erratic performance of AI models raises questions about their reliability. They may excel in complex tasks yet fail at seemingly simple decisions, indicating a need for users to be cautious and engaged when utilizing these AIs.
This peculiarity, termed jaggedness, suggests a disconnect between the training environments provided and real-world applications, requiring users to actively monitor AI outputs and adapt them as necessary.
The improvements seen in areas like chess from models like GPT-4 can often be traced back to the inclusion of targeted training data, highlighting the importance of the datasets selected by the developers.

Guidance for Founders in the AI Space 13:37

"If you are in a verifiable setting where you could create RL environments or examples, that sets you up to potentially do your own fine-tuning."

Founders looking to innovate in the AI domain should focus on verifiable tasks, as there remains substantial opportunity for automation and optimization even in established fields.
The key for entrepreneurs will be to leverage reinforcement learning and drive advancements in areas not fully exploited by existing labs, potentially through fine-tuning and the integration of diverse datasets.
Ultimately, while major AI labs might dominate certain fields, emergent opportunities exist where founders can develop AI solutions that are both innovative and practical, provided they tap into the correct environments for training and application.

Agentic Engineering Defined 16:26

"Agentic engineering, when I call it that, because I do think it's kind of like an engineering discipline."

Andrej Karpathy introduces the concept of agentic engineering, which he defines as a discipline focusing on the coordination of powerful yet stochastic agents.
He emphasizes the challenge of accelerating the performance of these agents without compromising quality, highlighting the significance of effective management in this context.
Karpathy draws a distinction between agentic engineering and traditional engineering, suggesting that while traditional methods may raise the baseline quality, agentic engineering promises a much higher potential ceiling.

The 10x Engineer Concept 17:01

"I think that people who are very good at this peak a lot more than 10x."

He reflects on the concept of the "10x engineer," suggesting that the efficiency gains realized through agentic engineering far exceed this traditional metric.
Karpathy argues that individuals skilled in harnessing these agents can achieve multipliers greater than ten times in terms of productivity and effectiveness, pointing to the exponential promise of this engineering discipline.

The Evolving Role of a Software Engineer 17:55

"Investing into your own setup... utilizing a lot of the tools that are available to you."

Karpathy discusses how modern coders, particularly those who are adept at using AI, need to leverage the full capabilities of their tools through investment in their setups.
His comparison of traditional tools (like Vim and VS Code) with contemporary AI coding assistants illustrates the shift in required skills and approach in software engineering.
He asserts that strong agentic engineers will have to redefine their hiring processes, moving from traditional problem-solving puzzles to practical assessments of larger, collaborative projects.

The Importance of Human Oversight 19:39

"People have to be in charge of this aspect, this plan."

Karpathy emphasizes the ongoing need for human oversight in the development and deployment of AI agents. He points out the failures of AI, using anecdotes about errors in user identification.
This aspect highlights that while agents can perform many tasks, they still require guidance in understanding context and making judgments that reflect human values and functionalities.
He discusses the balance between allowing agents to handle intricate details while maintaining critical human supervision for aesthetic and design choices in projects.

The Future of Aesthetics in AI Development 22:25

"I do hope that this can improve in future models."

He expresses optimism that future AI models can enhance their performance in aspects of taste and judgment, which he feels are currently lacking.
Karpathy mentions his experiments with simplifying AI training, noting that achieving aesthetic quality remains a challenge for AI systems.
Despite acknowledging that AI agents don't inherently improve with traditional motivational tactics, he believes there's potential for growth and improvement in their aesthetic and judgment capabilities as the technology evolves.

Jagged Forms of Intelligence 23:31

"We're not building animals; we are summoning ghosts."

Karpathy introduces the concept of "jagged forms of intelligence," arguing against the notion that AI will mimic human-like understanding or emotions.
He suggests that acknowledging the unique, non-animal form of intelligence present in AI is crucial for developing effective engagement strategies with these systems.
This framing alters the approach to evaluating AI's performance, emphasizing that whether an AI behaves rationally is less important than its efficiency and effectiveness in fulfilling tasks.

The Shift from Human-Centric to Agent-Centric Systems 24:47

“Everything has to be rewritten. Everything is still fundamentally written for humans and has to be moved around.”

The conversation highlights the transition toward agent-native environments, where agents operate with real permissions and contextual awareness, taking action on behalf of users.
Current frameworks and libraries often cater to human users, leading to frustration in interactions as users seek seamless automation for their agents.
There's a strong desire for more intuitive system designs that allow agents to interact directly with data structures, reducing the need for manual intervention in tasks.

Automation and Agent-Focused Infrastructure 26:10

“How do we make it agent native? Basically describe it to agents first.”

The need for rethinking workloads involves decomposing tasks into manageable units that are specifically designed for agents.
Developers hope for a future where infrastructure enables agents to autonomously interact with various services, minimizing the annoying process of manual configuration.
The goal is for systems that allow prompts to instigate action without needing complex human setups, such as DNS configuration, making systems more efficient and user-friendly.

The Future of Interaction Between Agents 27:21

“I do think we're going towards a world where there's agent representation for people and organizations.”

The vision entails a world where agents can communicate and negotiate with each other, streamlining processes like scheduling and meeting coordination.
This evolution reflects a growing excitement about the possibilities of agent interaction, hinting at a future where personal agents take on more responsibility in managing tasks for their human counterparts.

The Importance of Understanding in the Age of AI 27:45

“You can outsource your thinking, but you can't outsource your understanding.”

As AI becomes more advanced, the speaker underscores the importance of maintaining a robust understanding of the systems and processes being managed.
Current tools, including language models, aid in understanding and processing information, but users remain the ultimate decision-makers who must grasp the implications of their tasks.
There is a perceived bottleneck in effectively directing AI agents if the user does not possess a deep understanding of the subject matter, emphasizing that tools should enhance rather than replace cognitive engagement.

Tools for Enhanced Understanding 29:31

“These are tools to enhance understanding in a certain way.”

The discussion points to the potential of AI and LLMs as tools to generate insights and facilitate deeper comprehension rather than merely automate tasks.
Users still need to actively engage with knowledge and information rather than passively rely on AI outputs, as the responsibility for understanding remains with them.
The journey toward fully automated systems is still underway, with a hopeful outlook for future advancements in AI’s capability to handle understanding tasks.

Browse ai summaries

Jump to the ai topic page and keep exploring related summaries.