
What is an AI Engineer with Shawn Wang (a.k.a Swyx) of @LatentSpaceTV

Humanloop


The podcast explores the rise of AI engineers, their unique skills for rapid deployment, and the competitive AI landscape shaping tech innovation.


The Rise of AI Engineers: A New Role in Applied AI

AI engineering is emerging as a distinct role, filling gaps left by traditional machine learning (ML) engineers. While ML engineers focus on optimizing models and working in the "one to n" phase, AI engineers are tasked with taking products from "zero to one," making models useful in real-world applications. This shift is driven by the rise of foundation models and open-source APIs, which have drastically reduced the time and resources needed to accomplish AI tasks. What once took years of research can now be done in a matter of hours using APIs.

AI engineers are closer to product development, often possessing front-end skills and working directly with product managers and domain experts. This collaboration is crucial, as no amount of engineering can fix a bad product decision. The role of AI engineers exists on a spectrum, from hardcore ML engineers to more product-oriented roles, with an API line separating the two. Companies are increasingly outsourcing ML engineering roles due to rising costs, while AI engineers are becoming essential for productizing AI.

The AI landscape is evolving rapidly, and those who specialize in AI engineering will outperform generalists. Speed is key in this field; a "fire, ready, aim" approach is more effective than traditional methods. Shipping a V1 product quickly and iterating based on real-world data is often the best strategy. Vertical startups, which leverage proprietary data and target niche markets, are outperforming horizontal tools in the AI space.

We're observing a once-in-a-generation shift in applied AI, fueled by the emergent capabilities and the open-source API availability of foundation models. A wide range of AI tasks that used to take five years and a research team to accomplish in 2013 now just require API docs and a spare afternoon in 2023.

The Role of the AI Engineer and the Changing Landscape of AI

Being early in the AI space offers some advantage, but it doesn't guarantee success. The role of an AI engineer is to fill gaps that aren't traditionally part of an ML engineer's skill set. While ML engineers focus on scaling from "one to N," AI engineers are more involved in the "zero to one" phase, where rapid iteration and learning from the market are key. The old ML engineering process was more deliberate, but now, speed is crucial. Winning comes from moving quickly, shipping products, and iterating based on market feedback. As a result, the approach has shifted to "fire, ready, aim" rather than the traditional "ready, aim, fire."

In this fast-paced environment, foundation models are becoming increasingly important. There are three foundation models launching soon, a significant development that wasn't publicly known before. This shift is part of a broader trend that was predicted in the essay "Rise of the AI Engineer." The essay argued that a new type of engineering role would emerge due to the capabilities of LLMs and foundation models, similar to how DevOps, SRE, and data engineers emerged in the past. This new role, the AI engineer, is becoming essential as AI tasks that once required years of research can now be accomplished with just API documentation and a few hours of work.

A key concept in this new landscape is the "API line." There's a spectrum between data-constrained and product-constrained roles, and the API line used to be internal within companies. However, with the rise of foundation models, this line is increasingly between companies. As the cost of creating models rises and model labs become more closed, companies are outsourcing their ML work. This has led to the rise of specialist engineers on the product side of the API line, who are up to date on the rapidly evolving AI stack.

Keeping up with the AI stack is a full-time job. The stack is deepening every day, and those who specialize and stay updated will outperform generalists. The people who put in the effort to stay current with the latest developments in AI will have a significant advantage over those who try to be generalists in this rapidly changing field.

The Evolution of the AI Engineer Role

The concept of an AI engineer emerged from a need to bridge the gap between traditional software engineers and the highly specialized roles of data scientists or ML engineers. Companies were looking for engineers who could focus more on AI than the average software developer, but without requiring the deep research background typical of data scientists. Engineers, on the other hand, wanted to specialize in AI without needing a PhD or years of research experience. This led to the creation of the AI engineer role, which allows for a more accessible entry into the field of AI, especially with the rise of foundation models. These models have made traditional qualifications, like PhDs, less relevant, as tasks that once took years can now be accomplished in hours or days with the right tools.

This shift represents a generational and platform change, similar to what happened with SRE, DevOps, and data. The rise of foundation models has opened up new opportunities for engineers to specialize in AI without needing the deep math or research skills of an ML engineer. While having more ML skills can make an AI engineer more successful, it’s not strictly necessary to get started. In the early stages, an AI engineer might only need to know how to prompt and use APIs. However, as they advance and start working on production-level AI products, they will need to invest in more ML skills, such as building an operational stack, fine-tuning models, and handling inference and evaluation capabilities.

AI engineers also fill gaps that ML engineers traditionally don’t cover, particularly in areas like agents. While ML engineers have been involved in agent research, AI engineers, with their different assumptions and experiences, might be better suited for this work. This highlights a qualitative or anthropological difference between AI engineers and ML engineers. AI engineers tend to work on different types of problems and have a different mindset compared to ML engineers, who often focus on scaling solutions ("one to n" problems).

The rise of foundation models has also changed the perception of what’s difficult in AI. Tasks that once required a research team and years of work can now be accomplished in a matter of hours or days. This shift is humorously captured in the xkcd cartoon "Tasks": asked to build an app that checks whether a photo was taken in a national park, the engineer says it's easy, but asked to also check whether the photo is of a bird, she responds that it would take a research team and five years. The cartoon illustrates how the perception of difficulty in AI tasks has historically been off, and how advancements in AI have made previously difficult tasks much easier.

In summary, the AI engineer role is a response to the changing landscape of AI, where traditional qualifications are no longer as important, and new opportunities are emerging for engineers to specialize in AI without needing a deep research background. The role is still evolving, and the skills required will depend on how deeply an engineer wants to engage with AI, but the rise of foundation models has made it easier than ever to get started.

AI Engineers vs ML Engineers: Different Roles, Different Mindsets

AI engineers and ML engineers operate in distinct phases of product development. AI engineers are involved in the "zero to one" phase, where they take a model and figure out how to turn it into a useful product. Their focus is on building something from scratch, often with front-end skills and a product-oriented mindset. In contrast, ML engineers work in the "one to n" phase, where they refine and optimize models, often dealing with more mathematically complex tasks. For them, the model is the center of their work, and they are more specialized in that area.

The difference in focus means that AI engineers and ML engineers have different skill sets and mindsets. AI engineers are more product-focused, while ML engineers are more model-focused. While it’s possible for someone to cross over between these roles, each has a "home base" where they are most comfortable. The kind of person who excels in one role may not necessarily succeed in the other.

In terms of team composition, AI engineers tend to outnumber ML engineers, especially in more mature teams. A typical ratio might be four AI engineers for every ML engineer. This is partly because AI engineers handle tasks that ML engineers might avoid, such as product-related work that isn’t directly tied to the model. Additionally, there simply aren’t enough ML engineers being trained to meet the demand, so companies supplement their teams with more AI engineers.

The Importance of Product Managers and Domain Experts

While AI engineers play a crucial role in building products, product managers and domain experts are just as important, if not more so. These roles provide the essential insights into customer needs and product direction, which no amount of engineering can replace. A bad product decision can’t be fixed by good engineering, so having someone who understands the customer and the market is key.

AI engineers are often better equipped than ML engineers to communicate with product managers and domain experts about what’s possible with the latest foundation models. This is because AI engineers are product thinkers first and foremost, and they are more in tune with the state-of-the-art in AI technology.

Collaboration Between Roles

The relationship between product managers, domain experts, and engineers is evolving. In the past, product managers were primarily responsible for defining the specs and customer problems, while engineers translated those specs into code. However, with the rise of AI, product managers and subject matter experts are becoming more directly involved in the creation process. They are now writing prompts and creating artifacts that are part of the product itself, working more closely with AI engineers than they ever did with ML engineers.

This shift in collaboration is a natural progression as AI technology becomes more integrated into product development. It allows for a more seamless interaction between the people who understand the customer and the people who understand the technology, ultimately leading to better products.

The Emergence and Debate Around the AI Engineer Role

The AI engineer role has emerged recently, but not without its critics. One common argument is that every software engineer should be an AI engineer, making the new title unnecessary. This perspective, often held by those who prefer generalization over specialization, suggests that AI should be part of every engineer's skill set. However, this hasn't been the reality. Many engineers remain skeptical about AI, with around 50% of the Hacker News community still not adopting tools like GitHub Copilot. The future of AI is here, but it's unevenly distributed—some people take it more seriously than others, and that's fine. Not everyone needs to work on AI; there are other important areas like distributed systems, front-end development, and databases.

Despite its growing presence, the AI engineer role is likely to remain low-status for a while, especially compared to machine learning engineers and research scientists. The bar for entry is low, and people will continue to question the role's existence. This is part of the natural process of defining new roles and boundaries in the tech world.

The primary goal of defining the AI engineer role is to create a "Schelling point" where people can find each other in the job market. The role is still evolving, and there isn't yet a clear skill ladder, like senior or staff AI engineer. However, some companies, like Mastercard and OpenAI, have already adopted the title. It's particularly interesting to see a company like Mastercard, a clear incumbent, embrace the role of VP of AI engineering.

OpenAI has its own interpretation of the AI engineer role, requiring 5-7 years of machine learning engineering experience for a position focused on fine-tuning models. This requirement is debatable, as not all AI engineering roles need that level of experience. OpenAI leans more towards the machine learning engineer spectrum of AI engineering, which is fine, but it highlights the spectrum and fuzzy nature of the role.

The AI engineer role exists on a spectrum, and its fuzzy nature makes some people uncomfortable. There's also discomfort with the hype surrounding the term "AI." Some propose alternative titles like "LLM engineer" or "cognitive engineer," but "AI engineer" is the most natural and widely accepted term.

The Legitimacy of the AI Engineer Title and the Importance of Being Early

The AI Engineer title is still questioned by some, with people on platforms like Hacker News finding it annoying or unnecessary. However, the demand and supply of work in this field are undeniable. The role may feel "illegitimate" now, but it will feel less so with each passing year. The work involved is substantial, and the field is evolving rapidly. The organizer's entire body of work—his conference, podcast, and newsletter—aims to define and legitimize this role.

Being early in the AI engineering field offers some advantages, but it doesn't guarantee success. While being early allows you to keep up with the vast amount of papers, techniques, and APIs, it doesn't mean you'll automatically win. The later you join, the more you'll have to catch up on, but it's not impossible. It's just harder. The example of Auto-GPT illustrates this point: they were early, gained a lot of attention, but ultimately flamed out. Being early doesn't protect you from failure.

The hype surrounding AI products can be both a blessing and a curse. While hype can attract a lot of interest and investment, it can also be fleeting. If a product doesn't deliver, people will move on just as quickly as they came. The key is to focus on deep, sustainable, and hard problems. Hype will come and go, but long-term success requires solving real, lasting issues.

The AI Engineer World Fair: A Multi-Track Event

The AI Engineer World Fair is the organizer's highest-stakes expression of his vision for what AI engineering should be. The event has evolved from a single-track conference to a multi-track one, driven by the sheer amount of interest and the clear swim lanes of work people are doing. Last year, the event had one track, but this year, it has expanded to include multiple tracks, such as multimodality, evals and ops, and agents.

One of the new tracks this year is "AI in the Fortune 500," which was added to address the criticism that AI is only for startups. The organizer felt that last year's event was too focused on startups, which can lead to a "navel-gazing" Silicon Valley bubble where companies talk big but don't generate much revenue. In contrast, the Fortune 500 track brings in speakers from household names like Coinbase and Salesforce. The organizer believes that in a capitalist society, "revenue is the only thing that keeps you honest," and he's proud to have initiated this track.

For the first time, the event also includes a "VPs of AI" track, aimed at higher-level discussions within the field. The goal of the event is not just to share knowledge but to foster many-to-many connections, allowing people to meet each other without being limited by the organizer's involvement.

Moving Fast and Networking in AI Conferences

Conferences are often seen as a place to attend talks, but the real value lies in the conversations and networking that happen outside the sessions. The ultimate goal is to have such great conversations that the talks become secondary. These events are expensive, and people need to justify the cost by making them work-relevant—whether it's finding jobs, hiring, or launching products. The speaker is clear: "The ultimate win condition... you just show up and... my conversations here are so great like I don't need anything else." However, the cost of attending means that conferences must also offer tangible benefits, like helping people level up in their careers or launch new products.

On the topic of product launches, the speaker reveals that three foundation models are set to launch at the event, though this hasn't been publicly announced before. While they aren't at the level where companies give them their biggest launches, they are becoming more legitimate with each iteration of the event. "We have three foundation models launching... I've never said that publicly, but you heard it here first," they say, hinting at the growing importance of their platform.

The event itself has seen significant growth. Last year, they had 500 in-person attendees and 20,000 online, with 150,000 asynchronous viewers for one of the top talks. This year, they are aiming for 2,000 in-person attendees and scaling everything up by four times. "Last year our in person was 500 and the online audience was 20,000... this year we're just basically trying to make everything four times larger," the speaker notes, underscoring the rapid expansion.

When it comes to advice for AI engineers and product leaders, the speaker emphasizes the need to move fast. Traditional development timelines are too slow for AI products, and companies should aim to reduce the time it takes to ship a product from months to weeks. The idea is to adopt a "fire, ready, aim" approach, where you ship quickly, gather feedback, and iterate based on real-world data. "You should move fast... if you're taking like three months to shape a thing, what could you do to make it three weeks?" they ask, challenging the conventional wisdom of slow, deliberate development.

The speaker also draws an interesting analogy between gradient descent in machine learning and product development. Instead of optimizing model weights, companies should optimize their products by gathering feedback from the market and iterating quickly. Shipping a V1 that is "good enough" allows companies to gather valuable data that can be used to improve the product. "The gradient descent is performed in the marketplace of the user rather than in the model weights," they explain, highlighting the importance of real-world feedback in product development.
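The analogy can be made concrete with a toy loop, where user feedback plays the role of the loss gradient. This is purely an illustrative sketch of the metaphor; the `user_feedback` function and its target are invented for the example, not anything from the talk:

```python
# Toy illustration of "gradient descent in the marketplace":
# each release is an optimization step; market feedback is the gradient signal.

def user_feedback(quality: float) -> float:
    """Hypothetical stand-in for market response: how far the current
    product is from what users actually want (ideal quality = 1.0)."""
    return 1.0 - quality

def ship_and_iterate(v1_quality: float, learning_rate: float = 0.5,
                     releases: int = 8) -> float:
    """Ship a 'good enough' V1, then improve each release using feedback."""
    quality = v1_quality
    for _ in range(releases):
        gradient = user_feedback(quality)    # learn from real usage
        quality += learning_rate * gradient  # fold it into the next release
    return quality

# Even a rough V1 (quality 0.3) converges toward fit after a few releases.
final = ship_and_iterate(v1_quality=0.3)
assert final > 0.95
```

The point of the sketch is the loop structure: the optimization step happens after shipping, driven by users, not before shipping, driven by internal deliberation.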

However, moving fast doesn't mean ignoring risks. Companies need to be mindful of issues like hallucinations in AI models, but these risks are often overstated. Most companies can mitigate them by setting the right expectations with users. "Most companies you are not Google... go ahead and experiment like get out of your own head go try things," the speaker advises, encouraging companies to be bold in their experimentation.

Finally, the speaker hints at a "spicier" take that they have been sitting on for a while, but they don't reveal it in this segment. The non-spicy take is that everyone should be moving faster, regardless of whether they are working in AI or not. "Everyone should be moving faster, AI or not... that's a universal truth," they conclude, leaving the audience curious about the spicier take yet to come.

Vertical Startups Outperforming Horizontal Ones

Vertical startups are proving to be more successful than horizontal ones. These companies focus on specific industries or problems, often leveraging proprietary data and targeting high-margin markets. They cater to non-technical audiences who have pressing pain points that no one else is addressing. This gives them a unique advantage, as they can offer AI solutions that are highly valued by their customers. In contrast, horizontal startups, which try to serve a broad range of industries, face more competition and are more likely to fail. The market is brutal, and many horizontal players will likely be shaken out.

The key to vertical startups' success lies in their deep understanding of their customers' needs. They are able to solve specific problems that others overlook, and this allows them to grow faster and make more money. For example, a company working in AI and construction is thriving because it is willing to tackle real-world problems that most software engineers avoid. While many developers prefer to build tools for other developers, those who are willing to engage with the messy, real-world challenges are rewarded accordingly.

Examples of Successful Vertical Startups

Several vertical AI startups are already making waves. Harvey, for instance, is well-known in the legal space, while Midjourney has become a major player in the creative market, generating between $200 million and $300 million a year with a small team. Perplexity is another example, positioning itself as an anti-Google, and while its profitability is still an open question, it has certainly made a dent in public perception.

Other vertical startups include Photo AI, which focuses on virtual staging for real estate agents, and developer tools like Cursor and Copilot, which are becoming essential in their respective fields. These companies are succeeding because they are solving specific problems in their industries, and their customers are raving about their products.

The Challenges of Horizontal Startups

Horizontal startups, particularly in the developer tools space, face a much tougher battle. The market is crowded, and companies need to meet a long list of requirements to succeed. They must be scalable, open-source, and meet various security certifications, among other things. This makes it much harder to stand out and win customers. In contrast, vertical startups can focus on solving a single pain point and still achieve significant success.

For example, a vertical product that solves a specific problem can quickly gain a loyal customer base, even if the customers don't fully understand the technology behind it. This is in stark contrast to horizontal startups, where everything needs to be perfect just to compete.

AI in Specific Domains

AI is being applied in a wide range of verticals, from legal and creative industries to real estate and finance. Brightwave, for example, is using AI to help hedge funds conduct research more quickly, a task that is often commoditized but can be greatly improved with language models. While there are concerns about AI hallucinations, these are no different from the mistakes human analysts make.

There are also opportunities in medical AI, though specific examples weren't mentioned. Summarization is another vertical that has yet to be fully explored. While many companies treat summarization as a feature, there is potential to build it into a full-fledged product.

Advice for Buyers of AI Tools

For those looking to buy AI tools, the best approach is to start by purchasing existing solutions. This allows companies to understand their own needs before deciding whether to build something in-house. Building everything from scratch can slow down progress, and it's important to avoid the "not invented here" syndrome.

Some tools, like evaluation platforms and observability for APIs, are essential from the start. These tools help ensure that AI systems are functioning properly and can be monitored effectively. While some vendors may charge a lot for these services, they are crucial for any AI-driven business.

AI Tools, Employees, and the Four Wars

In the fast-moving world of AI, it makes sense to buy tools that help you move faster, even if they don’t perfectly fit your needs. The idea is to explore existing solutions first, and only build your own once you’ve fully understood the problem. You’re not the only one facing issues like key rotation or tracking inputs and outputs for fine-tuning. Everyone has the same problems, so it’s smarter to buy tools and share the development cost with others. If you find that none of the solutions work for you, or you’re overpaying, you can always build your own later. But initially, just buy what you need to move quickly.

There’s a distinction between AI product tooling, which is for building products for your customers, and internal productivity tooling, which is adopted much faster. Developer tools like Copilot, Cursor, and Sourcegraph are already widely used and form the baseline for internal productivity. Beyond that, there’s a growing stack of tools like meeting summarizers, but the real game-changer will be AI employees. These virtual workers can perform tasks that would normally be assigned to humans, and they can work all night without rest. While this concept is still in its early stages, it’s clear that the companies that can leverage AI to handle these tasks will have a significant advantage. AI employees will start small but grow over time, and those who adopt them will be the most capital-efficient.

However, it’s important to remember that AI and humans are fundamentally different. AI is already superhuman in some areas, like memory and retrieval, but much weaker in others. There won’t be a moment when AI reaches "human level" because it’s already better than humans in some dimensions and worse in others. The way we interact with AI employees will be very different from how we interact with human employees, and we’ll find that there are some tasks we would never give to a human, and vice versa.

In terms of trends, there’s a lot of noise in the AI space, with new papers and developments being released every day. But there are a few key battlegrounds that every serious AI company will need to focus on. These include the fight over data, GPUs, general-purpose models vs. domain-specific models, and regulation. These are the "four wars" that will determine the future of AI, and not all participants can win.

Interestingly, open source is not a battleground in the technical domain because most people in tech are pro-open source. However, it is a political issue. On the other hand, code generation, while an important problem, is not a battleground yet because everyone is still trying to figure out the next step.

Navigating the research space in AI requires a strong filter to separate meaningful work from the noise. Social media, especially Twitter, is flooded with influencers hyping every new paper as revolutionary. For example, a recent claim suggested that matrix multiplication was no longer necessary, leading to exaggerated conclusions like "Nvidia is going to zero." Without a filter, it's easy to get swept up in these overblown reactions.

In terms of research priorities, a ranked list of directions has emerged:

  1. Long inference
  2. Synthetic data
  3. Alternative architectures
  4. Mixture of experts and merging models
  5. Online learning systems (OLS)

These are the key trends shaping the future of AI research.

A significant trend in AI is the "Moore's Law of AI," where the cost of inference is dropping rapidly, much like the cost of semiconductors. The cost of running a model that scores 70 on MMLU decreases by 5-10x every year. This trend allows companies to build products that may lose money today but will become profitable as costs continue to drop. The idea is that Moore's Law will eventually "bail you out" of any bad inference ideas.

MMLU, or Massive Multitask Language Understanding, is a benchmark created by Dan Hendrycks. It tests models on professional exams across various domains and is currently the primary metric for comparing large language models (LLMs). However, benchmarks like MMLU are temporary and will likely be replaced in a few years as the field evolves.

The cost of achieving the same level of AI intelligence is also dropping. For example, in 2022, running a GPT-4 level model cost around $20 per million tokens. Today, that cost has dropped to $2, and it will likely continue to decrease to $0.50 or even $0.25. However, as costs drop, the bar for what is considered acceptable AI intelligence will rise with the release of new models like GPT-5. This follows a classic cost curve model, similar to what is seen in semiconductors.

Another key trend is the commodification of intelligence. As the cost of inference drops, intelligence becomes more accessible. Inference speed is also increasing, with models moving from 70-100 tokens per second to potentially 5,000 tokens per second. Every 10x improvement in speed unlocks new product possibilities. For example, generating 5,000 tokens per second would allow for the creation of a full essay in just one second, making streaming tokens unnecessary.
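The "full essay in one second" claim is easy to check with back-of-envelope numbers. The essay length here is an assumption (a ~1,500-word essay at the common ~0.75 words-per-token heuristic is roughly 2,000 tokens), not a figure from the conversation:

```python
# Back-of-envelope: generation time for an essay at different token rates.
# Assumes ~2,000 tokens per essay (roughly 1,500 words at ~0.75 words/token).

ESSAY_TOKENS = 2000

def generation_seconds(tokens: int, tokens_per_second: float) -> float:
    return tokens / tokens_per_second

today  = generation_seconds(ESSAY_TOKENS, 100)   # today's ~100 tok/s
future = generation_seconds(ESSAY_TOKENS, 5000)  # the projected 5,000 tok/s

assert today == 20.0   # ~20 s: long enough that streaming tokens matters
assert future < 1.0    # sub-second: streaming becomes unnecessary
```

This is the sense in which each 10x speedup changes the product, not just the latency: a 20-second wait demands streaming UI, while a sub-second response can be treated like an ordinary page load.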

Context length is also expanding dramatically. Models used to have a context length of 4,000 tokens, which was considered sufficient. Now, models can handle up to a million tokens, opening up new use cases, though it's not yet clear what all of them will be.

Finally, the trend towards multimodal models is becoming increasingly clear. Models like GPT-4 are now capable of handling multiple input and output modalities, and this is becoming a standard expectation in the field. The phrase "all modalities in, all modalities out" captures this shift towards multimodal everything.

Variance, Creativity, and New Knowledge in AI

Variance is an emerging trend in AI that could become more permanent over time. Many current AI use cases are what can be called "temperature zero" use cases, where the model is locked down to be deterministic and predictable. This is often seen as the "most boring possible use case." The idea is to make the model retrieval-oriented, ensuring it generates only what is expected. But what if, instead of chaining the model down, you let it loose? What if hallucination was a feature, not a bug?

This is where "temperature two use cases" come in. These use cases embrace hallucination and creativity, allowing the model to think of things you never thought of. Creativity is expensive, and if AI can provide it for a few dollars, it’s worth considering. In this context, hallucination becomes a valuable tool, not something to be avoided.
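The "temperature" in these phrases is the sampling temperature, which rescales the model's next-token scores before they are turned into probabilities. The sketch below shows the mechanism on toy logits; the numbers are invented for illustration:

```python
import math

def softmax_with_temperature(logits, temperature):
    """Sampling temperature divides logits before the softmax:
    low T sharpens the distribution toward the top token (deterministic),
    high T flattens it, giving unlikely ('creative') tokens more mass."""
    scaled = [l / temperature for l in logits]
    m = max(scaled)  # subtract max for numerical stability
    exps = [math.exp(s - m) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]

logits = [2.0, 1.0, 0.5]  # toy next-token scores

cold = softmax_with_temperature(logits, 0.1)  # the "temperature zero" regime
hot  = softmax_with_temperature(logits, 2.0)  # the "temperature two" regime

assert cold[0] > 0.99                          # nearly all mass on top token
assert hot[0] < cold[0] and hot[2] > cold[2]   # tail tokens gain probability
```

"Temperature zero" use cases pin sampling to the argmax for predictability; "temperature two" use cases deliberately spend probability mass on the tail, which is where the surprising, possibly-hallucinated, possibly-creative outputs live.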

But creativity isn’t just about generating new ideas; it’s also about creating new knowledge. The combination of a model that acts as a "conjecture machine"—one that can propose possible explanations—and a system that can test and measure those conjectures could lead to AI systems that generate entirely new knowledge. For this to work, the system needs to operate in a "high temperature, non-deterministic mode," allowing it to explore possibilities beyond the predictable.

For those interested in exploring these ideas further, the AI Engineer conference is a great place to start. The domain for the event is "ai.engineer," and a discount code "Agency" is available for last-minute tickets. Shawn Wang, who is active on Twitter under the handle "swyx," also hosts the "Latent Space" podcast, where more conversations on these topics can be found. The domain "lat.space" was even donated by a listener, adding a fun twist to the project.

As the conversation wraps up, Raza Habib encourages listeners to rate and review the podcast on platforms like Spotify and Apple Podcasts. Show notes and more episodes are available at "humanloop.com/podcast." Raza also invites feedback and ideas from listeners, providing an email address and social media handle for contact.

Conclusion

AI engineers are increasingly important, especially in vertical startups using proprietary data. Community engagement through events like the AI Engineer conference is vital for networking and strategy discussions.

