📊 Full opportunity report: The Model Is Only 10%: The Real Lesson of the New SDLC on ThorstenMeyerAI.com — validation score, market gap, and execution plan.
TL;DR
A recent whitepaper from Google argues that in AI-driven software development, the model itself accounts for only 10% of system behavior. The key to success lies in harness design and context engineering, shifting focus from models to configuration and verification.
A new whitepaper from Google, authored by Addy Osmani, Shubham Saboo, and Sokratis Kartakis, states that the AI model accounts for only about 10% of the behavior in AI-driven systems. The report emphasizes that the real value comes from the harness—tools, prompts, rules, and context—that surround the model, shifting the focus of AI development and deployment.
The whitepaper, titled The New SDLC With Vibe Coding, argues that the dominant challenge in AI-assisted software engineering is not the model itself but how developers configure, verify, and guide its outputs. It cites experiments where tweaking only the harness or context improved agent performance significantly, while changing the model had minimal impact. For example, one team moved a coding agent from outside the Top 30 to the Top 5 on a benchmark by adjusting the harness alone.
Furthermore, the authors differentiate between ‘vibe coding’—quick prompts with minimal oversight—and ‘agentic engineering,’ which involves structured, verified, and monitored AI workflows. They stress that the costs and risks associated with unstructured, prompt-based AI use are high, including token waste, security vulnerabilities, and maintenance burdens. The report suggests that investing in harness and context engineering offers a more sustainable and cost-effective approach.
The model is only 10%
A Google whitepaper argues software’s biggest shift is from writing code to expressing intent. Its sharpest claim: the model you obsess over is the smallest part of the system — the scaffolding around it does the real work.
The clearest map yet of how serious AI development works — and mostly tool-agnostic. But it’s a Google funnel: the concepts are neutral, the on-ramps point to Gemini, Jules & the ADK. If the harness is 90% and it’s yours, your moat and your costs both live there — so own your scaffolding, route across models, and remember: AI amplifies whatever engineering culture it lands in.
Impact of Harness and Context on AI Development Success
This shift in understanding has major implications for organizations adopting AI. It suggests that building robust harnesses and managing context will determine the quality, reliability, and cost-efficiency of AI systems, rather than focusing solely on accessing the latest models. Companies that master configuration and verification can gain a durable competitive advantage, while those fixated on model improvements may see diminishing returns.
AI model testing tools
As an affiliate, we earn on qualifying purchases.
As an affiliate, we earn on qualifying purchases.
Evolution of AI Coding Practices and Industry Insights
The whitepaper builds on ongoing trends where AI is increasingly integrated into software workflows. As of early 2026, reports indicate that 85% of developers use AI coding agents, with more than half doing so daily. The industry has moved from vibe coding—quick, minimal oversight—to more disciplined, structured approaches. Prior to this, the focus was primarily on model capabilities, but recent experiments and benchmarks underscore that configuration, scaffolding, and context management are now the key differentiators.
This perspective aligns with broader industry shifts emphasizing verification, testing, and cost management over raw model performance, reflecting a maturation in AI integration practices.
“The behavior you experience in AI tools is dominated by scaffolding you can build, own, and improve, not the model itself.”
— Addy Osmani
software verification tools
As an affiliate, we earn on qualifying purchases.
As an affiliate, we earn on qualifying purchases.
Unclear Aspects of Model-Harness Dynamics
It remains unclear how universally applicable these findings are across different AI tasks and industries. The specific impact of harness design versus model improvements in real-world, large-scale deployments needs further empirical validation. Additionally, the long-term effects of this paradigm shift on AI model development strategies are still emerging.
AI prompt engineering toolkit
As an affiliate, we earn on qualifying purchases.
As an affiliate, we earn on qualifying purchases.
Future Directions in AI System Engineering
Organizations are likely to invest more in developing sophisticated harnesses, context management, and verification frameworks. Further research will explore best practices for scalable harness design and the integration of dynamic context loading. Monitoring how this shift influences AI model development and industry standards will be key in the coming months.
AI development environment
As an affiliate, we earn on qualifying purchases.
As an affiliate, we earn on qualifying purchases.
Key Questions
Why is the model only 10% of the system behavior?
The whitepaper shows that the surrounding harness—prompts, rules, tools, and context—has a much larger influence on the AI’s output than the model itself, which accounts for roughly 10%.
How does this change AI development strategies?
It shifts focus from chasing better models to designing better harnesses, managing context, and implementing verification to improve performance and reduce costs.
What are the risks of vibe coding versus agentic engineering?
Vibe coding, which relies on quick prompts and minimal oversight, can lead to high token costs, security vulnerabilities, and maintenance issues. Agentic engineering emphasizes structured, verified workflows, which are more cost-effective long-term.
Will this approach work for all AI tasks?
It is still uncertain how universally applicable this paradigm is across different domains. More research and real-world testing are needed to validate its effectiveness broadly.
Source: ThorstenMeyerAI.com