<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0" xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd" xmlns:googleplay="http://www.google.com/schemas/play-podcasts/1.0"><channel><title><![CDATA[Gradient Flow]]></title><description><![CDATA[Put data, machine learning, and AI to work.]]></description><link>https://gradientflow.substack.com</link><image><url>https://substackcdn.com/image/fetch/$s_!JgYc!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F42dc987e-4b38-4003-b259-4283ad63d445_256x256.png</url><title>Gradient Flow</title><link>https://gradientflow.substack.com</link></image><generator>Substack</generator><lastBuildDate>Sat, 23 May 2026 09:53:14 GMT</lastBuildDate><atom:link href="https://gradientflow.substack.com/feed" rel="self" type="application/rss+xml"/><copyright><![CDATA[Gradient Flow]]></copyright><language><![CDATA[en]]></language><webMaster><![CDATA[gradientflow@substack.com]]></webMaster><itunes:owner><itunes:email><![CDATA[gradientflow@substack.com]]></itunes:email><itunes:name><![CDATA[Ben Lorica 罗瑞卡]]></itunes:name></itunes:owner><itunes:author><![CDATA[Ben Lorica 罗瑞卡]]></itunes:author><googleplay:owner><![CDATA[gradientflow@substack.com]]></googleplay:owner><googleplay:email><![CDATA[gradientflow@substack.com]]></googleplay:email><googleplay:author><![CDATA[Ben Lorica 罗瑞卡]]></googleplay:author><itunes:block><![CDATA[Yes]]></itunes:block><item><title><![CDATA[Stop upgrading your LLM. Start fixing your data.]]></title><description><![CDATA[Put data, machine learning, and AI to work.]]></description><link>https://gradientflow.substack.com/p/stop-upgrading-your-llm-start-fixing</link><guid isPermaLink="false">https://gradientflow.substack.com/p/stop-upgrading-your-llm-start-fixing</guid><dc:creator><![CDATA[Ben Lorica 罗瑞卡]]></dc:creator><pubDate>Tue, 19 May 2026 13:02:28 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/30d0dfc4-712c-48b1-90e4-dea6c033df29_1661x1012.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://gradientflow.com/newsletter/" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Qpug!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 424w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 848w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 1272w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Qpug!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png" width="1100" height="220" data-attrs="{&quot;src&quot;:&quot;https://bucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com/public/images/0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:220,&quot;width&quot;:1100,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:24747,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:&quot;https://gradientflow.com/newsletter/&quot;,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!Qpug!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 424w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 848w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 1272w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a></figure></div><p><strong><a href="https://gradientflow.substack.com/subscribe">Subscribe</a> &#8226;<a href="https://gradientflow.substack.com/"> Previous Issues</a></strong></p><h1><strong>Integration Is the New Moat: Moving Beyond the LLM</strong></h1><p>The <strong><a href="https://www.agentconference.com/?utm_source=gradientflow&amp;utm_medium=newsletter">AI Agent Conference</a></strong> in New York was one of the better events I&#8217;ve attended to get a read on what&#8217;s actually happening with enterprise AI. The formal sessions were great, but the hallway conversations was where I got the inside scoop. The consistent message: deploying AI agents is much harder than most organizations expect, and the reasons are rarely the ones they anticipate. What follows is my attempt to distill what I heard into a practical view of why enterprise agents are so hard to deploy.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://gradientflow.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption"><em>Reading on a regular basis? Consider becoming a paid supporter &#128591;</em></p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p>The first barrier is integration. In one enterprise after another, agents run into older CRMs, finance systems, document stores, and homegrown tools that were never designed for autonomous software. What looks like a wiring problem usually isn&#8217;t. A logistics company built a capable agent that fell apart the moment it had to touch their order management system, a decade-old platform never designed to be queried by software making autonomous decisions. The agent wasn&#8217;t broken. The process it was dropped into was. The worst failures are often the ones that stay hidden. A financial services firm ran an agent successfully in its test environment for months, only to discover during a quarterly audit that production CRM records had silently stopped updating. No error, no alert, just bad data accumulating for three months. Teams that treat integration as a technical handoff rather than a workflow redesign problem consistently get stuck at the same wall.</p><p>The data situation compounds this at every layer. Enterprise knowledge doesn&#8217;t live in clean, queryable databases. It lives in Confluence pages nobody maintains, Slack threads from two years ago, and the heads of three people who&#8217;ve been at the company long before the current CTO arrived. One study found that more than a quarter of agent deployment failures trace directly to critical knowledge that was never captured anywhere a system could reach. When agents fail on company-specific terminology, like non-standard product codes or internal procurement shorthand, the instinct is to upgrade to a more powerful model. That instinct is almost always wrong. The fix is domain-specific examples and better knowledge capture, not a bigger model. And underneath all of it, security cannot be an afterthought. Agents need governed, scoped access to data, with proper permissions and audit trails built in from the start. Without that foundation, even a well-integrated, well-trained agent is a liability waiting to surface.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!4I-c!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb61fa1e-758a-4c14-a230-49d96d09461e_1517x1020.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!4I-c!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb61fa1e-758a-4c14-a230-49d96d09461e_1517x1020.jpeg 424w, https://substackcdn.com/image/fetch/$s_!4I-c!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb61fa1e-758a-4c14-a230-49d96d09461e_1517x1020.jpeg 848w, https://substackcdn.com/image/fetch/$s_!4I-c!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb61fa1e-758a-4c14-a230-49d96d09461e_1517x1020.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!4I-c!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb61fa1e-758a-4c14-a230-49d96d09461e_1517x1020.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!4I-c!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb61fa1e-758a-4c14-a230-49d96d09461e_1517x1020.jpeg" width="582" height="391.33104395604397" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/eb61fa1e-758a-4c14-a230-49d96d09461e_1517x1020.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:979,&quot;width&quot;:1456,&quot;resizeWidth&quot;:582,&quot;bytes&quot;:373052,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/195682387?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb61fa1e-758a-4c14-a230-49d96d09461e_1517x1020.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!4I-c!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb61fa1e-758a-4c14-a230-49d96d09461e_1517x1020.jpeg 424w, https://substackcdn.com/image/fetch/$s_!4I-c!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb61fa1e-758a-4c14-a230-49d96d09461e_1517x1020.jpeg 848w, https://substackcdn.com/image/fetch/$s_!4I-c!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb61fa1e-758a-4c14-a230-49d96d09461e_1517x1020.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!4I-c!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb61fa1e-758a-4c14-a230-49d96d09461e_1517x1020.jpeg 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h4>Agents Don&#8217;t Fix Broken Processes, They Find Them</h4><p>But even when integration is solid and data is in order, most deployments hit a third wall that almost nobody budgets for: the organization itself. Agents do not arrive as neutral software upgrades. They ask people to change how work is routed, approved, measured, and owned. In a hospital system, that might mean deciding whether an agent can prepare a prior authorization packet before a clinician reviews it. In an industrial manufacturer, it might mean exposing the informal workaround a plant manager has used for years because the official process is too slow. In an insurance operation, it might mean discovering that no one can explain who is accountable when an agent recommends a coverage decision that later gets challenged. These are not edge cases. They are the work.</p><p>This is why early agent projects often fail in a way that is politically expensive. A weak first deployment does not just miss a KPI. It convinces managers, operators, and risk teams that the whole category is immature or unsafe. The harder second attempt, the one that would involve process audits, clearer ownership, better evaluations, and serious <a href="https://en.wikipedia.org/wiki/Change_management">change management</a>, may never get funded. In practice, deploying agents means changing the organization while the organization is still running. That is slower and messier than a demo, but it is also where the real implementation work begins.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Ijpz!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9da51d32-ab7e-4f94-bcd1-0c46f4cfbe4a_1345x1020.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Ijpz!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9da51d32-ab7e-4f94-bcd1-0c46f4cfbe4a_1345x1020.jpeg 424w, https://substackcdn.com/image/fetch/$s_!Ijpz!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9da51d32-ab7e-4f94-bcd1-0c46f4cfbe4a_1345x1020.jpeg 848w, https://substackcdn.com/image/fetch/$s_!Ijpz!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9da51d32-ab7e-4f94-bcd1-0c46f4cfbe4a_1345x1020.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!Ijpz!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9da51d32-ab7e-4f94-bcd1-0c46f4cfbe4a_1345x1020.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Ijpz!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9da51d32-ab7e-4f94-bcd1-0c46f4cfbe4a_1345x1020.jpeg" width="596" height="451.98513011152414" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/9da51d32-ab7e-4f94-bcd1-0c46f4cfbe4a_1345x1020.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1020,&quot;width&quot;:1345,&quot;resizeWidth&quot;:596,&quot;bytes&quot;:342322,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/195682387?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9da51d32-ab7e-4f94-bcd1-0c46f4cfbe4a_1345x1020.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Ijpz!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9da51d32-ab7e-4f94-bcd1-0c46f4cfbe4a_1345x1020.jpeg 424w, https://substackcdn.com/image/fetch/$s_!Ijpz!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9da51d32-ab7e-4f94-bcd1-0c46f4cfbe4a_1345x1020.jpeg 848w, https://substackcdn.com/image/fetch/$s_!Ijpz!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9da51d32-ab7e-4f94-bcd1-0c46f4cfbe4a_1345x1020.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!Ijpz!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9da51d32-ab7e-4f94-bcd1-0c46f4cfbe4a_1345x1020.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Compounding all of this is a talent problem with no clean solution. Deploying agents at scale requires people who understand process design, LLM behavior, systems integration, and organizational change, all at once. That combination is genuinely rare, and companies that hire for one or two of those skills and assume the rest will follow are setting themselves up for the same failure pattern, just with different symptoms.</p><h4>Agents Are Implemented, Not Installed</h4><p>The real implementation work isn&#8217;t getting an agent to run correctly in isolation. It&#8217;s getting the agent to run correctly inside a live organization, where the work is distributed across roles, the accountability is murky, and the processes were designed around human judgment calls that nobody ever wrote down. What looks like an AI project is usually a process redesign project with an AI component attached. The companies that figure this out early tend to scope their first deployments around a specific, broken workflow rather than a whole function or department. Not &#8220;automate the sales team,&#8221; but &#8220;automate the part of lead qualification that requires pulling the same three fields from two systems and writing the same email 200 times a week.&#8221; That narrower target is less exciting to demo but far more likely to survive contact with the organization. Enterprise agents are not installed. They are stood up, governed, monitored, and gradually expanded.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!oz8s!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F19a7dfee-bd8b-4642-925d-5fe45be1ac2e_1659x1016.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!oz8s!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F19a7dfee-bd8b-4642-925d-5fe45be1ac2e_1659x1016.jpeg 424w, https://substackcdn.com/image/fetch/$s_!oz8s!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F19a7dfee-bd8b-4642-925d-5fe45be1ac2e_1659x1016.jpeg 848w, https://substackcdn.com/image/fetch/$s_!oz8s!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F19a7dfee-bd8b-4642-925d-5fe45be1ac2e_1659x1016.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!oz8s!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F19a7dfee-bd8b-4642-925d-5fe45be1ac2e_1659x1016.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!oz8s!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F19a7dfee-bd8b-4642-925d-5fe45be1ac2e_1659x1016.jpeg" width="1456" height="892" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/19a7dfee-bd8b-4642-925d-5fe45be1ac2e_1659x1016.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:892,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:357568,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/195682387?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F19a7dfee-bd8b-4642-925d-5fe45be1ac2e_1659x1016.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!oz8s!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F19a7dfee-bd8b-4642-925d-5fe45be1ac2e_1659x1016.jpeg 424w, https://substackcdn.com/image/fetch/$s_!oz8s!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F19a7dfee-bd8b-4642-925d-5fe45be1ac2e_1659x1016.jpeg 848w, https://substackcdn.com/image/fetch/$s_!oz8s!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F19a7dfee-bd8b-4642-925d-5fe45be1ac2e_1659x1016.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!oz8s!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F19a7dfee-bd8b-4642-925d-5fe45be1ac2e_1659x1016.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>The implementation gap has also created a staffing model that most organizations haven&#8217;t fully absorbed yet. The most effective deployments involve a dedicated operational owner for each agent in production, someone who sits at the intersection of the business workflow and the technical system, and who can tell the difference between an agent that&#8217;s working and an agent that&#8217;s producing plausible-looking output that nobody has actually verified. Some vendors have formalized this with embedded specialists, technical product managers or <strong>forward-deployed engineers</strong> whose job is to live inside the customer&#8217;s workflow until the deployment actually holds. Without that kind of ownership, agents in production become side projects maintained by whoever built the prototype, and the research bears this out: organizations that skip dedicated ownership are dramatically more likely to face failures that require rolling the whole thing back. The talent profile this requires doesn&#8217;t map cleanly onto any existing job title, which is part of why so many companies are either building it from scratch or outsourcing it to implementation specialists who&#8217;ve learned these lessons on someone else&#8217;s dime.</p><h4>Integration Is the New Moat</h4><p>The next durable advantage in enterprise agents will not come from picking a cleverer model. It will come from making agents usable inside messy, specific, high-stakes workflows. A legal team does not need a model that can sound like general counsel in the abstract. It needs a system that can find the right contract, apply the company&#8217;s negotiation playbook, respect approval thresholds, and leave a clean audit trail. A customer support operation does not gain much by giving every human agent a better copilot. It gains real leverage by redesigning the service so routine cases resolve end to end, with people handling the work that requires judgment, empathy, or escalation.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!DDck!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F09831e3c-0632-41e4-92a1-80765db60adf_1766x988.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!DDck!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F09831e3c-0632-41e4-92a1-80765db60adf_1766x988.jpeg 424w, https://substackcdn.com/image/fetch/$s_!DDck!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F09831e3c-0632-41e4-92a1-80765db60adf_1766x988.jpeg 848w, https://substackcdn.com/image/fetch/$s_!DDck!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F09831e3c-0632-41e4-92a1-80765db60adf_1766x988.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!DDck!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F09831e3c-0632-41e4-92a1-80765db60adf_1766x988.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!DDck!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F09831e3c-0632-41e4-92a1-80765db60adf_1766x988.jpeg" width="1456" height="815" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/09831e3c-0632-41e4-92a1-80765db60adf_1766x988.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:815,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:338045,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/195682387?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F09831e3c-0632-41e4-92a1-80765db60adf_1766x988.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!DDck!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F09831e3c-0632-41e4-92a1-80765db60adf_1766x988.jpeg 424w, https://substackcdn.com/image/fetch/$s_!DDck!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F09831e3c-0632-41e4-92a1-80765db60adf_1766x988.jpeg 848w, https://substackcdn.com/image/fetch/$s_!DDck!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F09831e3c-0632-41e4-92a1-80765db60adf_1766x988.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!DDck!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F09831e3c-0632-41e4-92a1-80765db60adf_1766x988.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>A manufacturer running fragmented ERP, procurement, and plant systems will not close the competitive gap by waiting for larger models. It will close the gap by modernizing the connective tissue of the business. The raw intelligence of foundation models is becoming a commodity faster than most people expected. What remains scarce is the integration depth, the governed data access, the workflow redesign, and the operational ownership that make agents actually hold in production.</p><p>That gap is also a market. Vendors, integrators, and internal transformation teams that can do the unglamorous work, connecting legacy systems, capturing institutional knowledge, building evaluation frameworks, managing the organizational change, are sitting in front of a real and durable opportunity. The distance between what foundation models can theoretically do and what enterprises can actually deploy is not closing on its own. The companies that treat integration, governance, and workflow redesign as the product rather than the plumbing will be the ones that turn agents from demos into operating leverage.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!u06F!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcbddd853-5a26-49ff-bb72-3987e22372f9_1920x1080.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!u06F!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcbddd853-5a26-49ff-bb72-3987e22372f9_1920x1080.jpeg 424w, https://substackcdn.com/image/fetch/$s_!u06F!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcbddd853-5a26-49ff-bb72-3987e22372f9_1920x1080.jpeg 848w, https://substackcdn.com/image/fetch/$s_!u06F!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcbddd853-5a26-49ff-bb72-3987e22372f9_1920x1080.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!u06F!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcbddd853-5a26-49ff-bb72-3987e22372f9_1920x1080.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!u06F!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcbddd853-5a26-49ff-bb72-3987e22372f9_1920x1080.jpeg" width="1456" height="819" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/cbddd853-5a26-49ff-bb72-3987e22372f9_1920x1080.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:836318,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/195682387?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcbddd853-5a26-49ff-bb72-3987e22372f9_1920x1080.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!u06F!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcbddd853-5a26-49ff-bb72-3987e22372f9_1920x1080.jpeg 424w, https://substackcdn.com/image/fetch/$s_!u06F!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcbddd853-5a26-49ff-bb72-3987e22372f9_1920x1080.jpeg 848w, https://substackcdn.com/image/fetch/$s_!u06F!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcbddd853-5a26-49ff-bb72-3987e22372f9_1920x1080.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!u06F!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcbddd853-5a26-49ff-bb72-3987e22372f9_1920x1080.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><div><hr></div><h1><a href="https://mp.weixin.qq.com/s/4BX75t0z-pJ_kQWrEPcLkw?utm_source=gradientflow&amp;utm_medium=newsletter">The Three Pillars Behind DeepSeek&#8217;s $51.5B Premium</a></h1><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!RWBX!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F52f1d887-821a-4a75-ae46-1b04d6f11902_1882x1062.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!RWBX!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F52f1d887-821a-4a75-ae46-1b04d6f11902_1882x1062.jpeg 424w, https://substackcdn.com/image/fetch/$s_!RWBX!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F52f1d887-821a-4a75-ae46-1b04d6f11902_1882x1062.jpeg 848w, https://substackcdn.com/image/fetch/$s_!RWBX!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F52f1d887-821a-4a75-ae46-1b04d6f11902_1882x1062.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!RWBX!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F52f1d887-821a-4a75-ae46-1b04d6f11902_1882x1062.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!RWBX!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F52f1d887-821a-4a75-ae46-1b04d6f11902_1882x1062.jpeg" width="1456" height="822" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/52f1d887-821a-4a75-ae46-1b04d6f11902_1882x1062.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:822,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:487969,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/195682387?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F52f1d887-821a-4a75-ae46-1b04d6f11902_1882x1062.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!RWBX!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F52f1d887-821a-4a75-ae46-1b04d6f11902_1882x1062.jpeg 424w, https://substackcdn.com/image/fetch/$s_!RWBX!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F52f1d887-821a-4a75-ae46-1b04d6f11902_1882x1062.jpeg 848w, https://substackcdn.com/image/fetch/$s_!RWBX!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F52f1d887-821a-4a75-ae46-1b04d6f11902_1882x1062.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!RWBX!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F52f1d887-821a-4a75-ae46-1b04d6f11902_1882x1062.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">(<strong><a href="https://gradientflow.com/wp-content/uploads/2026/05/DeepSeek-Pillars-of-Valuation.jpeg">enlarge</a></strong>)</figcaption></figure></div><div><hr></div><p></p><p><em><strong><a href="https://gradientflow.com/disclosure/">Ben Lorica</a></strong> edits <a href="https://ethics.dev/">Ethics.dev</a> and the <a href="https://gradientflow.substack.com/">Gradient Flow newsletter</a>, and he hosts the <strong><a href="https://thedataexchange.media/">Data Exchange podcast</a></strong>. He helps organize the <strong><a href="https://aiconference.com/?utm_source=gradientflow&amp;utm_medium=newsletter">AI Conference</a></strong> and the <strong><a href="https://agentconference.com/?utm_source=gradientflow&amp;utm_medium=newsletter">AI Agent Conference</a></strong>, while also serving as the Strategic Content Chair for AI at the <strong><a href="https://events.linuxfoundation.org/">Linux Foundation</a></strong>. You can follow him on <a href="https://www.linkedin.com/in/benlorica/">Linkedin</a>, <a href="https://x.com/bigdata">X</a>, <a href="https://indieweb.social/@bigdata">Mastodon</a>, <a href="https://www.reddit.com/r/GradientFlow/">Reddit</a>, <a href="https://bsky.app/profile/gradientflow.com">Bluesky</a>, <a href="https://www.youtube.com/c/GradientFlow">YouTube</a>, or <a href="https://www.tiktok.com/@gradientflow">TikTok</a>. This newsletter is produced by <a href="https://gradientflow.com/blog/">Gradient Flow</a>.</em></p>]]></content:encoded></item><item><title><![CDATA[Why your AI bills are going up (even as tokens get cheaper) 📉💸]]></title><description><![CDATA[Put data, machine learning, and AI to work.]]></description><link>https://gradientflow.substack.com/p/why-your-ai-bills-are-going-up-even</link><guid isPermaLink="false">https://gradientflow.substack.com/p/why-your-ai-bills-are-going-up-even</guid><dc:creator><![CDATA[Ben Lorica 罗瑞卡]]></dc:creator><pubDate>Tue, 12 May 2026 13:03:08 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/1b9e1028-eb83-433a-97c2-cf6427e705be_1734x975.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://gradientflow.com/newsletter/" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Qpug!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 424w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 848w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 1272w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Qpug!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png" width="1100" height="220" data-attrs="{&quot;src&quot;:&quot;https://bucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com/public/images/0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:220,&quot;width&quot;:1100,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:24747,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:&quot;https://gradientflow.com/newsletter/&quot;,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!Qpug!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 424w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 848w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 1272w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a></figure></div><p><strong><a href="https://gradientflow.substack.com/subscribe">Subscribe</a> &#8226;<a href="https://gradientflow.substack.com/"> Previous Issues</a></strong></p><h1><strong>The End of the AI Experiment: Surviving the CFO&#8217;s New ROI Demands</strong></h1><h4><strong>Why This Has Become an Executive Issue</strong></h4><p><strong>Why is AI spend no longer just an IT budget problem?</strong> AI has crossed a threshold where aggregate spend across every department requires capital allocation discipline, not just software procurement review. <em>Every</em> function now has a case for AI investment, and someone has to decide which requests deserve ongoing funding. That decision has landed with the CFO, which means technology leaders who frame AI proposals as feature requests will lose funding to peers who can demonstrate measurable business outcomes.</p><p><strong>What do &#8220;tokenomics&#8221; and &#8220;tokenmaxxing&#8221; actually mean in practice?</strong> <em>Tokenomics</em> is simply the practical economics of AI usage: how prompts, automated workflows, and background agents translate into real spending, and whether that spending is producing value. <em>Tokenmaxxing</em> is the emerging habit of pushing more work through AI because tokens feel cheap, or because high-consumption workflows appear more productive. The instinct can be rational, but it creates a governance problem because organizations need a way to distinguish productive consumption from wasteful consumption, and most have not built that capability yet.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://gradientflow.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption"><em>Reading on a regular basis? Consider becoming a paid supporter &#128591;</em></p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p><strong>Why are AI bills climbing even though token prices keep falling?</strong> Lower unit prices are encouraging more consumption, not less. As tokens get cheaper, teams build more ambitious systems: more automated, more context-heavy, always-on agents running continuously in the background. The marginal cost of any single query feels negligible, so consumption expands to fill whatever budget exists. Organizations focused purely on negotiating lower unit prices while ignoring how their systems are designed will find their total bills climbing regardless.</p><p><strong>Why is CFO scrutiny intensifying right now?</strong> The broad experimentation phase is ending. Many organizations have deployed AI in some form, but far fewer believe those deployments have produced tangible value. Once that gap becomes visible, finance teams stop treating AI as a learning exercise and start demanding evidence for continued investment. The funding logic shifts from supporting a large portfolio of loosely defined experiments to concentrating resources on fewer workflows with a clear payback case.</p><h4><strong>What Leaders Should Actually Govern</strong></h4><p><strong>What is the right unit of control: seats, teams, vendors, or workflows?</strong> The most useful unit of governance is the individual application or workflow, not the software seat or department budget. AI costs are generated by usage patterns, not by who holds a license. A single automated workflow can quietly consume more tokens than dozens of human users combined. Budgeting at the workflow level makes it possible to see which use cases are scaling, which are overrunning, and which should be redesigned or shut down.</p><p><strong>When do spending caps help, and when do they backfire?</strong> Caps help when they prevent undisciplined growth in low-value usage, particularly when nobody can explain where the spending is coming from. They backfire when they suppress the most productive work. If your highest-consuming teams are also your highest-performing ones, a blanket ceiling is a tax on performance dressed up as financial discipline. The right sequence is to instrument outcomes first, then decide where controls belong.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!5d36!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5047bd15-1772-4b22-b937-e9748868a891_1627x965.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!5d36!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5047bd15-1772-4b22-b937-e9748868a891_1627x965.jpeg 424w, https://substackcdn.com/image/fetch/$s_!5d36!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5047bd15-1772-4b22-b937-e9748868a891_1627x965.jpeg 848w, https://substackcdn.com/image/fetch/$s_!5d36!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5047bd15-1772-4b22-b937-e9748868a891_1627x965.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!5d36!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5047bd15-1772-4b22-b937-e9748868a891_1627x965.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!5d36!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5047bd15-1772-4b22-b937-e9748868a891_1627x965.jpeg" width="514" height="305.010989010989" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/5047bd15-1772-4b22-b937-e9748868a891_1627x965.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:864,&quot;width&quot;:1456,&quot;resizeWidth&quot;:514,&quot;bytes&quot;:316898,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/194967910?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5047bd15-1772-4b22-b937-e9748868a891_1627x965.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!5d36!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5047bd15-1772-4b22-b937-e9748868a891_1627x965.jpeg 424w, https://substackcdn.com/image/fetch/$s_!5d36!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5047bd15-1772-4b22-b937-e9748868a891_1627x965.jpeg 848w, https://substackcdn.com/image/fetch/$s_!5d36!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5047bd15-1772-4b22-b937-e9748868a891_1627x965.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!5d36!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5047bd15-1772-4b22-b937-e9748868a891_1627x965.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><strong>What should leaders actually ask when a vendor proposes outcome-based pricing?</strong> <a href="https://gradientflow.substack.com/i/169593891/11-price-based-on-outcomes-not-usage">Outcome-based pricing</a> sounds appealing because it appears to align vendor incentives with business results. That alignment is not automatic. It depends entirely on how the outcome is defined, how success is verified, and what happens when the system produces something that technically triggers a charge but does not create real value. Leaders should ask who defines what counts as a valid outcome, how disputes are handled, and whether the vendor has any incentive to maximize billable events in ways that diverge from the customer&#8217;s actual objective.</p><p><strong>Why do different AI pricing models need different governance approaches? </strong>Not all AI spend behaves the same way. Subscription pricing buys predictability but can conceal waste inside a flat fee. Usage-based pricing makes activity visible but creates volatile invoices. <a href="https://gradientflow.substack.com/i/169593891/11-price-based-on-outcomes-not-usage">Outcome-based pricing</a> sounds more business-friendly, but it can obscure the operational work required to verify whether the billed result was correct, complete, and valuable. The shift toward seats-plus-consumption adds another complication: buyers may renew a familiar per-seat contract while also taking on usage charges, credits, agent actions, or outcome fees that behave very differently. Leaders need governance that matches how value is claimed, how cost is incurred, and how performance can fail. Otherwise, they risk optimizing the old pricing model while their real exposure has already moved somewhere else.</p><div class="pullquote"><p>The seat is no longer the product. Increasingly, it is just the wrapper around prepaid consumption.</p></div><h4><strong>Visibility: The Prerequisite for Everything Else</strong></h4><p><strong>What is the single most important governance gap right now?</strong> Attribution. Most organizations cannot answer the basic question of which team, workflow, or agent is consuming how many tokens, and what business outcome that consumption supports. Without that visibility, every other governance mechanism, whether caps, chargebacks, or ROI thresholds, operates on incomplete information. Solving attribution is the prerequisite for everything else.</p><p><strong>What does good visibility infrastructure actually look like?</strong> It means purpose-built dashboards that surface per-workflow and per-agent consumption in near real time, not month-end invoices that arrive with no ability to trace costs back to specific decisions or teams. <a href="https://engineering.salesforce.com/how-salesforce-engineering-operationalized-ai-productivity-at-scale/?utm_source=gradientflow&amp;utm_medium=newsletter">Salesforce</a> expanded its internal Engineering 360 dashboards to track AI usage at the workflow and team level, showing how companies often need custom visibility tools when standard reporting does not give leaders a clear view of token consumption, agent activity, and adoption patterns. This is an area where early investment in custom observability pays off rather than waiting for the vendor ecosystem to catch up.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!iDof!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F11a77d4b-66fe-4376-a5fa-38b952f1f61c_1518x977.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!iDof!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F11a77d4b-66fe-4376-a5fa-38b952f1f61c_1518x977.jpeg 424w, https://substackcdn.com/image/fetch/$s_!iDof!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F11a77d4b-66fe-4376-a5fa-38b952f1f61c_1518x977.jpeg 848w, https://substackcdn.com/image/fetch/$s_!iDof!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F11a77d4b-66fe-4376-a5fa-38b952f1f61c_1518x977.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!iDof!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F11a77d4b-66fe-4376-a5fa-38b952f1f61c_1518x977.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!iDof!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F11a77d4b-66fe-4376-a5fa-38b952f1f61c_1518x977.jpeg" width="520" height="334.64285714285717" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/11a77d4b-66fe-4376-a5fa-38b952f1f61c_1518x977.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:937,&quot;width&quot;:1456,&quot;resizeWidth&quot;:520,&quot;bytes&quot;:209575,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/194967910?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F11a77d4b-66fe-4376-a5fa-38b952f1f61c_1518x977.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!iDof!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F11a77d4b-66fe-4376-a5fa-38b952f1f61c_1518x977.jpeg 424w, https://substackcdn.com/image/fetch/$s_!iDof!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F11a77d4b-66fe-4376-a5fa-38b952f1f61c_1518x977.jpeg 848w, https://substackcdn.com/image/fetch/$s_!iDof!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F11a77d4b-66fe-4376-a5fa-38b952f1f61c_1518x977.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!iDof!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F11a77d4b-66fe-4376-a5fa-38b952f1f61c_1518x977.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><strong>How does token consumption become a productivity signal rather than just a cost metric?</strong> High token consumption and high-quality output often correlate. Before setting any controls, connect token spend to actual business outcomes: deals closed, issues resolved, code shipped, churn prevented. Once you have that picture, invest more in the high-correlation workflows and scrutinize the rest. Organizations that skip this step and go straight to spending ceilings risk penalizing their most productive teams first.</p><h4><strong>Practical Governance Mechanisms That Work</strong></h4><p><strong>What is the most actionable governance step we can take right now?</strong> Set per-application token budgets with automated alerting thresholds, and require cost-impact assessments for any new AI feature before it ships. Build that review into sprint planning rather than treating it as a finance team afterthought. This embeds financial discipline into the development process rather than bolting it on after costs have already run up.</p><p><strong>What are FinOps practices and why do they matter for AI?</strong> FinOps is the discipline of bringing financial accountability to technology spend through collaboration between engineering, finance, and business teams. Applied to AI, it means forecasting token demand before projects launch, setting ROI approval gates for competing use cases, and implementing chargebacks so business units bear the actual cost of their own consumption. The chargeback mechanism in particular creates real incentives for teams to ask whether their usage is justified.</p><div class="pullquote"><p>If your highest-consuming teams are also your highest-performing ones, a blanket spending cap is just a tax on performance dressed up as financial discipline.</p></div><p><strong>How should infrastructure choices factor into AI cost governance?</strong> Stop treating all AI workloads as equivalent from a cost perspective. Public cloud is the right choice for experimentation and burst capacity where flexibility justifies the premium. Predictable, high-volume inference workloads are better suited to private or on-premises infrastructure where fixed costs outperform consumption pricing over time. Defaulting everything to public cloud absorbs a premium that compounds significantly as workloads scale.</p><h4><strong>Procurement and Organizational Risk</strong></h4><p><strong>Our vendor contracts are still per-seat. Is that a problem?</strong> Yes. Per-seat pricing no longer maps cleanly to how AI systems generate costs. In many AI-heavy products, the seat is becoming a wrapper around a base level of included usage rather than a reliable proxy for total cost. Every prompt, automated workflow, and background agent can burn tokens regardless of how many people are licensed, creating invoice volatility that per-seat budgeting cannot predict. Push for hybrid models that combine a predictable baseline fee with usage-based pricing above agreed thresholds, with explicit price caps, volume commitments, reporting rights, and overage terms built in.</p><p><strong>What changes when a seat becomes a consumption bundle?</strong> The license still matters because it controls access, but it no longer tells you enough about cost. Two teams with the same number of seats can generate very different bills if one uses AI for occasional drafting and the other runs context-heavy agents across customer support, software development, or security workflows. Procurement teams therefore need to negotiate included usage, overage rates, usage reporting, and contractual limits on unexpected consumption. The buying question shifts from &#8220;how many people need access?&#8221; to &#8220;how much machine work are we authorizing?&#8221;</p><p><strong>What is the governance maturity gap for agentic AI?</strong> Agentic AI refers to systems that take sequences of actions autonomously rather than responding to a single prompt. That matters economically because an agent is not naturally a seat-based user. It performs tasks, calls tools, consumes tokens, and may keep working after the human has stepped away. Research suggests only about one in five organizations planning to deploy agentic AI has a mature governance model in place. Without clear accountability structures and performance metrics, organizations accumulate what practitioners call &#8220;content debt,&#8221; meaning AI-generated outputs requiring human remediation that erode the ROI case for further investment. Building governance before you scale is significantly cheaper than retrofitting it after problems surface.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!DjvW!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3b580ad3-38af-42bd-a802-b7ca0c5c68e9_1804x1019.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!DjvW!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3b580ad3-38af-42bd-a802-b7ca0c5c68e9_1804x1019.jpeg 424w, https://substackcdn.com/image/fetch/$s_!DjvW!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3b580ad3-38af-42bd-a802-b7ca0c5c68e9_1804x1019.jpeg 848w, https://substackcdn.com/image/fetch/$s_!DjvW!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3b580ad3-38af-42bd-a802-b7ca0c5c68e9_1804x1019.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!DjvW!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3b580ad3-38af-42bd-a802-b7ca0c5c68e9_1804x1019.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!DjvW!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3b580ad3-38af-42bd-a802-b7ca0c5c68e9_1804x1019.jpeg" width="1456" height="822" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/3b580ad3-38af-42bd-a802-b7ca0c5c68e9_1804x1019.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:822,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:427898,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/194967910?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3b580ad3-38af-42bd-a802-b7ca0c5c68e9_1804x1019.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!DjvW!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3b580ad3-38af-42bd-a802-b7ca0c5c68e9_1804x1019.jpeg 424w, https://substackcdn.com/image/fetch/$s_!DjvW!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3b580ad3-38af-42bd-a802-b7ca0c5c68e9_1804x1019.jpeg 848w, https://substackcdn.com/image/fetch/$s_!DjvW!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3b580ad3-38af-42bd-a802-b7ca0c5c68e9_1804x1019.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!DjvW!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3b580ad3-38af-42bd-a802-b7ca0c5c68e9_1804x1019.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><strong>How should we frame AI cost governance to get board-level attention?</strong> Frame it as a competitive risk, not a budget management problem. Unmanaged AI consumption erodes margins in a way that compounds over time, and organizations that govern their AI economics well will have a structural cost advantage over those that do not. Tokens are becoming a real operational input, and treating them with the same rigor applied to energy procurement or capital expenditure is not optional for organizations that intend to scale AI seriously</p><div><hr></div><h1>&#127895;&#65039;Cerebras IPO&#127895;&#65039;</h1><p><a href="https://www.cerebras.ai/press-release/cerebras-systems-announces-filing-of-registration-statement-for-proposed-initial-ipo?utm_source=gradientflow&amp;utm_medium=newsletter">Cerebras is going public</a> <strong>this week</strong>, a milestone for an AI infrastructure company I have followed since its early days. I first met CEO Andrew Feldman in early 2018, before Cerebras had released its first processors and when the company was still focused mainly on AI <strong>training</strong>. After its first-generation chip came out, one of the first talks the team gave was at a conference I co-chaired in 2019. What makes this IPO especially interesting now is Cerebras&#8217;s growing focus on <strong>inference</strong>, the work of running trained AI models to produce answers, code, images, or other outputs. That shift matters as more enterprises move AI into production and as reasoning models use more compute while generating responses, not just during training. For those of us who build, buy, or use AI applications, another strong, speed-focused alternative to Nvidia is welcome news.</p><div class="twitter-embed" data-attrs="{&quot;url&quot;:&quot;https://x.com/bigdata/status/1171833782963326976&quot;,&quot;full_text&quot;:&quot;Great to hear about <span class=\&quot;tweet-fake-link\&quot;>@cerebras</span> new Wafer Scale hardware technology from their CEO Andrew Feldman  <span class=\&quot;tweet-fake-link\&quot;>#OReillyAI</span> &quot;,&quot;username&quot;:&quot;bigdata&quot;,&quot;name&quot;:&quot;Ben Lorica &#32599;&#29790;&#21345;&quot;,&quot;profile_image_url&quot;:&quot;https://pbs.substack.com/profile_images/235298259/bigdata_logo_center3_normal.jpg&quot;,&quot;date&quot;:&quot;2019-09-11T17:12:00.000Z&quot;,&quot;photos&quot;:[{&quot;img_url&quot;:&quot;https://pbs.substack.com/media/EEMwc0MU4AESqpI.jpg&quot;,&quot;link_url&quot;:&quot;https://t.co/O4HFyHXBkl&quot;}],&quot;quoted_tweet&quot;:{},&quot;reply_count&quot;:1,&quot;retweet_count&quot;:7,&quot;like_count&quot;:8,&quot;impression_count&quot;:0,&quot;expanded_url&quot;:null,&quot;video_url&quot;:null,&quot;belowTheFold&quot;:true}" data-component-name="Twitter2ToDOM"></div><div><hr></div><p></p><p><em><strong><a href="https://gradientflow.com/disclosure/">Ben Lorica</a></strong> edits <a href="https://ethics.dev/">Ethics.dev</a> and the <a href="https://gradientflow.substack.com/">Gradient Flow newsletter</a>, and he hosts the <strong><a href="https://thedataexchange.media/">Data Exchange podcast</a></strong>. He helps organize the <strong><a href="https://aiconference.com/?utm_source=gradientflow&amp;utm_medium=newsletter">AI Conference</a></strong> and the <strong><a href="https://agentconference.com/?utm_source=gradientflow&amp;utm_medium=newsletter">AI Agent Conference</a></strong>, while also serving as the Strategic Content Chair for AI at the <strong><a href="https://events.linuxfoundation.org/">Linux Foundation</a></strong>. You can follow him on <a href="https://www.linkedin.com/in/benlorica/">Linkedin</a>, <a href="https://x.com/bigdata">X</a>, <a href="https://indieweb.social/@bigdata">Mastodon</a>, <a href="https://www.reddit.com/r/GradientFlow/">Reddit</a>, <a href="https://bsky.app/profile/gradientflow.com">Bluesky</a>, <a href="https://www.youtube.com/c/GradientFlow">YouTube</a>, or <a href="https://www.tiktok.com/@gradientflow">TikTok</a>. This newsletter is produced by <a href="https://gradientflow.com/blog/">Gradient Flow</a>.</em></p>]]></content:encoded></item><item><title><![CDATA[Your AI agent looks capable. But can it actually finish the job?]]></title><description><![CDATA[Put data, machine learning, and AI to work.]]></description><link>https://gradientflow.substack.com/p/your-ai-agent-looks-capable-but-can</link><guid isPermaLink="false">https://gradientflow.substack.com/p/your-ai-agent-looks-capable-but-can</guid><dc:creator><![CDATA[Ben Lorica 罗瑞卡]]></dc:creator><pubDate>Tue, 05 May 2026 13:02:02 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/653c7fc9-f9f9-4278-af19-c3fe0c1fbb45_1426x983.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://gradientflow.com/newsletter/" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Qpug!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 424w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 848w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 1272w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Qpug!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png" width="1100" height="220" data-attrs="{&quot;src&quot;:&quot;https://bucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com/public/images/0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:220,&quot;width&quot;:1100,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:24747,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:&quot;https://gradientflow.com/newsletter/&quot;,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!Qpug!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 424w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 848w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 1272w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a></figure></div><p><strong><a href="https://gradientflow.substack.com/subscribe">Subscribe</a> &#8226;<a href="https://gradientflow.substack.com/"> Previous Issues</a></strong></p><h1><strong>Why Your AI Agents Fail in Production (And How to Actually Test Them)</strong></h1><p>In a <strong><a href="https://gradientflow.substack.com/p/your-ai-model-isnt-the-problem-its">previous post</a></strong>, I argued that deploying autonomous AI agents reliably is not primarily a model problem. It is an environment problem. The gap between a capable foundation model and a production-ready system is bridged by <a href="https://gradientflow.substack.com/p/your-ai-model-isnt-the-problem-its">harness engineering</a>: the discipline of building structured workflows, validation loops, and governance mechanisms around the model rather than inside it. The central argument was that organizations that treat the surrounding environment as the primary engineering target outperform those that chase better models, and that this principle applies across every domain where agents handle complex, consequential work.</p><p>That argument raises an immediate practical question: how do you actually know whether an agent is ready for that work? Most existing evaluations still measure narrow, low-friction tasks in controlled or synthetic environments. They can tell you whether a model produces a plausible answer or completes a neat subtask, but they reveal far less about whether an agent can stay coherent across a long workflow, adapt when something breaks, and finish a job that actually runs. Many benchmarks were simple enough that top models were already approaching perfect scores, leaving no meaningful signal about which systems could handle real work and which could not.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://gradientflow.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption"><em>Value this newsletter? Consider becoming a paid supporter &#128591;</em></p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p>What production usually demands is sustained execution under friction: long chains of interdependent actions, genuine error recovery, and deep domain knowledge applied to messy, open-ended objectives. That is a fundamentally different test than anything most benchmarks were designed to measure. The commercial stakes behind closing that gap are no longer abstract. Terminal-based coding agents alone are already generating billions in revenue, which means accurate measurement of what these systems can and cannot do in realistic conditions has moved from research interest to commercial necessity for anyone building, deploying, or investing in AI agent products.</p><h4>Measuring the Agent, Not the Demo</h4><p>Some of the most capable autonomous agents in production today are still concentrated in coding and software engineering. That makes sense. The terminal is one of the few environments where success criteria are clear and feedback arrives immediately. An agent cannot hide behind a fluent answer when a build fails, a dependency breaks, or a command returns the wrong output. It has to keep working until the job is done.</p><p><strong><a href="https://www.tbench.ai/?utm_source=gradientflow&amp;utm_medium=newsletter">Terminal Bench</a></strong> was built around that reality. It places agents inside real terminal environments loaded with the files, packages, and system configurations needed for the task. Each problem includes an instruction, a verification script, and a reference solution. What gets measured is not whether the agent followed a preferred sequence of steps, but whether it reached a machine-checkable result. There is no partial credit for looking competent. The output either works or it does not.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!pdgf!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff088d0d1-8c71-4068-af1a-338a56047165_1217x898.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!pdgf!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff088d0d1-8c71-4068-af1a-338a56047165_1217x898.jpeg 424w, https://substackcdn.com/image/fetch/$s_!pdgf!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff088d0d1-8c71-4068-af1a-338a56047165_1217x898.jpeg 848w, https://substackcdn.com/image/fetch/$s_!pdgf!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff088d0d1-8c71-4068-af1a-338a56047165_1217x898.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!pdgf!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff088d0d1-8c71-4068-af1a-338a56047165_1217x898.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!pdgf!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff088d0d1-8c71-4068-af1a-338a56047165_1217x898.jpeg" width="446" height="329.0944946589975" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f088d0d1-8c71-4068-af1a-338a56047165_1217x898.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:898,&quot;width&quot;:1217,&quot;resizeWidth&quot;:446,&quot;bytes&quot;:172332,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/194341875?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff088d0d1-8c71-4068-af1a-338a56047165_1217x898.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!pdgf!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff088d0d1-8c71-4068-af1a-338a56047165_1217x898.jpeg 424w, https://substackcdn.com/image/fetch/$s_!pdgf!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff088d0d1-8c71-4068-af1a-338a56047165_1217x898.jpeg 848w, https://substackcdn.com/image/fetch/$s_!pdgf!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff088d0d1-8c71-4068-af1a-338a56047165_1217x898.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!pdgf!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff088d0d1-8c71-4068-af1a-338a56047165_1217x898.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>The significance is not just that Terminal Bench is harder. It is harder in ways that matter. Previous benchmarks often measured narrow command-line skills, relied on synthetic environments, or used tasks so short that they revealed little about sustained execution. Terminal Bench instead asks whether an agent can manage long sequences of dependent actions, recover from real error messages, and apply domain knowledge to open-ended work. Its rigor also comes from the curation. Every task in Terminal Bench is manually reviewed to reduce broken tests, underspecified instructions, and loopholes that let agents game the evaluation. The early results show why that level of manual verification matters. Frontier agents still fail more than a third of the tasks, and smaller models perform much worse. For anyone building, deploying, or investing in agents, that makes Terminal Bench less a leaderboard curiosity than a practical instrument for separating systems that look capable from systems that can actually finish difficult work.</p><div class="pullquote"><p>Evaluation is no longer a report card at the end of a cycle. It belongs directly inside the development stack.</p></div><h4>The Emerging Infrastructure for Real Agent Evaluation</h4><p>Terminal Bench has not developed in isolation. A small but important set of related efforts is now building on the same premise that agent evaluation should look more like real technical work and less like a polished demo. <a href="https://arxiv.org/html/2602.14337v2">LongCLI-Bench</a> pushes this further by focusing on longer command line tasks and by adding step-level scoring, so an agent is penalized not only for failing to finish the job but also for breaking something that previously worked. That is a meaningful advance for anyone building production agents, because regression is often the real failure mode. <a href="https://arxiv.org/html/2601.20882v1">DevOps-Gym</a> pushes the boundary further into real-world software operations, evaluating agents on their ability to configure builds, monitor systems, and resolve live issues instead of just completing isolated terminal prompts.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ulE3!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faa2ed9dc-b967-43ec-acfa-5d70d4f5c077_1411x979.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ulE3!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faa2ed9dc-b967-43ec-acfa-5d70d4f5c077_1411x979.jpeg 424w, https://substackcdn.com/image/fetch/$s_!ulE3!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faa2ed9dc-b967-43ec-acfa-5d70d4f5c077_1411x979.jpeg 848w, https://substackcdn.com/image/fetch/$s_!ulE3!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faa2ed9dc-b967-43ec-acfa-5d70d4f5c077_1411x979.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!ulE3!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faa2ed9dc-b967-43ec-acfa-5d70d4f5c077_1411x979.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ulE3!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faa2ed9dc-b967-43ec-acfa-5d70d4f5c077_1411x979.jpeg" width="564" height="391.32246633593195" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/aa2ed9dc-b967-43ec-acfa-5d70d4f5c077_1411x979.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:979,&quot;width&quot;:1411,&quot;resizeWidth&quot;:564,&quot;bytes&quot;:348308,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/194341875?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faa2ed9dc-b967-43ec-acfa-5d70d4f5c077_1411x979.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!ulE3!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faa2ed9dc-b967-43ec-acfa-5d70d4f5c077_1411x979.jpeg 424w, https://substackcdn.com/image/fetch/$s_!ulE3!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faa2ed9dc-b967-43ec-acfa-5d70d4f5c077_1411x979.jpeg 848w, https://substackcdn.com/image/fetch/$s_!ulE3!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faa2ed9dc-b967-43ec-acfa-5d70d4f5c077_1411x979.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!ulE3!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faa2ed9dc-b967-43ec-acfa-5d70d4f5c077_1411x979.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">(<strong><a href="https://gradientflow.com/wp-content/uploads/2026/05/Terminal-Bench-and-Agent-Eval-%E2%80%94-evolution.jpeg">enlarge</a></strong>)</figcaption></figure></div><p>The ecosystem is also expanding into the infrastructure needed to train and compare terminal agents at scale. <a href="https://arxiv.org/html/2602.07274v1">TermiGen</a> addresses one of the clearest bottlenecks exposed by Terminal Bench, namely the cost of hand-building realistic environments and trajectories for training and evaluation. Terminus and Terminus 2 provide reference implementations for how a terminal agent can interact with a live shell over many steps, which makes them useful both as engineering baselines and as cleaner testbeds for comparing models. And the appearance of <strong>fine-tuned terminal-focused systems</strong> such as <a href="https://github.com/terminal-agent/reptile">Reptile</a>, <a href="https://huggingface.co/Lite-Coder/LiteCoder-Terminal-30b-a3b-sft">LiteCoder-Terminal</a>, and <a href="https://arxiv.org/html/2602.07274v1">TerminalAgent</a> suggests that terminal competence is now being treated as a distinct capability worth training for directly. Taken together, these developments make Terminal Bench look less like a standalone benchmark and more like the anchor for a broader effort to measure and improve agents that have to do real work under real constraints.</p><h4>What Comes Next, and What It Means Outside the Terminal</h4><p>The near-term roadmap for Terminal Bench is really about keeping the benchmark informative. As agents improve, benchmarks only stay useful if they keep stretching the best systems, which means adding harder tasks, expanding domain coverage, and refreshing the suite before leaderboard movement stops meaning anything. Just as important, the Terminal Bench team is making an explicit argument that manual verification is not optional. Their experience showed that confirming task correctness, closing loopholes, and fully specifying success conditions takes substantial human effort, and that cost rises with task complexity.</p><p>The infrastructure supporting Terminal-Bench has become more valuable than the benchmark itself. <strong><a href="https://www.harborframework.com/?utm_source=gradientflow&amp;utm_medium=newsletter">Harbor</a></strong> serves as the expanded framework that allows developers to go beyond basic testing, giving them the tools to optimize AI prompts, run trial-and-error learning, and perform automated quality checks on their agents. That is a meaningful shift. Evaluation is no longer a report card at the end of a development cycle. It is moving into the development stack, not sitting outside it. The ecosystem growing around Terminal Bench is what that shift looks like in practice.</p><div class="pullquote"><p>The real leverage comes from the surrounding system, not just the model.</p></div><p>For companies building agents outside coding, this is probably the clearest near-term lesson. The winning approach is unlikely to be full autonomy across messy, high-stakes workflows. It is more likely to be structured human-agent collaboration inside tightly engineered environments. That fits the larger argument in my <a href="https://gradientflow.substack.com/p/your-ai-model-isnt-the-problem-its">previous post</a>. The real leverage comes from the surrounding system, not just the model. Terminal Bench sharpens that claim by showing that even in a domain where feedback is fast and success is machine-checkable, reliable autonomous performance remains limited. In domains where mistakes are subtler and more consequential, companies will need even more harness, more evaluation, and more deliberate handoffs between automated execution and human judgment.</p><div><hr></div><h1><a href="https://gradientflow.com/quantum-computing-supply-chains/">Quantum Computing Supply Chains</a></h1><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!PnBL!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F78588544-e241-4930-8a56-c6236a506489_2752x2217.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!PnBL!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F78588544-e241-4930-8a56-c6236a506489_2752x2217.jpeg 424w, https://substackcdn.com/image/fetch/$s_!PnBL!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F78588544-e241-4930-8a56-c6236a506489_2752x2217.jpeg 848w, https://substackcdn.com/image/fetch/$s_!PnBL!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F78588544-e241-4930-8a56-c6236a506489_2752x2217.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!PnBL!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F78588544-e241-4930-8a56-c6236a506489_2752x2217.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!PnBL!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F78588544-e241-4930-8a56-c6236a506489_2752x2217.jpeg" width="1456" height="1173" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/78588544-e241-4930-8a56-c6236a506489_2752x2217.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1173,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:780261,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/194341875?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F78588544-e241-4930-8a56-c6236a506489_2752x2217.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!PnBL!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F78588544-e241-4930-8a56-c6236a506489_2752x2217.jpeg 424w, https://substackcdn.com/image/fetch/$s_!PnBL!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F78588544-e241-4930-8a56-c6236a506489_2752x2217.jpeg 848w, https://substackcdn.com/image/fetch/$s_!PnBL!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F78588544-e241-4930-8a56-c6236a506489_2752x2217.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!PnBL!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F78588544-e241-4930-8a56-c6236a506489_2752x2217.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">(<strong><a href="https://gradientflow.com/wp-content/uploads/2026/05/Quantum-Computing-Supply-Chain-unpacked.jpeg">enlarge</a></strong>)</figcaption></figure></div><div><hr></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!9-Az!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F731b9cfa-33da-4ac0-acc8-b5f016254df1_1851x1003.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!9-Az!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F731b9cfa-33da-4ac0-acc8-b5f016254df1_1851x1003.jpeg 424w, https://substackcdn.com/image/fetch/$s_!9-Az!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F731b9cfa-33da-4ac0-acc8-b5f016254df1_1851x1003.jpeg 848w, https://substackcdn.com/image/fetch/$s_!9-Az!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F731b9cfa-33da-4ac0-acc8-b5f016254df1_1851x1003.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!9-Az!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F731b9cfa-33da-4ac0-acc8-b5f016254df1_1851x1003.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!9-Az!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F731b9cfa-33da-4ac0-acc8-b5f016254df1_1851x1003.jpeg" width="1456" height="789" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/731b9cfa-33da-4ac0-acc8-b5f016254df1_1851x1003.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:789,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:414005,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/194341875?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F731b9cfa-33da-4ac0-acc8-b5f016254df1_1851x1003.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!9-Az!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F731b9cfa-33da-4ac0-acc8-b5f016254df1_1851x1003.jpeg 424w, https://substackcdn.com/image/fetch/$s_!9-Az!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F731b9cfa-33da-4ac0-acc8-b5f016254df1_1851x1003.jpeg 848w, https://substackcdn.com/image/fetch/$s_!9-Az!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F731b9cfa-33da-4ac0-acc8-b5f016254df1_1851x1003.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!9-Az!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F731b9cfa-33da-4ac0-acc8-b5f016254df1_1851x1003.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><ul><li><p><strong><a href="https://amzn.to/4mLr84L">The Laws of Thought: The Quest for a Mathematical Theory of the Mind</a></strong>. AI is starting to shape the economy, the job market, and national security. That&#8217;s why I think more people should understand the concepts in this highly readable book.</p></li><li><p><strong><a href="https://amzn.to/3QfKWkP">Mutiny: The Rise and Revolt of the College-Educated Working Class</a></strong>. The college grads making lattes and running Apple Store demos are already living the gap between what their degrees promised and what the economy delivers. With AI moving into knowledge work, that gap may widen for many more people.</p></li><li><p><strong><a href="https://amzn.to/4cuCmpu">London Falling</a></strong>. One of my favorite writers delivers again here. I came away feeling like London is not just the setting but almost a character in its own right, with all its glamour, fraud, and seedy underbelly baked into the story.</p></li></ul><div><hr></div><p></p><p><em><strong><a href="https://gradientflow.com/disclosure/">Ben Lorica</a></strong> edits <a href="https://ethics.dev/">Ethics.dev</a> and the <a href="https://gradientflow.substack.com/">Gradient Flow newsletter</a>, and he hosts the <strong><a href="https://thedataexchange.media/">Data Exchange podcast</a></strong>. He helps organize the <strong><a href="https://aiconference.com/?utm_source=gradientflow&amp;utm_medium=newsletter">AI Conference</a></strong> and the <strong><a href="https://agentconference.com/?utm_source=gradientflow&amp;utm_medium=newsletter">AI Agent Conference</a></strong>, while also serving as the Strategic Content Chair for AI at the <strong><a href="https://events.linuxfoundation.org/">Linux Foundation</a></strong>. You can follow him on <a href="https://www.linkedin.com/in/benlorica/">Linkedin</a>, <a href="https://x.com/bigdata">X</a>, <a href="https://indieweb.social/@bigdata">Mastodon</a>, <a href="https://www.reddit.com/r/GradientFlow/">Reddit</a>, <a href="https://bsky.app/profile/gradientflow.com">Bluesky</a>, <a href="https://www.youtube.com/c/GradientFlow">YouTube</a>, or <a href="https://www.tiktok.com/@gradientflow">TikTok</a>. This newsletter is produced by <a href="https://gradientflow.com/blog/">Gradient Flow</a>.</em></p>]]></content:encoded></item><item><title><![CDATA[Generation is cheap. Evaluation is everything.]]></title><description><![CDATA[Put data, machine learning, and AI to work.]]></description><link>https://gradientflow.substack.com/p/the-real-ai-bottleneck-isnt-generation</link><guid isPermaLink="false">https://gradientflow.substack.com/p/the-real-ai-bottleneck-isnt-generation</guid><dc:creator><![CDATA[Ben Lorica 罗瑞卡]]></dc:creator><pubDate>Tue, 28 Apr 2026 13:02:07 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/891cbf52-c61d-4cb8-a5be-ca88a55d2c76_1447x899.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://gradientflow.com/newsletter/" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Qpug!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 424w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 848w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 1272w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Qpug!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png" width="1100" height="220" data-attrs="{&quot;src&quot;:&quot;https://bucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com/public/images/0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:220,&quot;width&quot;:1100,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:24747,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:&quot;https://gradientflow.com/newsletter/&quot;,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!Qpug!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 424w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 848w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 1272w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a></figure></div><p><strong><a href="https://gradientflow.substack.com/subscribe">Subscribe</a> &#8226;<a href="https://gradientflow.substack.com/"> Previous Issues</a></strong></p><h1><strong>What mathematicians figured out about AI that most enterprises haven't</strong></h1><p>Recent results suggest that research mathematics is no longer a purely speculative test case for AI. A growing set of examples shows AI contributing not just to short <em>contest puzzles,</em> but to open-ended mathematical work that requires literature search, cross-domain connection-making, revision, and verification. The important lesson for enterprise AI teams is not that AI has suddenly become a mathematician. It is that progress accelerates in settings where outputs can be checked, workflows are iterative, and human experts remain responsible for choosing the right problems. Mathematics makes these conditions unusually visible, but the broader pattern maps directly to enterprise AI systems built around retrieval, structured feedback, and human oversight.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!0525!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F903dce67-71dd-4f09-a272-c92cc2a9a8fd_1878x997.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!0525!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F903dce67-71dd-4f09-a272-c92cc2a9a8fd_1878x997.jpeg 424w, https://substackcdn.com/image/fetch/$s_!0525!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F903dce67-71dd-4f09-a272-c92cc2a9a8fd_1878x997.jpeg 848w, https://substackcdn.com/image/fetch/$s_!0525!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F903dce67-71dd-4f09-a272-c92cc2a9a8fd_1878x997.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!0525!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F903dce67-71dd-4f09-a272-c92cc2a9a8fd_1878x997.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!0525!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F903dce67-71dd-4f09-a272-c92cc2a9a8fd_1878x997.jpeg" width="558" height="296.2458791208791" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/903dce67-71dd-4f09-a272-c92cc2a9a8fd_1878x997.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:773,&quot;width&quot;:1456,&quot;resizeWidth&quot;:558,&quot;bytes&quot;:230245,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/193517012?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F903dce67-71dd-4f09-a272-c92cc2a9a8fd_1878x997.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!0525!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F903dce67-71dd-4f09-a272-c92cc2a9a8fd_1878x997.jpeg 424w, https://substackcdn.com/image/fetch/$s_!0525!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F903dce67-71dd-4f09-a272-c92cc2a9a8fd_1878x997.jpeg 848w, https://substackcdn.com/image/fetch/$s_!0525!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F903dce67-71dd-4f09-a272-c92cc2a9a8fd_1878x997.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!0525!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F903dce67-71dd-4f09-a272-c92cc2a9a8fd_1878x997.jpeg 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Research mathematics might seem like the last domain to yield to AI, given its reputation for requiring deep intuition and creative leaps. Yet several structural features make it surprisingly well-suited for AI augmentation. Mathematical claims are either correct or incorrect, which means <strong>AI outputs can be verified with certainty, either by human experts or by formal proof systems</strong>. This eliminates the trust problem that plagues AI in domains where ground truth is subjective. Additionally, a large fraction of research work involves tasks that are tedious but not conceptually deep: writing experimental code, checking computations, finding citations, exploring minor cases, and surveying literature across subfields. AI handles these tasks well, freeing human researchers for higher-level reasoning.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://gradientflow.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption"><em>Gradient Flow is a reader-supported publication, consider becoming a paid subscriber.</em></p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p>The existence of formal proof languages and large mathematical libraries creates an infrastructure that is uniquely favorable for AI integration. Tools like <a href="https://lean-lang.org/">Lean</a> and <a href="https://lean-lang.org/use-cases/mathlib/">Mathlib</a> translate abstract mathematics into machine-readable, computationally certified code, providing both a training ground for AI systems and a verification layer for their outputs. This ecosystem enables a genuinely new workflow where AI proposes and humans verify, or where AI explores at scale and humans direct the search. The result is not replacement but restructuring: mathematics is becoming a hybrid process where large-scale exploration, formal verification, and human-guided insight combine into a new mode of discovery.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!-N24!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F91b5d5e8-0927-43d7-85f7-6d2b62e12168_1892x945.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!-N24!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F91b5d5e8-0927-43d7-85f7-6d2b62e12168_1892x945.jpeg 424w, https://substackcdn.com/image/fetch/$s_!-N24!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F91b5d5e8-0927-43d7-85f7-6d2b62e12168_1892x945.jpeg 848w, https://substackcdn.com/image/fetch/$s_!-N24!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F91b5d5e8-0927-43d7-85f7-6d2b62e12168_1892x945.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!-N24!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F91b5d5e8-0927-43d7-85f7-6d2b62e12168_1892x945.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!-N24!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F91b5d5e8-0927-43d7-85f7-6d2b62e12168_1892x945.jpeg" width="1456" height="727" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/91b5d5e8-0927-43d7-85f7-6d2b62e12168_1892x945.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:727,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:253073,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/193517012?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F91b5d5e8-0927-43d7-85f7-6d2b62e12168_1892x945.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!-N24!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F91b5d5e8-0927-43d7-85f7-6d2b62e12168_1892x945.jpeg 424w, https://substackcdn.com/image/fetch/$s_!-N24!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F91b5d5e8-0927-43d7-85f7-6d2b62e12168_1892x945.jpeg 848w, https://substackcdn.com/image/fetch/$s_!-N24!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F91b5d5e8-0927-43d7-85f7-6d2b62e12168_1892x945.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!-N24!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F91b5d5e8-0927-43d7-85f7-6d2b62e12168_1892x945.jpeg 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h4>How AI Is Reshaping Mathematical Workflows</h4><p>Research mathematicians currently deploy AI across a few workflows, starting as a disciplined assistant for work that is useful but easy to verify. They hand off tasks such as writing experimental code, checking large numbers of computational cases, and finding citations, but only when the result can be tested independently. That trust boundary matters. Researchers are not treating a language model as an oracle, but rather using it where mistakes are visible and fixable. A second pattern is broader literature scanning. Because mathematics is so specialized, important ideas are often buried in distant subfields or older papers. AI helps widen the search, systematically surfacing analogies and relevant results from outside a researcher&#8217;s immediate area. The system acts as a multiplier for breadth, while human experts provide the depth of insight needed to judge which connections are genuinely useful.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!7hWF!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F559a2c8e-da5d-4acc-baf0-271d5eb9637a_3897x2249.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!7hWF!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F559a2c8e-da5d-4acc-baf0-271d5eb9637a_3897x2249.jpeg 424w, https://substackcdn.com/image/fetch/$s_!7hWF!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F559a2c8e-da5d-4acc-baf0-271d5eb9637a_3897x2249.jpeg 848w, https://substackcdn.com/image/fetch/$s_!7hWF!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F559a2c8e-da5d-4acc-baf0-271d5eb9637a_3897x2249.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!7hWF!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F559a2c8e-da5d-4acc-baf0-271d5eb9637a_3897x2249.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!7hWF!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F559a2c8e-da5d-4acc-baf0-271d5eb9637a_3897x2249.jpeg" width="1456" height="840" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/559a2c8e-da5d-4acc-baf0-271d5eb9637a_3897x2249.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:840,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:796486,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/193517012?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F559a2c8e-da5d-4acc-baf0-271d5eb9637a_3897x2249.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!7hWF!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F559a2c8e-da5d-4acc-baf0-271d5eb9637a_3897x2249.jpeg 424w, https://substackcdn.com/image/fetch/$s_!7hWF!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F559a2c8e-da5d-4acc-baf0-271d5eb9637a_3897x2249.jpeg 848w, https://substackcdn.com/image/fetch/$s_!7hWF!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F559a2c8e-da5d-4acc-baf0-271d5eb9637a_3897x2249.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!7hWF!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F559a2c8e-da5d-4acc-baf0-271d5eb9637a_3897x2249.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">(<strong><a href="https://gradientflow.com/wp-content/uploads/2026/04/AI-in-Math-%E2%80%94-current-workflows.jpeg">enlarge</a></strong>)</figcaption></figure></div><p>A more advanced set of workflows is now taking shape where verification is even stronger. Some researchers use systems that combine pattern-based AI with formal logic to get verified answers to specific research questions. The AI performs extended reasoning and translates the result into a formal proof language, ensuring the output is mathematically guaranteed. In parallel, other mathematicians are using AI to conduct large-scale experimental surveys of entire problem classes. The AI systematically maps the landscape, resolving routine cases automatically and flagging the smaller set of problems that still demand new conceptual approaches. <strong>The broader lesson is not that AI replaces expert reasoning. It is that AI becomes transformative when paired with structured feedback, clear verification, and a workflow that lets humans concentrate on the genuinely hard parts.</strong></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!J_l0!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fef607324-5beb-4b1f-8165-39d2c9fe5521_1908x1072.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!J_l0!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fef607324-5beb-4b1f-8165-39d2c9fe5521_1908x1072.jpeg 424w, https://substackcdn.com/image/fetch/$s_!J_l0!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fef607324-5beb-4b1f-8165-39d2c9fe5521_1908x1072.jpeg 848w, https://substackcdn.com/image/fetch/$s_!J_l0!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fef607324-5beb-4b1f-8165-39d2c9fe5521_1908x1072.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!J_l0!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fef607324-5beb-4b1f-8165-39d2c9fe5521_1908x1072.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!J_l0!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fef607324-5beb-4b1f-8165-39d2c9fe5521_1908x1072.jpeg" width="1456" height="818" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ef607324-5beb-4b1f-8165-39d2c9fe5521_1908x1072.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:818,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:452694,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/193517012?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fef607324-5beb-4b1f-8165-39d2c9fe5521_1908x1072.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!J_l0!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fef607324-5beb-4b1f-8165-39d2c9fe5521_1908x1072.jpeg 424w, https://substackcdn.com/image/fetch/$s_!J_l0!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fef607324-5beb-4b1f-8165-39d2c9fe5521_1908x1072.jpeg 848w, https://substackcdn.com/image/fetch/$s_!J_l0!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fef607324-5beb-4b1f-8165-39d2c9fe5521_1908x1072.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!J_l0!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fef607324-5beb-4b1f-8165-39d2c9fe5521_1908x1072.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">(<strong><a href="https://gradientflow.com/wp-content/uploads/2026/04/AI-in-Math-%E2%80%94-reaction-of-mathematicians.jpeg">enlarge</a></strong>)</figcaption></figure></div><h4>Possible Future Workflows for AI-Guided Mathematical Discovery</h4><p>As these systems mature, research mathematics will likely illustrate a broader shift now visible in many knowledge-heavy fields. The mathematician of the future will spend less time carrying out long derivations by hand and more time deciding which problems deserve attention and where scarce computing resources should be focused. The transition mirrors the shift in software engineering from writing individual lines of code to designing system architectures. AI will handle the execution layer by searching for possible proof paths, scanning literature across distant subfields, and checking results in formal verification environments. The human role moves upward toward judgment, priority setting, and significance. For enterprise AI teams, this is a familiar pattern. As systems improve, the bottleneck shifts away from raw execution and toward problem selection, workflow design, and review.</p><p>A more ambitious trajectory sees AI moving from a disciplined assistant to an active research partner. In this model, the system proposes hypotheses, surfaces analogies, and explores many more directions than any one person could track alone. In specific settings, this could eventually extend to largely autonomous research loops where the AI identifies tractable open problems, tests candidate solutions, and drafts papers. Early versions of these autonomous agents already exist and have successfully resolved minor historical math puzzles. Yet the likely near-term outcome at the frontier of research is not full autonomy. It is a staged model in which AI expands the search space while humans retain responsibility for deciding what matters, which results are credible, and who bears accountability for the final product. That lesson should resonate well beyond mathematics. In enterprise environments, the most valuable AI systems will not be the ones that fully replace experts. They will be the ones that widen exploration, tighten verification, and make expert attention more selective and strategic.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!hA2L!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe1eca913-b790-4b12-984e-8e5ba1251590_3932x1993.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!hA2L!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe1eca913-b790-4b12-984e-8e5ba1251590_3932x1993.jpeg 424w, https://substackcdn.com/image/fetch/$s_!hA2L!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe1eca913-b790-4b12-984e-8e5ba1251590_3932x1993.jpeg 848w, https://substackcdn.com/image/fetch/$s_!hA2L!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe1eca913-b790-4b12-984e-8e5ba1251590_3932x1993.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!hA2L!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe1eca913-b790-4b12-984e-8e5ba1251590_3932x1993.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!hA2L!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe1eca913-b790-4b12-984e-8e5ba1251590_3932x1993.jpeg" width="1456" height="738" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/e1eca913-b790-4b12-984e-8e5ba1251590_3932x1993.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:738,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:818454,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/193517012?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe1eca913-b790-4b12-984e-8e5ba1251590_3932x1993.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!hA2L!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe1eca913-b790-4b12-984e-8e5ba1251590_3932x1993.jpeg 424w, https://substackcdn.com/image/fetch/$s_!hA2L!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe1eca913-b790-4b12-984e-8e5ba1251590_3932x1993.jpeg 848w, https://substackcdn.com/image/fetch/$s_!hA2L!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe1eca913-b790-4b12-984e-8e5ba1251590_3932x1993.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!hA2L!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe1eca913-b790-4b12-984e-8e5ba1251590_3932x1993.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">(<strong><a href="https://gradientflow.com/wp-content/uploads/2026/04/AI-in-Math-%E2%80%94-future-workflows.jpeg">enlarge</a></strong>)</figcaption></figure></div><h4>Beyond Generation: What Math Reveals About Production AI</h4><p>What mathematical AI makes especially clear is that hallucination is not mainly a prompting problem. It is an architectural problem, and it demands structural solutions. For high-stakes enterprise systems, prompt engineering and occasional human spot-checks are too weak to carry the load. The more durable pattern is to combine the breadth and fluency of generative models with deterministic checks: a code model paired with automated tests, a contract assistant constrained by a compliance engine, or an agent whose actions are filtered through policies, schemas, and hard business rules. Once those components are embedded in iterative loops in which the system generates, critiques, and revises, organizations can push AI further without simply scaling the risk. That is the broader lesson from mathematics. As generation gets cheaper and more abundant, the real bottleneck shifts to evaluation. Teams that build strong verification environments will be far better positioned than those that produce large volumes of plausible but weakly checked output.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!yn_E!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F100c8039-0b2b-4c1d-bd92-49085c7fffee_3948x1762.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!yn_E!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F100c8039-0b2b-4c1d-bd92-49085c7fffee_3948x1762.jpeg 424w, https://substackcdn.com/image/fetch/$s_!yn_E!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F100c8039-0b2b-4c1d-bd92-49085c7fffee_3948x1762.jpeg 848w, https://substackcdn.com/image/fetch/$s_!yn_E!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F100c8039-0b2b-4c1d-bd92-49085c7fffee_3948x1762.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!yn_E!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F100c8039-0b2b-4c1d-bd92-49085c7fffee_3948x1762.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!yn_E!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F100c8039-0b2b-4c1d-bd92-49085c7fffee_3948x1762.jpeg" width="1456" height="650" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/100c8039-0b2b-4c1d-bd92-49085c7fffee_3948x1762.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:650,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:916669,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/193517012?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F100c8039-0b2b-4c1d-bd92-49085c7fffee_3948x1762.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!yn_E!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F100c8039-0b2b-4c1d-bd92-49085c7fffee_3948x1762.jpeg 424w, https://substackcdn.com/image/fetch/$s_!yn_E!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F100c8039-0b2b-4c1d-bd92-49085c7fffee_3948x1762.jpeg 848w, https://substackcdn.com/image/fetch/$s_!yn_E!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F100c8039-0b2b-4c1d-bd92-49085c7fffee_3948x1762.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!yn_E!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F100c8039-0b2b-4c1d-bd92-49085c7fffee_3948x1762.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">(<strong><a href="https://gradientflow.com/wp-content/uploads/2026/04/AI-in-Math-%E2%80%94-lessons-for-AI-teams.jpeg">enlarge</a></strong>)</figcaption></figure></div><p>That design choice also changes the role of the expert. When systems are built for verifiability from the start, the machine can take on more of the first-pass burden of checking, leaving humans to focus on higher-order judgment: deciding what matters, what is worth pursuing, and where ambiguity still requires experience and taste.</p><p>Mathematics offers an early preview of that shift. The strongest mathematicians, perhaps even future <strong>Fields Medalists</strong>, may be the ones who are best at using AI to explore more possibilities, test more ideas, and concentrate their own effort where originality matters most. The same logic applies across the enterprise. <a href="https://gradientflow.substack.com/p/are-you-using-ai-or-is-it-replacing">The most effective knowledge workers</a> will not simply be those who use AI to move faster. They will be the <a href="https://gradientflow.substack.com/p/are-you-using-ai-or-is-it-replacing">ones who know how to direct it </a>across a large search space, surround it with structural feedback, and intervene at the moments where human judgment creates the most leverage.</p><div><hr></div><h1><a href="https://gradientflow.com/us-china-frontier-models/">The AI Race Is No Longer Just About Benchmarks</a></h1><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!bRGu!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5b212f3f-dd0e-4778-bd42-43d5647247cd_1844x1043.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!bRGu!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5b212f3f-dd0e-4778-bd42-43d5647247cd_1844x1043.jpeg 424w, https://substackcdn.com/image/fetch/$s_!bRGu!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5b212f3f-dd0e-4778-bd42-43d5647247cd_1844x1043.jpeg 848w, https://substackcdn.com/image/fetch/$s_!bRGu!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5b212f3f-dd0e-4778-bd42-43d5647247cd_1844x1043.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!bRGu!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5b212f3f-dd0e-4778-bd42-43d5647247cd_1844x1043.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!bRGu!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5b212f3f-dd0e-4778-bd42-43d5647247cd_1844x1043.jpeg" width="1456" height="824" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/5b212f3f-dd0e-4778-bd42-43d5647247cd_1844x1043.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:824,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:491984,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/193517012?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5b212f3f-dd0e-4778-bd42-43d5647247cd_1844x1043.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!bRGu!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5b212f3f-dd0e-4778-bd42-43d5647247cd_1844x1043.jpeg 424w, https://substackcdn.com/image/fetch/$s_!bRGu!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5b212f3f-dd0e-4778-bd42-43d5647247cd_1844x1043.jpeg 848w, https://substackcdn.com/image/fetch/$s_!bRGu!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5b212f3f-dd0e-4778-bd42-43d5647247cd_1844x1043.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!bRGu!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5b212f3f-dd0e-4778-bd42-43d5647247cd_1844x1043.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">From: <strong><a href="https://gradientflow.com/us-china-frontier-models/">"China's AI Strengths Are Real. So Are the Structural Drags Behind Them."</a></strong></figcaption></figure></div><p></p><div><hr></div><h1>Introducing rote&#8482;: Procedural Memory for AI Agents</h1><p>In a <a href="https://gradientflow.substack.com/p/the-missing-layer-in-todays-agent">previous piece</a>, I argued that AI teams need operational memory, not just larger context windows or more elaborate prompts. <strong><a href="https://www.modiqo.ai/?utm_source=gradientflow&amp;utm_medium=newsletter">Modiqo&#8217;s rote&#8482;</a></strong> pushes that idea into a concrete system: agents should reason through genuinely new work once, then capture what worked as reusable procedural memory. That matters because many teams are now discovering the same failure mode in production agent systems: demos are easy, but reliable, repeatable workflows are hard. The challenge is no longer just connecting a model to tools. It is designing an environment where agents can remember successful work, reuse proven flows, and <strong>reduce the cost of rediscovering the same answer over and over.</strong> For teams trying to move from impressive prototypes to durable AI infrastructure, Modiqo is worth watching. <strong><a href="https://www.modiqo.ai/?utm_source=gradientflow&amp;utm_medium=newsletter">Check out rote&#8482;</a></strong>.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!d2e0!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01f41d2c-d741-499e-b10a-79ef62696e97_1816x663.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!d2e0!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01f41d2c-d741-499e-b10a-79ef62696e97_1816x663.jpeg 424w, https://substackcdn.com/image/fetch/$s_!d2e0!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01f41d2c-d741-499e-b10a-79ef62696e97_1816x663.jpeg 848w, https://substackcdn.com/image/fetch/$s_!d2e0!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01f41d2c-d741-499e-b10a-79ef62696e97_1816x663.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!d2e0!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01f41d2c-d741-499e-b10a-79ef62696e97_1816x663.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!d2e0!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01f41d2c-d741-499e-b10a-79ef62696e97_1816x663.jpeg" width="1456" height="532" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/01f41d2c-d741-499e-b10a-79ef62696e97_1816x663.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:532,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:186979,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/193517012?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01f41d2c-d741-499e-b10a-79ef62696e97_1816x663.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!d2e0!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01f41d2c-d741-499e-b10a-79ef62696e97_1816x663.jpeg 424w, https://substackcdn.com/image/fetch/$s_!d2e0!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01f41d2c-d741-499e-b10a-79ef62696e97_1816x663.jpeg 848w, https://substackcdn.com/image/fetch/$s_!d2e0!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01f41d2c-d741-499e-b10a-79ef62696e97_1816x663.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!d2e0!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01f41d2c-d741-499e-b10a-79ef62696e97_1816x663.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.modiqo.ai/?utm_source=gradientflow&amp;utm_medium=newsletter&quot;,&quot;text&quot;:&quot;Learn More&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.modiqo.ai/?utm_source=gradientflow&amp;utm_medium=newsletter"><span>Learn More</span></a></p><div><hr></div><p></p><p><em><strong><a href="https://gradientflow.com/disclosure/">Ben Lorica</a></strong> edits <a href="https://ethics.dev/">Ethics.dev</a> and the <a href="https://gradientflow.substack.com/">Gradient Flow newsletter</a>, and he hosts the <strong><a href="https://thedataexchange.media/">Data Exchange podcast</a></strong>. He helps organize the <strong><a href="https://aiconference.com/?utm_source=gradientflow&amp;utm_medium=newsletter">AI Conference</a></strong> and the <strong><a href="https://agentconference.com/?utm_source=gradientflow&amp;utm_medium=newsletter">AI Agent Conference</a></strong>, while also serving as the Strategic Content Chair for AI at the <strong><a href="https://events.linuxfoundation.org/">Linux Foundation</a></strong>. You can follow him on <a href="https://www.linkedin.com/in/benlorica/">Linkedin</a>, <a href="https://x.com/bigdata">X</a>, <a href="https://indieweb.social/@bigdata">Mastodon</a>, <a href="https://www.reddit.com/r/GradientFlow/">Reddit</a>, <a href="https://bsky.app/profile/gradientflow.com">Bluesky</a>, <a href="https://www.youtube.com/c/GradientFlow">YouTube</a>, or <a href="https://www.tiktok.com/@gradientflow">TikTok</a>. This newsletter is produced by <a href="https://gradientflow.com/blog/">Gradient Flow</a>.</em></p>]]></content:encoded></item><item><title><![CDATA[​​Stop tweaking your AI models. Do this instead.]]></title><description><![CDATA[Put data, machine learning, and AI to work.]]></description><link>https://gradientflow.substack.com/p/your-ai-model-isnt-the-problem-its</link><guid isPermaLink="false">https://gradientflow.substack.com/p/your-ai-model-isnt-the-problem-its</guid><dc:creator><![CDATA[Ben Lorica 罗瑞卡]]></dc:creator><pubDate>Tue, 21 Apr 2026 13:01:33 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/bc360f01-45ce-41cd-af1d-4758539702e1_1419x1023.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://gradientflow.com/newsletter/" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Qpug!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 424w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 848w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 1272w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Qpug!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png" width="1100" height="220" data-attrs="{&quot;src&quot;:&quot;https://bucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com/public/images/0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:220,&quot;width&quot;:1100,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:24747,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:&quot;https://gradientflow.com/newsletter/&quot;,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!Qpug!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 424w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 848w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 1272w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a></figure></div><p><strong><a href="https://gradientflow.substack.com/subscribe">Subscribe</a> &#8226;<a href="https://gradientflow.substack.com/"> Previous Issues</a></strong></p><h1><strong>The Missing Layer: Why Your AI Agent Fails &#8212; and What Actually Fixes It</strong></h1><p>As organizations move autonomous AI agents from experimental sandboxes into live production, a critical bottleneck has emerged. Foundation models are remarkably capable but structurally unsuited to complex, multi-step work on their own. They have no persistent memory, no built-in sense of what is allowed, and no reliable way to stay on track across a long workflow. Left to their own devices, foundation models hallucinate bad decisions, lose track of context mid-task, and generate cascading errors that are expensive to unwind.</p><p>Software engineering teams were the first to hit this wall at scale, and their response offers a practical blueprint for every domain now building AI-powered applications. Their conclusion was counterintuitive: scaling AI reliably is not primarily about making the underlying model smarter. It requires a completely different discipline focused on building a structured, automated environment around the model. That discipline is called <strong>harness engineering</strong>, and its principles extend well beyond writing software.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://gradientflow.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption"><em>Fan of the newsletter? Consider becoming a paid supporter &#128591;</em></p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><h4>The Core Concept</h4><p>Harness engineering treats the AI model as a fixed engine and builds the entire operational system around it: workflows, specifications, validation loops, context strategies, tool interfaces, and governance mechanisms. The model stays the same. Everything around it changes.</p><p>The distinction from adjacent approaches is fundamental, not incremental. <em>Prompt engineering</em> optimizes what you say to a model for a single interaction. <em>Model fine-tuning</em> adjusts the model&#8217;s internal weights to adapt it to a specific domain. Harness engineering does neither. It accepts the model as it is and focuses entirely on the environment the agent operates inside. Practitioners describe it as &#8220;meta engineering: building the factory rather than the product.&#8221;</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!GUvA!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc28d2714-744e-456b-9099-80984803401f_1593x853.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!GUvA!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc28d2714-744e-456b-9099-80984803401f_1593x853.jpeg 424w, https://substackcdn.com/image/fetch/$s_!GUvA!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc28d2714-744e-456b-9099-80984803401f_1593x853.jpeg 848w, https://substackcdn.com/image/fetch/$s_!GUvA!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc28d2714-744e-456b-9099-80984803401f_1593x853.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!GUvA!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc28d2714-744e-456b-9099-80984803401f_1593x853.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!GUvA!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc28d2714-744e-456b-9099-80984803401f_1593x853.jpeg" width="506" height="271.07142857142856" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/c28d2714-744e-456b-9099-80984803401f_1593x853.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:780,&quot;width&quot;:1456,&quot;resizeWidth&quot;:506,&quot;bytes&quot;:208714,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/192873052?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc28d2714-744e-456b-9099-80984803401f_1593x853.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!GUvA!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc28d2714-744e-456b-9099-80984803401f_1593x853.jpeg 424w, https://substackcdn.com/image/fetch/$s_!GUvA!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc28d2714-744e-456b-9099-80984803401f_1593x853.jpeg 848w, https://substackcdn.com/image/fetch/$s_!GUvA!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc28d2714-744e-456b-9099-80984803401f_1593x853.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!GUvA!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc28d2714-744e-456b-9099-80984803401f_1593x853.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>The formal framing organizes a harness along three dimensions. Context covers the declarative and procedural knowledge that informs the agent. Constraint covers the rules governing agent output both before and after it is produced. Convergence is the iterative process by which constraints are evaluated, gaps identified, and rules refined until the harness reaches what practitioners call structural <em>idempotence</em>, the point at which re-applying the checks produces no further changes. The system has stabilized.</p><p>OpenAI&#8217;s Codex team put the shift plainly: a software engineering team&#8217;s primary job is no longer to write code, but to design environments, specify intent, and build feedback loops that allow agents to do reliable work. The mental model practitioners use involves three nested loops. The <strong>outer loop, at the project level</strong>, handles intent capture through specifications, architecture documents, knowledge bases, governance rules, and human oversight. The <strong>middle loop, at the task level</strong>, handles execution through the agent&#8217;s active work cycle. The <strong>inner loop, at the action level</strong>, handles verification through immediate feedback, automated tests, and automated rule checks that scan each output against a defined set of constraints and flag anything that violates them. This layered architecture originated in software development, but any team deploying autonomous AI agents to handle complex workflows faces the same underlying problem: a capable model is not the same as a reliable system.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!CBRL!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ed6ae30-78dc-4af4-98a3-1c3d073b24e1_1015x914.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!CBRL!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ed6ae30-78dc-4af4-98a3-1c3d073b24e1_1015x914.jpeg 424w, https://substackcdn.com/image/fetch/$s_!CBRL!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ed6ae30-78dc-4af4-98a3-1c3d073b24e1_1015x914.jpeg 848w, https://substackcdn.com/image/fetch/$s_!CBRL!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ed6ae30-78dc-4af4-98a3-1c3d073b24e1_1015x914.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!CBRL!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ed6ae30-78dc-4af4-98a3-1c3d073b24e1_1015x914.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!CBRL!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ed6ae30-78dc-4af4-98a3-1c3d073b24e1_1015x914.jpeg" width="362" height="325.97832512315273" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2ed6ae30-78dc-4af4-98a3-1c3d073b24e1_1015x914.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:914,&quot;width&quot;:1015,&quot;resizeWidth&quot;:362,&quot;bytes&quot;:229288,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/192873052?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ed6ae30-78dc-4af4-98a3-1c3d073b24e1_1015x914.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!CBRL!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ed6ae30-78dc-4af4-98a3-1c3d073b24e1_1015x914.jpeg 424w, https://substackcdn.com/image/fetch/$s_!CBRL!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ed6ae30-78dc-4af4-98a3-1c3d073b24e1_1015x914.jpeg 848w, https://substackcdn.com/image/fetch/$s_!CBRL!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ed6ae30-78dc-4af4-98a3-1c3d073b24e1_1015x914.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!CBRL!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ed6ae30-78dc-4af4-98a3-1c3d073b24e1_1015x914.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h4>The Anatomy of a Reliable Agent System</h4><p>Translating the lessons of software harness engineering into broader business applications leads to a set of operational patterns. These practices group into four categories that shape how teams design, build, and govern autonomous agents in any high-stakes domain.</p><h5>The Strategic Mindset Shift</h5><p>The most fundamental adjustment for teams deploying AI agents is a complete inversion of their daily focus. Rather than optimizing the underlying model or manually reviewing every output, practitioners must shift their attention to architecting the environment where the agent operates. This means treating the model as a fixed engine and investing in the surrounding validation infrastructure, turning domain experts from manual reviewers into system designers. The evidence for this reframing is concrete: <a href="https://blog.langchain.com/improving-deep-agents-with-harness-engineering/?utm_source=gradientflow&amp;utm_medium=newsletter">LangChain moved a coding agent</a> from 30th to 5th place on an industry benchmark by changing only the harness, not the model. The same dynamic applies in any domain. A legal research agent will not become reliably accurate by switching to a more capable model if the validation layer cannot catch citation errors. When an agent produces a bad output, the first question should be &#8220;what is missing from the surrounding environment?&#8221; not &#8220;how do we change the model?&#8221;</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://gradientflow.com/wp-content/uploads/2026/04/Harness-Engineering-mindset-shift.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!cAaq!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F34f49e7e-1dbb-4acc-bfdc-82bab2c6fb43_1504x860.jpeg 424w, https://substackcdn.com/image/fetch/$s_!cAaq!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F34f49e7e-1dbb-4acc-bfdc-82bab2c6fb43_1504x860.jpeg 848w, https://substackcdn.com/image/fetch/$s_!cAaq!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F34f49e7e-1dbb-4acc-bfdc-82bab2c6fb43_1504x860.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!cAaq!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F34f49e7e-1dbb-4acc-bfdc-82bab2c6fb43_1504x860.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!cAaq!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F34f49e7e-1dbb-4acc-bfdc-82bab2c6fb43_1504x860.jpeg" width="1456" height="833" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/34f49e7e-1dbb-4acc-bfdc-82bab2c6fb43_1504x860.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:833,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:196011,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:&quot;https://gradientflow.com/wp-content/uploads/2026/04/Harness-Engineering-mindset-shift.jpeg&quot;,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/192873052?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F34f49e7e-1dbb-4acc-bfdc-82bab2c6fb43_1504x860.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!cAaq!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F34f49e7e-1dbb-4acc-bfdc-82bab2c6fb43_1504x860.jpeg 424w, https://substackcdn.com/image/fetch/$s_!cAaq!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F34f49e7e-1dbb-4acc-bfdc-82bab2c6fb43_1504x860.jpeg 848w, https://substackcdn.com/image/fetch/$s_!cAaq!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F34f49e7e-1dbb-4acc-bfdc-82bab2c6fb43_1504x860.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!cAaq!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F34f49e7e-1dbb-4acc-bfdc-82bab2c6fb43_1504x860.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>The true return on investment for an agent system is not measured in tasks completed but in expert human attention hours saved. Every failure pattern encoded as an automated rule reduces future review burden, which means the harness is not a setup cost but a compounding asset that grows more valuable with each iteration. The primary deliverable for the domain expert becomes the testing suite and evaluation pipelines rather than the content the agent produces. A domain expert who insists on reviewing every output does not become a quality guarantor. They become a ceiling on what the system can ever achieve.</p><h5>Architecture and Orchestration</h5><p>Once the mindset shifts to environment design, the architecture of the system must enforce strict boundaries and predictable workflows. Rather than granting a single agent freeform autonomy to complete a complex task, robust systems rely on structured orchestration. A fixed control layer governs how the process moves from one defined step to the next, so the agent is never left to decide on its own what to do next or whether a step can be skipped. A clinical documentation agent should never decide on its own whether to file a record, request clarification, or escalate to a physician. A procurement agent should not unilaterally skip an approval step because the task looks routine. Structured orchestration makes agent behavior auditable, predictable, and recoverable regardless of domain.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!oBre!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4f3a6e22-f40f-4d0c-87bc-f2d85e796d7e_1671x976.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!oBre!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4f3a6e22-f40f-4d0c-87bc-f2d85e796d7e_1671x976.jpeg 424w, https://substackcdn.com/image/fetch/$s_!oBre!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4f3a6e22-f40f-4d0c-87bc-f2d85e796d7e_1671x976.jpeg 848w, https://substackcdn.com/image/fetch/$s_!oBre!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4f3a6e22-f40f-4d0c-87bc-f2d85e796d7e_1671x976.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!oBre!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4f3a6e22-f40f-4d0c-87bc-f2d85e796d7e_1671x976.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!oBre!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4f3a6e22-f40f-4d0c-87bc-f2d85e796d7e_1671x976.jpeg" width="620" height="361.95054945054943" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/4f3a6e22-f40f-4d0c-87bc-f2d85e796d7e_1671x976.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:850,&quot;width&quot;:1456,&quot;resizeWidth&quot;:620,&quot;bytes&quot;:237559,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/192873052?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4f3a6e22-f40f-4d0c-87bc-f2d85e796d7e_1671x976.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!oBre!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4f3a6e22-f40f-4d0c-87bc-f2d85e796d7e_1671x976.jpeg 424w, https://substackcdn.com/image/fetch/$s_!oBre!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4f3a6e22-f40f-4d0c-87bc-f2d85e796d7e_1671x976.jpeg 848w, https://substackcdn.com/image/fetch/$s_!oBre!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4f3a6e22-f40f-4d0c-87bc-f2d85e796d7e_1671x976.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!oBre!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4f3a6e22-f40f-4d0c-87bc-f2d85e796d7e_1671x976.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Paired with this are two supporting elements. Teams must also build <a href="https://gradientflow.substack.com/p/the-missing-layer-in-todays-agent">durable documents</a> that live permanently inside the agent&#8217;s operating environment. These files encode the institutional knowledge the agent needs to behave consistently across every session: regulatory constraints, brand voice rules, escalation thresholds, and the rationale behind key decisions. Without them, agents operate from whatever context a user happens to provide, which produces inconsistent and unpredictable behavior. These anchors ensure consistent behavior regardless of how a session begins, rather than relying on ad hoc instructions that vary by user. Complex tasks should also be decomposed into specialized agent roles with explicit, structured handoffs between them.</p><p>A grant writing system might feature a research agent, a drafting agent, and a compliance review agent operating in sequence. Specialization narrows the blast radius of individual failures and makes each component easier to test and improve independently. Every connection to an external tool or data source must carry explicit permission limits enforced mechanically, because without those boundaries, agents can access sensitive data inappropriately or trigger irreversible operations in connected systems, and that risk scales directly with the number of agents running in parallel.</p><h5>Validation, Feedback, and Escalation</h5><p>A well-architected environment requires automated mechanisms to catch errors early and correct them cheaply. The strategic optimization target is not preventing every error through exhaustive upfront review but detecting errors fast and reversing them at the lowest possible cost. Reliable systems build this through three layered feedback mechanisms working in sequence: structural checks that block invalid outputs and return actionable fix instructions the agent can act on without human translation, runtime observability through logs and metrics that make execution visible to both agents and humans, and agent-led self-review that audits outputs before escalating only genuinely novel cases to a human expert. Layering all three creates a system that catches errors at the cheapest possible point in the pipeline rather than routing everything through a human approval queue that cannot scale.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!hgnn!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F51dd6018-5070-47db-af29-3c722bd6aead_1310x913.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!hgnn!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F51dd6018-5070-47db-af29-3c722bd6aead_1310x913.jpeg 424w, https://substackcdn.com/image/fetch/$s_!hgnn!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F51dd6018-5070-47db-af29-3c722bd6aead_1310x913.jpeg 848w, https://substackcdn.com/image/fetch/$s_!hgnn!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F51dd6018-5070-47db-af29-3c722bd6aead_1310x913.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!hgnn!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F51dd6018-5070-47db-af29-3c722bd6aead_1310x913.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!hgnn!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F51dd6018-5070-47db-af29-3c722bd6aead_1310x913.jpeg" width="538" height="374.9572519083969" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/51dd6018-5070-47db-af29-3c722bd6aead_1310x913.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:913,&quot;width&quot;:1310,&quot;resizeWidth&quot;:538,&quot;bytes&quot;:172541,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/192873052?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F51dd6018-5070-47db-af29-3c722bd6aead_1310x913.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!hgnn!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F51dd6018-5070-47db-af29-3c722bd6aead_1310x913.jpeg 424w, https://substackcdn.com/image/fetch/$s_!hgnn!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F51dd6018-5070-47db-af29-3c722bd6aead_1310x913.jpeg 848w, https://substackcdn.com/image/fetch/$s_!hgnn!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F51dd6018-5070-47db-af29-3c722bd6aead_1310x913.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!hgnn!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F51dd6018-5070-47db-af29-3c722bd6aead_1310x913.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Escalation itself must be triggered by specific, pre-defined conditions encoded in the harness, not left to the agent&#8217;s discretion. This prevents both under-escalation, where agents proceed when they should not, and the equally damaging pattern of agents interrupting human experts for routine decisions that automated checks could handle. Teams should build evaluation criteria that evolve as they observe how agents actually fail in production. The mistakes that matter most in a live environment are almost never the ones engineers anticipate before launch. The ultimate design goal across all of this is convergence: a harness mature enough that re-applying its constraint checks produces no further changes, and the system has reached a stable, rule-compliant state.</p><h5>Critical Anti-Patterns</h5><p>Understanding how these systems fail is as important as knowing how to build them. The most dangerous trap is silent state corruption, where an agent generates outputs that look structurally plausible but contain semantic errors that accumulate undetected until they cause cascading damage that is expensive to unwind. This failure is more insidious outside software because the feedback signals are slower and weaker than a failed build or broken test. A research synthesis agent that subtly misattributes sources, a clinical documentation agent that quietly introduces dosage errors, or a financial analysis agent that gradually drifts from regulatory definitions all represent silent corruption that looks fine on the surface until it does not. The problem gets worse when teams rely on the conversation history itself to track where the workflow stands, rather than maintaining a separate, explicit record of progress. When something goes wrong in a system built this way, there is no clean state to inspect or replay. The only record of what happened is buried in a thread of messages that the agent may have interpreted differently at each step.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!4bUC!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdc444c23-332d-4b10-9d21-a1b84cefebca_1428x932.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!4bUC!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdc444c23-332d-4b10-9d21-a1b84cefebca_1428x932.jpeg 424w, https://substackcdn.com/image/fetch/$s_!4bUC!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdc444c23-332d-4b10-9d21-a1b84cefebca_1428x932.jpeg 848w, https://substackcdn.com/image/fetch/$s_!4bUC!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdc444c23-332d-4b10-9d21-a1b84cefebca_1428x932.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!4bUC!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdc444c23-332d-4b10-9d21-a1b84cefebca_1428x932.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!4bUC!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdc444c23-332d-4b10-9d21-a1b84cefebca_1428x932.jpeg" width="498" height="325.02521008403363" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/dc444c23-332d-4b10-9d21-a1b84cefebca_1428x932.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:932,&quot;width&quot;:1428,&quot;resizeWidth&quot;:498,&quot;bytes&quot;:177560,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/192873052?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdc444c23-332d-4b10-9d21-a1b84cefebca_1428x932.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!4bUC!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdc444c23-332d-4b10-9d21-a1b84cefebca_1428x932.jpeg 424w, https://substackcdn.com/image/fetch/$s_!4bUC!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdc444c23-332d-4b10-9d21-a1b84cefebca_1428x932.jpeg 848w, https://substackcdn.com/image/fetch/$s_!4bUC!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdc444c23-332d-4b10-9d21-a1b84cefebca_1428x932.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!4bUC!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdc444c23-332d-4b10-9d21-a1b84cefebca_1428x932.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>When AI teams deploy multi-agent configurations without verification gates between handoffs, each agent&#8217;s small mistake becomes an input assumption for the next, producing catastrophically wrong final outputs from a chain of individually plausible-looking steps. A multi-agent insurance underwriting pipeline where a data extraction agent makes a small error, a risk scoring agent builds on that error, and a pricing agent compounds it further illustrates how quickly the damage accumulates and how difficult it becomes to trace back to its origin. When guardrails and oversight structures are absent from the start, technical debt accumulates faster than teams can address it. Retrofitting those controls after a production failure is dramatically more expensive than building even a minimal harness from day one. A <a href="https://gradientflow.substack.com/p/the-missing-layer-in-todays-agent">persistent context </a>document, a few structural checks, and a basic escalation rule is substantially better than none, and it provides a foundation for iterative improvement rather than a crisis-driven rebuild.</p><h4>The Architect or the Bottleneck</h4><p>Harness engineering emerged from software development because that is where autonomous AI agents hit production scale first, but the underlying problem it solves is not specific to code. Any domain deploying agents to handle complex, multi-step work with real consequences faces the same structural gap: a capable model is not a reliable system. Those principles are not specific to software. They apply with equal force to legal research, clinical documentation, financial analysis, and procurement, anywhere that errors compound and someone is ultimately accountable for the output. The names change. The structural requirements do not. The gap between a raw model and a reliable production system is bridged entirely by the environment built around it. That means every organization deploying AI agents faces the same choice software teams have already confronted: invest in the environment that makes agents reliable, or spend your time cleaning up after ones that are not.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Y3GU!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F053815a6-26d7-4cc6-8ce3-b08fe91844f2_3812x1349.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Y3GU!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F053815a6-26d7-4cc6-8ce3-b08fe91844f2_3812x1349.jpeg 424w, https://substackcdn.com/image/fetch/$s_!Y3GU!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F053815a6-26d7-4cc6-8ce3-b08fe91844f2_3812x1349.jpeg 848w, https://substackcdn.com/image/fetch/$s_!Y3GU!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F053815a6-26d7-4cc6-8ce3-b08fe91844f2_3812x1349.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!Y3GU!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F053815a6-26d7-4cc6-8ce3-b08fe91844f2_3812x1349.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Y3GU!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F053815a6-26d7-4cc6-8ce3-b08fe91844f2_3812x1349.jpeg" width="1456" height="515" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/053815a6-26d7-4cc6-8ce3-b08fe91844f2_3812x1349.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:515,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:749380,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/192873052?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F053815a6-26d7-4cc6-8ce3-b08fe91844f2_3812x1349.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Y3GU!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F053815a6-26d7-4cc6-8ce3-b08fe91844f2_3812x1349.jpeg 424w, https://substackcdn.com/image/fetch/$s_!Y3GU!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F053815a6-26d7-4cc6-8ce3-b08fe91844f2_3812x1349.jpeg 848w, https://substackcdn.com/image/fetch/$s_!Y3GU!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F053815a6-26d7-4cc6-8ce3-b08fe91844f2_3812x1349.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!Y3GU!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F053815a6-26d7-4cc6-8ce3-b08fe91844f2_3812x1349.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Agent Harness: Managed Service or Custom (<strong><a href="https://gradientflow.com/wp-content/uploads/2026/04/Harness-&#8212;-Managed-or-Custom.jpeg">enlarge</a></strong>)</figcaption></figure></div><p>Building this infrastructure requires upfront investment in time and discipline, but the alternative is a system that generates technical debt and silent errors at machine speed. A well-engineered harness is the only mechanism that allows an organization to capture the productivity gains of autonomous AI without sacrificing the safety and quality of its most critical operations.</p><div><hr></div><h1>Quick Takes</h1><div id="youtube2-8RWNu9VzirY" class="youtube-wrap" data-attrs="{&quot;videoId&quot;:&quot;8RWNu9VzirY&quot;,&quot;startTime&quot;:&quot;13&quot;,&quot;endTime&quot;:null}" data-component-name="Youtube2ToDOM"><div class="youtube-inner"><iframe src="https://www.youtube-nocookie.com/embed/8RWNu9VzirY?start=13&amp;rel=0&amp;autoplay=0&amp;showinfo=0&amp;enablejsapi=0" frameborder="0" loading="lazy" gesture="media" allow="autoplay; fullscreen" allowautoplay="true" allowfullscreen="true" width="728" height="409"></iframe></div></div><ol><li><p><strong><a href="https://youtu.be/8RWNu9VzirY?t=13">AI Committees: Accelerant or Anchor?</a></strong></p></li><li><p><strong><a href="https://youtu.be/8RWNu9VzirY?t=1004">The New Rules of AI Information Architecture</a></strong></p></li></ol><div><hr></div><p></p><p><em><strong><a href="https://gradientflow.com/disclosure/">Ben Lorica</a></strong> edits <a href="https://ethics.dev/">Ethics.dev</a> and the <a href="https://gradientflow.substack.com/">Gradient Flow newsletter</a>, and he hosts the <strong><a href="https://thedataexchange.media/">Data Exchange podcast</a></strong>. He helps organize the <strong><a href="https://aiconference.com/?utm_source=gradientflow&amp;utm_medium=newsletter">AI Conference</a></strong> and the <strong><a href="https://agentconference.com/?utm_source=gradientflow&amp;utm_medium=newsletter">AI Agent Conference</a></strong>, while also serving as the Strategic Content Chair for AI at the <strong><a href="https://events.linuxfoundation.org/">Linux Foundation</a></strong>. You can follow him on <a href="https://www.linkedin.com/in/benlorica/">Linkedin</a>, <a href="https://x.com/bigdata">X</a>, <a href="https://indieweb.social/@bigdata">Mastodon</a>, <a href="https://www.reddit.com/r/GradientFlow/">Reddit</a>, <a href="https://bsky.app/profile/gradientflow.com">Bluesky</a>, <a href="https://www.youtube.com/c/GradientFlow">YouTube</a>, or <a href="https://www.tiktok.com/@gradientflow">TikTok</a>. This newsletter is produced by <a href="https://gradientflow.com/blog/">Gradient Flow</a>.</em></p>]]></content:encoded></item><item><title><![CDATA[Shadow IT is back, and this time it has admin access]]></title><description><![CDATA[Put data, machine learning, and AI to work.]]></description><link>https://gradientflow.substack.com/p/your-employees-are-already-using</link><guid isPermaLink="false">https://gradientflow.substack.com/p/your-employees-are-already-using</guid><dc:creator><![CDATA[Ben Lorica 罗瑞卡]]></dc:creator><pubDate>Tue, 14 Apr 2026 13:01:03 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/0efa042d-0174-45b6-9644-b181a89c2e15_1171x816.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://gradientflow.com/newsletter/" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Qpug!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 424w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 848w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 1272w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Qpug!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png" width="1100" height="220" data-attrs="{&quot;src&quot;:&quot;https://bucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com/public/images/0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:220,&quot;width&quot;:1100,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:24747,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:&quot;https://gradientflow.com/newsletter/&quot;,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!Qpug!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 424w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 848w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 1272w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a></figure></div><p><strong><a href="https://gradientflow.substack.com/subscribe">Subscribe</a> &#8226;<a href="https://gradientflow.substack.com/"> Previous Issues</a></strong></p><h1><strong>What Is An AI Delegate?</strong></h1><p><a href="https://github.com/openclaw/openclaw">OpenClaw</a> arrived with little fanfare and quickly became the fastest growing open source AI project on record. At its core, OpenClaw is a personal autonomous agent framework. It is software that connects a large language model to your email, calendar, file system, messaging apps, and external APIs, then acts on your behalf across all of them simultaneously without waiting to be asked. OpenClaw did not just answer questions. It completed work. That distinction resonated immediately with developers and technically sophisticated early adopters who had grown frustrated with AI tools that stopped at the edge of a chat window.</p><p>The adoption rate outpaced the security reviews, and that gap mattered. OpenClaw&#8217;s open architecture made it composable and extensible, but it also made it trivially easy to deploy with over-permissioned access, no audit logging, and no vetting of community contributed skill modules. The documented <a href="https://www.koi.ai/blog/clawhavoc-341-malicious-clawedbot-skills-found-by-the-bot-they-were-targeting">ClawHavoc supply chain attack</a>, in which malicious skills compromised thousands of installations, was a predictable outcome of consumer grade architecture deployed without enterprise grade controls. Industry analysts flagged the category as an unacceptable cybersecurity risk for enterprise use. None of the security alerts and warnings stopped adoption. The excitement around what OpenClaw enables simply outpaced concern about what it could expose.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://gradientflow.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption"><em>Regular reader? Consider becoming a paid supporter &#128591;</em></p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p>What followed was a wave of systems that took the core concept of OpenClaw and addressed its roughest edges while extending its reach. <a href="https://code.claude.com/docs/en/channels-reference">Claude Code Channels</a> embeds autonomous execution directly inside Anthropic&#8217;s interface, adding structured reasoning traces and tighter permission boundaries that make the system&#8217;s decision making more inspectable. <a href="https://www.nvidia.com/en-us/ai/nemoclaw/">NemoClaw</a> targets local deployment on dedicated hardware, bringing meaningful on device performance to users in regulated industries who cannot route sensitive data through cloud providers. <a href="https://www.genspark.ai/genspark-claw">GenSpark Claw</a> shifts toward a managed experience, abstracting away the complex container configuration and credential management that made raw OpenClaw inaccessible to non technical users. It also layers in role based access controls and compliance oriented audit trails. Together, these systems are converging on a new operational paradigm that deserves its own category name: the <strong>AI Delegate</strong>. Rather than functioning as traditional software, AI Delegates operate as persistent, action-taking systems that work on a human&#8217;s behalf across applications and time.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!6jry!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84ced735-eb62-4da0-b0ff-895352276503_1837x643.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!6jry!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84ced735-eb62-4da0-b0ff-895352276503_1837x643.jpeg 424w, https://substackcdn.com/image/fetch/$s_!6jry!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84ced735-eb62-4da0-b0ff-895352276503_1837x643.jpeg 848w, https://substackcdn.com/image/fetch/$s_!6jry!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84ced735-eb62-4da0-b0ff-895352276503_1837x643.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!6jry!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84ced735-eb62-4da0-b0ff-895352276503_1837x643.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!6jry!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84ced735-eb62-4da0-b0ff-895352276503_1837x643.jpeg" width="1456" height="510" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/84ced735-eb62-4da0-b0ff-895352276503_1837x643.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:510,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:171821,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/192153974?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84ced735-eb62-4da0-b0ff-895352276503_1837x643.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!6jry!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84ced735-eb62-4da0-b0ff-895352276503_1837x643.jpeg 424w, https://substackcdn.com/image/fetch/$s_!6jry!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84ced735-eb62-4da0-b0ff-895352276503_1837x643.jpeg 848w, https://substackcdn.com/image/fetch/$s_!6jry!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84ced735-eb62-4da0-b0ff-895352276503_1837x643.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!6jry!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84ced735-eb62-4da0-b0ff-895352276503_1837x643.jpeg 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">(<strong><a href="https://gradientflow.com/wp-content/uploads/2026/04/AI-Delegate-%E2%80%94-key-features.jpeg">enlarge</a></strong>)</figcaption></figure></div><p>An <strong>AI Delegate</strong> (often functioning as <strong>&#8216;Digital Staff&#8217;</strong>) is built on the four core pillars of Intelligence, Memory, Architecture, and Governance, and is defined by the following key features:</p><ul><li><p><strong>Goal-Driven Autonomy.</strong> Interprets high-level human goals, breaks them into sub-tasks, executes across connected systems, and retries or adapts when steps fail. The unit of value is a completed workflow, not a text response.</p></li><li><p><strong>Ambient Presence.</strong> Lives inside messaging platforms users already use, such as WhatsApp, Slack, Telegram, or iMessage, providing an always-on experience with zero friction or context-switching.</p></li><li><p><strong>Persistent Memory.</strong> Maintains long-term context across all sessions <a href="https://lancedb.com/blog/openclaw-lancedb-memory-layer/?utm_source=gradientflow&amp;utm_medium=newsletter">using systems like LanceDB</a> and vector embeddings. Remembers preferences, past decisions, and ongoing projects without being re-instructed.</p></li><li><p><strong>Proactive Heartbeat Execution.</strong> Operates on scheduled cycles without waiting to be prompted: it runs briefings, audits, consolidations, and background tasks autonomously.</p></li><li><p><strong>Dynamic Tool Construction.</strong> Builds its own integrations and scripts on the fly when existing tools are insufficient. Capability surface expands with every new task encountered.</p></li><li><p><strong>Cross-System Orchestration.</strong> Coordinates actions across databases, APIs, file systems, CRMs, and external services within a single workflow, serving as the integration layer so the human doesn&#8217;t have to.</p></li><li><p><strong>Modular, Open Architecture.</strong> Separates the reasoning layer (LLM) from the execution layer (tools, APIs), facilitating model swaps, new integrations, and flexible deployment without vendor lock-in.</p></li><li><p><strong>Local-First Deployment.</strong> Runs on the user&#8217;s own hardware by default, keeping data under user control. Cloud-managed variants exist but local-first is the defining architectural option.</p></li><li><p><strong>Governed Identity and Delegation.</strong> Operates under its own scoped service identity rather than borrowed human credentials, utilizing explicit handoff maps to define what the agent owns, what requires human approval, and what is never delegated.</p></li><li><p><strong>Multi-Agent Collaboration.</strong> Architected for teams of specialized agents (planner, executor, validator) that delegate to one another, check each other&#8217;s work, and operate in parallel.</p></li></ul><h4>Crossing the Threshold to Enterprise Operations</h4><p>The trajectory of these AI Delegates mirrors a familiar pattern in enterprise technology adoption. A tool emerges in the developer community, proves its value through grassroots experimentation, and eventually forces organizations to either adopt it deliberately or manage it as shadow IT. The emergence of managed platforms and no code interfaces signals that this category is moving toward professional users much faster than most organizations realize. The organizations that capture the most value from AI will not be those that ship chatbot wrappers around foundation models, but those that embed agent level autonomy into products capable of handling unstructured cross system tasks at scale.</p><p>Consider the work that currently requires a human to manually bridge between a CRM, an inbox, a calendar, and a data warehouse. Managing sales outreach, sorting customer issues, summarizing market research, and setting up new accounts all involve many moving parts, but they follow predictable rules that rarely require constant human oversight. They are exactly the kind of multi system, goal directed workflows that AI Delegates are architecturally built to own end to end. These systems do not just make existing tasks faster. They replace entire task bundles that previously required dedicated staff. Organizations that measure the value of AI Delegates by chat <a href="https://decagon.ai/glossary/deflection-rate">deflection rates</a> or response times are using the wrong yardstick. The real return is measured at the workflow or role level, and the enterprises that internalize that distinction early will redesign operations around it rather than layering AI Delegates onto legacy processes that were never built for delegation.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!EtgX!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f2bc338-3fed-4b0b-87a6-6c718fe1f1d7_1885x1078.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!EtgX!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f2bc338-3fed-4b0b-87a6-6c718fe1f1d7_1885x1078.jpeg 424w, https://substackcdn.com/image/fetch/$s_!EtgX!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f2bc338-3fed-4b0b-87a6-6c718fe1f1d7_1885x1078.jpeg 848w, https://substackcdn.com/image/fetch/$s_!EtgX!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f2bc338-3fed-4b0b-87a6-6c718fe1f1d7_1885x1078.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!EtgX!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f2bc338-3fed-4b0b-87a6-6c718fe1f1d7_1885x1078.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!EtgX!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f2bc338-3fed-4b0b-87a6-6c718fe1f1d7_1885x1078.jpeg" width="1456" height="833" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/9f2bc338-3fed-4b0b-87a6-6c718fe1f1d7_1885x1078.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:833,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:276349,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/192153974?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f2bc338-3fed-4b0b-87a6-6c718fe1f1d7_1885x1078.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!EtgX!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f2bc338-3fed-4b0b-87a6-6c718fe1f1d7_1885x1078.jpeg 424w, https://substackcdn.com/image/fetch/$s_!EtgX!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f2bc338-3fed-4b0b-87a6-6c718fe1f1d7_1885x1078.jpeg 848w, https://substackcdn.com/image/fetch/$s_!EtgX!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f2bc338-3fed-4b0b-87a6-6c718fe1f1d7_1885x1078.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!EtgX!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f2bc338-3fed-4b0b-87a6-6c718fe1f1d7_1885x1078.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">(<strong><a href="https://gradientflow.com/wp-content/uploads/2026/04/AI-Delegate-%E2%80%94-before-and-after.jpeg">enlarge</a></strong>)</figcaption></figure></div><p>Making AI Delegates genuinely enterprise grade requires closing a set of gaps that consumer deployments have not had to address. The documented <a href="https://www.koi.ai/blog/clawhavoc-341-malicious-clawedbot-skills-found-by-the-bot-they-were-targeting">ClawHavoc supply chain attack</a> was not an anomaly. It was a preview of what happens when systems with broad tool access and cross system execution rights are deployed without enterprise controls. Responsible deployment demands treating AI Delegates as governed identities rather than simple software tools. They must be provisioned with their own scoped service identities, least privilege access, and continuous behavioral monitoring so security tools can distinguish agent activity from legitimate user behavior.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!5ixZ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faf21c27a-dbeb-4429-8dc7-aa92226b1b85_3944x1979.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!5ixZ!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faf21c27a-dbeb-4429-8dc7-aa92226b1b85_3944x1979.jpeg 424w, https://substackcdn.com/image/fetch/$s_!5ixZ!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faf21c27a-dbeb-4429-8dc7-aa92226b1b85_3944x1979.jpeg 848w, https://substackcdn.com/image/fetch/$s_!5ixZ!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faf21c27a-dbeb-4429-8dc7-aa92226b1b85_3944x1979.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!5ixZ!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faf21c27a-dbeb-4429-8dc7-aa92226b1b85_3944x1979.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!5ixZ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faf21c27a-dbeb-4429-8dc7-aa92226b1b85_3944x1979.jpeg" width="1456" height="731" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/af21c27a-dbeb-4429-8dc7-aa92226b1b85_3944x1979.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:731,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1034017,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/192153974?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faf21c27a-dbeb-4429-8dc7-aa92226b1b85_3944x1979.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!5ixZ!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faf21c27a-dbeb-4429-8dc7-aa92226b1b85_3944x1979.jpeg 424w, https://substackcdn.com/image/fetch/$s_!5ixZ!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faf21c27a-dbeb-4429-8dc7-aa92226b1b85_3944x1979.jpeg 848w, https://substackcdn.com/image/fetch/$s_!5ixZ!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faf21c27a-dbeb-4429-8dc7-aa92226b1b85_3944x1979.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!5ixZ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faf21c27a-dbeb-4429-8dc7-aa92226b1b85_3944x1979.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">(<strong><a href="https://gradientflow.com/wp-content/uploads/2026/04/AI-Delegates-failure-modes.jpeg">enlarge</a></strong>)</figcaption></figure></div><p>Furthermore, delegation must be formalized through explicit handoff maps that define exactly what the system owns end to end and where a human must review the work before the AI Delegate proceeds. Because a single error can propagate widely, enterprise architectures must also include sandboxed execution environments, explicit rollback mechanisms, and durable state management to handle branching and failure recovery. Finally, user interfaces must include trust calibration mechanisms that give humans an accurate picture of what the delegate is doing and why. Overconfidence in agent capabilities leads to under supervision, and under supervision is where the real enterprise liability lives. The organizations that move now to establish this governed, observable, and human in the loop infrastructure will define the competitive landscape.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!jYa_!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7c0416ad-df26-4b91-81fa-6bba3f75c5ef_1126x1041.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!jYa_!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7c0416ad-df26-4b91-81fa-6bba3f75c5ef_1126x1041.jpeg 424w, https://substackcdn.com/image/fetch/$s_!jYa_!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7c0416ad-df26-4b91-81fa-6bba3f75c5ef_1126x1041.jpeg 848w, https://substackcdn.com/image/fetch/$s_!jYa_!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7c0416ad-df26-4b91-81fa-6bba3f75c5ef_1126x1041.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!jYa_!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7c0416ad-df26-4b91-81fa-6bba3f75c5ef_1126x1041.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!jYa_!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7c0416ad-df26-4b91-81fa-6bba3f75c5ef_1126x1041.jpeg" width="536" height="495.53818827708704" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/7c0416ad-df26-4b91-81fa-6bba3f75c5ef_1126x1041.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1041,&quot;width&quot;:1126,&quot;resizeWidth&quot;:536,&quot;bytes&quot;:197382,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/192153974?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7c0416ad-df26-4b91-81fa-6bba3f75c5ef_1126x1041.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!jYa_!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7c0416ad-df26-4b91-81fa-6bba3f75c5ef_1126x1041.jpeg 424w, https://substackcdn.com/image/fetch/$s_!jYa_!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7c0416ad-df26-4b91-81fa-6bba3f75c5ef_1126x1041.jpeg 848w, https://substackcdn.com/image/fetch/$s_!jYa_!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7c0416ad-df26-4b91-81fa-6bba3f75c5ef_1126x1041.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!jYa_!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7c0416ad-df26-4b91-81fa-6bba3f75c5ef_1126x1041.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">(<strong><a href="https://gradientflow.com/wp-content/uploads/2026/04/AI-Delegate-%E2%80%94-enterpise-requirements.jpeg">enlarge</a></strong>)</figcaption></figure></div><h4>Capable Today, Production-Ready Tomorrow</h4><p>We are still in the absolute earliest days of the AI Delegate paradigm. These systems are already demonstrating real utility in automating complex, cross-system workflows, but their underlying architectures remain raw and require significant hardening before they can be deployed safely at enterprise scale. The immediate future of this category will not be defined by flashy new reasoning capabilities. It will be defined by the unglamorous work of enterprise readiness: formal security certifications, governed agent identities, durable state management, and the organizational habits that make delegation safe rather than just fast.</p><p>Looking ahead, the roadmap points toward standardized protocols for multi-agent communication, polished interfaces that replace brittle command-line setups, and deep interoperability with the enterprise data systems where real operational value lives.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!nvdP!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb0640ebb-1863-4a7d-90f3-5c676959bc27_1831x968.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!nvdP!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb0640ebb-1863-4a7d-90f3-5c676959bc27_1831x968.jpeg 424w, https://substackcdn.com/image/fetch/$s_!nvdP!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb0640ebb-1863-4a7d-90f3-5c676959bc27_1831x968.jpeg 848w, https://substackcdn.com/image/fetch/$s_!nvdP!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb0640ebb-1863-4a7d-90f3-5c676959bc27_1831x968.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!nvdP!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb0640ebb-1863-4a7d-90f3-5c676959bc27_1831x968.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!nvdP!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb0640ebb-1863-4a7d-90f3-5c676959bc27_1831x968.jpeg" width="1456" height="770" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b0640ebb-1863-4a7d-90f3-5c676959bc27_1831x968.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:770,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:354068,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/192153974?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb0640ebb-1863-4a7d-90f3-5c676959bc27_1831x968.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!nvdP!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb0640ebb-1863-4a7d-90f3-5c676959bc27_1831x968.jpeg 424w, https://substackcdn.com/image/fetch/$s_!nvdP!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb0640ebb-1863-4a7d-90f3-5c676959bc27_1831x968.jpeg 848w, https://substackcdn.com/image/fetch/$s_!nvdP!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb0640ebb-1863-4a7d-90f3-5c676959bc27_1831x968.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!nvdP!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb0640ebb-1863-4a7d-90f3-5c676959bc27_1831x968.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">(<strong><a href="https://gradientflow.com/wp-content/uploads/2026/04/AI-Delegate-%E2%80%94-roadmap.jpeg">enlarge</a></strong>)</figcaption></figure></div><div><hr></div><h1>&#127895;&#65039;<a href="https://www.agentconference.com/?utm_source=gradientflow&amp;utm_medium=newsletter">See You in NYC!</a></h1><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://www.agentconference.com/?utm_source=gradientflow&amp;utm_medium=newsletter" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!zSo3!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F111d1b02-b56a-410a-94c4-a86025823063_1908x1069.jpeg 424w, https://substackcdn.com/image/fetch/$s_!zSo3!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F111d1b02-b56a-410a-94c4-a86025823063_1908x1069.jpeg 848w, https://substackcdn.com/image/fetch/$s_!zSo3!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F111d1b02-b56a-410a-94c4-a86025823063_1908x1069.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!zSo3!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F111d1b02-b56a-410a-94c4-a86025823063_1908x1069.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!zSo3!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F111d1b02-b56a-410a-94c4-a86025823063_1908x1069.jpeg" width="1456" height="816" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/111d1b02-b56a-410a-94c4-a86025823063_1908x1069.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:816,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:496950,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:&quot;https://www.agentconference.com/?utm_source=gradientflow&amp;utm_medium=newsletter&quot;,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/192153974?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F111d1b02-b56a-410a-94c4-a86025823063_1908x1069.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!zSo3!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F111d1b02-b56a-410a-94c4-a86025823063_1908x1069.jpeg 424w, https://substackcdn.com/image/fetch/$s_!zSo3!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F111d1b02-b56a-410a-94c4-a86025823063_1908x1069.jpeg 848w, https://substackcdn.com/image/fetch/$s_!zSo3!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F111d1b02-b56a-410a-94c4-a86025823063_1908x1069.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!zSo3!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F111d1b02-b56a-410a-94c4-a86025823063_1908x1069.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.agentconference.com/?utm_source=gradientflow&amp;utm_medium=newsletter&quot;,&quot;text&quot;:&quot;Register Now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.agentconference.com/?utm_source=gradientflow&amp;utm_medium=newsletter"><span>Register Now</span></a></p><div><hr></div><p></p><p><em><strong><a href="https://gradientflow.com/disclosure/">Ben Lorica</a></strong> edits <a href="https://ethics.dev/">Ethics.dev</a> and the <a href="https://gradientflow.substack.com/">Gradient Flow newsletter</a>, and he hosts the <strong><a href="https://thedataexchange.media/">Data Exchange podcast</a></strong>. He helps organize the <strong><a href="https://aiconference.com/?utm_source=gradientflow&amp;utm_medium=newsletter">AI Conference</a></strong> and the <strong><a href="https://agentconference.com/?utm_source=gradientflow&amp;utm_medium=newsletter">AI Agent Conference</a></strong>, while also serving as the Strategic Content Chair for AI at the <strong><a href="https://events.linuxfoundation.org/">Linux Foundation</a></strong>. You can follow him on <a href="https://www.linkedin.com/in/benlorica/">Linkedin</a>, <a href="https://x.com/bigdata">X</a>, <a href="https://indieweb.social/@bigdata">Mastodon</a>, <a href="https://www.reddit.com/r/GradientFlow/">Reddit</a>, <a href="https://bsky.app/profile/gradientflow.com">Bluesky</a>, <a href="https://www.youtube.com/c/GradientFlow">YouTube</a>, or <a href="https://www.tiktok.com/@gradientflow">TikTok</a>. This newsletter is produced by <a href="https://gradientflow.com/blog/">Gradient Flow</a>.</em></p>]]></content:encoded></item><item><title><![CDATA[AI is describing your competitors better than you. Here's why.]]></title><description><![CDATA[Put data, machine learning, and AI to work.]]></description><link>https://gradientflow.substack.com/p/ai-is-describing-your-competitors</link><guid isPermaLink="false">https://gradientflow.substack.com/p/ai-is-describing-your-competitors</guid><dc:creator><![CDATA[Ben Lorica 罗瑞卡]]></dc:creator><pubDate>Tue, 07 Apr 2026 13:01:09 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/d94242d2-dca8-4994-a978-ef72854ff254_1087x891.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://gradientflow.com/newsletter/" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Qpug!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 424w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 848w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 1272w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Qpug!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png" width="1100" height="220" data-attrs="{&quot;src&quot;:&quot;https://bucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com/public/images/0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:220,&quot;width&quot;:1100,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:24747,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:&quot;https://gradientflow.com/newsletter/&quot;,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!Qpug!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 424w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 848w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 1272w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a></figure></div><p><strong><a href="https://gradientflow.substack.com/subscribe">Subscribe</a> &#8226;<a href="https://gradientflow.substack.com/"> Previous Issues</a></strong></p><h1><strong>The AI Visibility Playbook: Surviving the Shift from Search to Synthesis</strong></h1><p>More people are turning to AI chatbots instead of traditional search engines to find information online. Even Google now displays an AI Overview at the top of many search results to summarize answers directly. When search engines ruled the internet, search engine optimization (SEO) served as the primary tool for companies to manage their brand reputation and ensure digital visibility. Now that foundation models synthesize responses rather than pointing users to external pages, SEO is being updated. Business leaders are adopting new strategies to influence the underlying training data and retrieval mechanisms of these AI systems, with the goal of keeping their brands visible and, just as importantly, accurately represented.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://gradientflow.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption"><em>Gradient Flow is a reader-supported publication, consider becoming a paid subscriber.</em></p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p>The shift in digital visibility centers around Generative Engine Optimization (<strong>GEO</strong>). Unlike traditional optimization that targets link rankings, this practice aims to influence the neural patterns and retrieval logic of foundation models. The goal is to ensure generative AI systems select and accurately cite a brand when synthesizing responses. Several related terms fall under this umbrella. Answer Engine Optimization (<strong>AEO</strong>) is the process of optimizing content to fit the conversational nature of modern AI, making it more likely to be featured in direct, synthesized answers. Artificial Intelligence Optimization (<strong>AIO</strong>) acts as a broader catchall label for these practices. Meanwhile, some marketers use Large Language Model Optimization (<strong>LLMO</strong>) to describe the exact same goal of shaping internal model weights and training data. These terms are often used interchangeably, which reflects how early-stage and unsettled the field still is.</p><p>As these practices mature, they are fracturing into specialized domains and deeper conceptual frameworks. E-commerce GEO (<strong>E-GEO</strong>) explores whether product listings and reviews require distinct optimization tactics or if standard strategies work across all industries. Beyond mere visibility, practitioners are identifying new challenges like <strong>Discovery with Exact Definition</strong>. This concept highlights the gap between an AI retrieving brand information and actually representing it with contextual accuracy. A model can cite a company and still get it wrong. When that happens at scale, companies face what one researcher has called <strong>Dark Revenue Loss</strong>, the invisible financial damage and eroded customer trust that occurs entirely within AI-mediated conversations and never surfaces in standard analytics dashboards.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://gradientflow.com/wp-content/uploads/2026/04/GEO-terms.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!z5wa!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fef30e5f2-9760-4d98-b77d-10b3f95b7249_794x939.jpeg 424w, https://substackcdn.com/image/fetch/$s_!z5wa!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fef30e5f2-9760-4d98-b77d-10b3f95b7249_794x939.jpeg 848w, https://substackcdn.com/image/fetch/$s_!z5wa!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fef30e5f2-9760-4d98-b77d-10b3f95b7249_794x939.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!z5wa!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fef30e5f2-9760-4d98-b77d-10b3f95b7249_794x939.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!z5wa!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fef30e5f2-9760-4d98-b77d-10b3f95b7249_794x939.jpeg" width="370" height="437.5692695214106" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ef30e5f2-9760-4d98-b77d-10b3f95b7249_794x939.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:939,&quot;width&quot;:794,&quot;resizeWidth&quot;:370,&quot;bytes&quot;:116642,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:&quot;https://gradientflow.com/wp-content/uploads/2026/04/GEO-terms.jpeg&quot;,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/191426261?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fef30e5f2-9760-4d98-b77d-10b3f95b7249_794x939.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!z5wa!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fef30e5f2-9760-4d98-b77d-10b3f95b7249_794x939.jpeg 424w, https://substackcdn.com/image/fetch/$s_!z5wa!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fef30e5f2-9760-4d98-b77d-10b3f95b7249_794x939.jpeg 848w, https://substackcdn.com/image/fetch/$s_!z5wa!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fef30e5f2-9760-4d98-b77d-10b3f95b7249_794x939.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!z5wa!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fef30e5f2-9760-4d98-b77d-10b3f95b7249_794x939.jpeg 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Addressing that invisible leakage requires more than content tweaks. A proposed architectural concept called <strong>GEO Core</strong> envisions a dedicated brand governance infrastructure layer, something analogous to a CRM but built to operate across retrieval-augmented generation (RAG) pipelines, chatbots, and AI agents, ensuring that what a model believes about a company matches what the company actually is. While I am not aware of any commercial implementations yet, this concept points toward a future where managing AI-mediated brand identity is treated as an enterprise infrastructure problem rather than a marketing one.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!7sWs!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F21109a01-4eb6-4b91-9276-5877d576b3d3_1416x1010.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!7sWs!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F21109a01-4eb6-4b91-9276-5877d576b3d3_1416x1010.jpeg 424w, https://substackcdn.com/image/fetch/$s_!7sWs!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F21109a01-4eb6-4b91-9276-5877d576b3d3_1416x1010.jpeg 848w, https://substackcdn.com/image/fetch/$s_!7sWs!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F21109a01-4eb6-4b91-9276-5877d576b3d3_1416x1010.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!7sWs!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F21109a01-4eb6-4b91-9276-5877d576b3d3_1416x1010.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!7sWs!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F21109a01-4eb6-4b91-9276-5877d576b3d3_1416x1010.jpeg" width="396" height="282.45762711864404" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/21109a01-4eb6-4b91-9276-5877d576b3d3_1416x1010.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1010,&quot;width&quot;:1416,&quot;resizeWidth&quot;:396,&quot;bytes&quot;:125236,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/191426261?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F21109a01-4eb6-4b91-9276-5877d576b3d3_1416x1010.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!7sWs!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F21109a01-4eb6-4b91-9276-5877d576b3d3_1416x1010.jpeg 424w, https://substackcdn.com/image/fetch/$s_!7sWs!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F21109a01-4eb6-4b91-9276-5877d576b3d3_1416x1010.jpeg 848w, https://substackcdn.com/image/fetch/$s_!7sWs!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F21109a01-4eb6-4b91-9276-5877d576b3d3_1416x1010.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!7sWs!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F21109a01-4eb6-4b91-9276-5877d576b3d3_1416x1010.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h4>The AI Visibility Operations Playbook</h4><p>We are still in the early days of this shift, but practitioners are already deploying concrete strategies to adapt. Here are <strong>some of the techniques teams are using right now</strong> to ensure their brands surface accurately in AI-generated answers.</p><p><strong>Crawler access configuration.</strong> Before an AI can summarize your content, it has to be able to read it. Many legacy website configurations accidentally block AI-specific web crawlers like the ones used by OpenAI or Google. Fixing this by updating a site&#8217;s robots file to explicitly allow these bots is the lowest-effort, highest-leverage action a team can take today, and it should happen before any other investment in this space.</p><p><strong>AI brand auditing.</strong> Teams must establish a baseline by directly querying chatbots to see how their brand is currently represented in terms of accuracy, sentiment, and framing. Commercial visibility toolkits from major search marketing platforms now support this kind of tracking across citation frequency and competitive positioning. Without this baseline you are optimizing blind, and most organizations are still skipping it.</p><div class="pullquote"><p>This isn&#8217;t a marketing problem anymore. Managing your brand in the AI era is an enterprise infrastructure challenge.</p></div><p><strong>Citation engineering.</strong> This technique involves writing content in self-contained blocks of 30 to 60 words that pair a specific data point with clear reasoning, so an AI can extract the passage directly without needing to reinterpret it. Research demonstrated that this structural approach can increase visibility in generative results by up to 40 percent. The uncomfortable implication is that the best-structured source often beats the most accurate one in AI-mediated information delivery.</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!5kE5!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F81108f80-055e-49da-baf7-9015ceaa0015_1346x607.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!5kE5!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F81108f80-055e-49da-baf7-9015ceaa0015_1346x607.jpeg 424w, https://substackcdn.com/image/fetch/$s_!5kE5!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F81108f80-055e-49da-baf7-9015ceaa0015_1346x607.jpeg 848w, https://substackcdn.com/image/fetch/$s_!5kE5!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F81108f80-055e-49da-baf7-9015ceaa0015_1346x607.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!5kE5!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F81108f80-055e-49da-baf7-9015ceaa0015_1346x607.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!5kE5!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F81108f80-055e-49da-baf7-9015ceaa0015_1346x607.jpeg" width="476" height="214.65973254086182" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/81108f80-055e-49da-baf7-9015ceaa0015_1346x607.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:607,&quot;width&quot;:1346,&quot;resizeWidth&quot;:476,&quot;bytes&quot;:123156,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/191426261?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F81108f80-055e-49da-baf7-9015ceaa0015_1346x607.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!5kE5!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F81108f80-055e-49da-baf7-9015ceaa0015_1346x607.jpeg 424w, https://substackcdn.com/image/fetch/$s_!5kE5!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F81108f80-055e-49da-baf7-9015ceaa0015_1346x607.jpeg 848w, https://substackcdn.com/image/fetch/$s_!5kE5!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F81108f80-055e-49da-baf7-9015ceaa0015_1346x607.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!5kE5!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F81108f80-055e-49da-baf7-9015ceaa0015_1346x607.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p><strong>Natural language FAQ optimization.</strong> Users talk to chatbots differently than they type keywords into a search box, and content needs to reflect that conversational reality. By auditing the exact phrasing people use when prompting AI systems and creating matching question-and-answer pairs, teams make it trivially easy for the retrieval system to select their content. Adding just three to five of these natural language questions to an existing page can meaningfully increase the probability of a direct citation.</p><p><strong>Machine-readable structured data.</strong> Implementing schema markup translates human-readable content into explicit metadata that AI systems can parse with precision, signaling what a page is about, who wrote it, and how it should be classified. This is technical hygiene that has been standard SEO practice for years, but its value is amplified in AI retrieval contexts because generative systems benefit even more from explicit structural signals than traditional search crawlers did. If your content management system supports it, there is no defensible reason not to implement it.</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!RxSh!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffecd76fb-e4ad-4cbd-9b49-2934f87a4079_1647x766.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!RxSh!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffecd76fb-e4ad-4cbd-9b49-2934f87a4079_1647x766.jpeg 424w, https://substackcdn.com/image/fetch/$s_!RxSh!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffecd76fb-e4ad-4cbd-9b49-2934f87a4079_1647x766.jpeg 848w, https://substackcdn.com/image/fetch/$s_!RxSh!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffecd76fb-e4ad-4cbd-9b49-2934f87a4079_1647x766.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!RxSh!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffecd76fb-e4ad-4cbd-9b49-2934f87a4079_1647x766.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!RxSh!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffecd76fb-e4ad-4cbd-9b49-2934f87a4079_1647x766.jpeg" width="490" height="227.83653846153845" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/fecd76fb-e4ad-4cbd-9b49-2934f87a4079_1647x766.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:677,&quot;width&quot;:1456,&quot;resizeWidth&quot;:490,&quot;bytes&quot;:148296,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/191426261?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffecd76fb-e4ad-4cbd-9b49-2934f87a4079_1647x766.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!RxSh!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffecd76fb-e4ad-4cbd-9b49-2934f87a4079_1647x766.jpeg 424w, https://substackcdn.com/image/fetch/$s_!RxSh!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffecd76fb-e4ad-4cbd-9b49-2934f87a4079_1647x766.jpeg 848w, https://substackcdn.com/image/fetch/$s_!RxSh!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffecd76fb-e4ad-4cbd-9b49-2934f87a4079_1647x766.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!RxSh!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffecd76fb-e4ad-4cbd-9b49-2934f87a4079_1647x766.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p><strong>Platform-specific ecosystem tuning.</strong> Treating all AI models as a monolith is a reliable way to underperform, as one large financial technology company recently discovered firsthand. Each AI provider has distinct data dependencies: optimizing for one major search-backed AI requires strong traditional SEO fundamentals, while optimizing for a video-integrated AI model requires well-structured video transcripts and active business profiles. Teams should maintain separate playbooks for each platform rather than assuming a single strategy transfers across all of them.</p><p><strong>Distributed authority building.</strong> AI models look for consensus across multiple independent sources to determine what is credible about a company. Earning mentions, reviews, and coverage across authoritative publications and industry outlets provides the corroborating signals that increase a model&#8217;s confidence in citing your brand. This reframes traditional public relations as a direct optimization lever for AI retrieval, not just a brand awareness exercise.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!H3RG!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc938d86d-249c-4518-85e3-74e8d7ad74cf_1751x914.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!H3RG!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc938d86d-249c-4518-85e3-74e8d7ad74cf_1751x914.jpeg 424w, https://substackcdn.com/image/fetch/$s_!H3RG!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc938d86d-249c-4518-85e3-74e8d7ad74cf_1751x914.jpeg 848w, https://substackcdn.com/image/fetch/$s_!H3RG!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc938d86d-249c-4518-85e3-74e8d7ad74cf_1751x914.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!H3RG!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc938d86d-249c-4518-85e3-74e8d7ad74cf_1751x914.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!H3RG!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc938d86d-249c-4518-85e3-74e8d7ad74cf_1751x914.jpeg" width="1456" height="760" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/c938d86d-249c-4518-85e3-74e8d7ad74cf_1751x914.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:760,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:222915,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/191426261?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc938d86d-249c-4518-85e3-74e8d7ad74cf_1751x914.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!H3RG!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc938d86d-249c-4518-85e3-74e8d7ad74cf_1751x914.jpeg 424w, https://substackcdn.com/image/fetch/$s_!H3RG!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc938d86d-249c-4518-85e3-74e8d7ad74cf_1751x914.jpeg 848w, https://substackcdn.com/image/fetch/$s_!H3RG!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc938d86d-249c-4518-85e3-74e8d7ad74cf_1751x914.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!H3RG!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc938d86d-249c-4518-85e3-74e8d7ad74cf_1751x914.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><strong>Long-term training data seeding.</strong> Because foundation models are trained on massive public datasets, brands that maintain a consistent, high-quality presence across the open web are more likely to be inherently known by the model before any retrieval even occurs. To influence foundation models, teams should actively participate in community forums and secure a presence in structured databases like Wikidata and Wikipedia. This is a multi-year strategy, not a quick win, but the compounding effects across successive model generations make early investment disproportionately valuable.</p><h4>Vectors of AI-Mediated Brand Identity</h4><p>The optimization surface itself is quickly expanding well beyond text on a webpage. Because major AI platforms increasingly pull from diverse source types, strategy must grow to match: video transcripts need to be structured and accurate, image metadata needs to be clean and descriptive, and audio content needs to be organized for machine ingestion rather than just human listening. At the same time, <strong>as autonomous AI agents begin executing vendor evaluations and purchasing decisions on behalf of users, the relevant optimization target shifts from web pages to the tool-calling interfaces and data layers those agents actually query</strong>. The goal is no longer just helping a chatbot describe your company accurately. It is ensuring an autonomous system can find, evaluate, and transact with your business without friction or distortion.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!0Drx!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd1e2889c-f80a-4ab5-a9b8-bee877330b4f_1823x896.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!0Drx!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd1e2889c-f80a-4ab5-a9b8-bee877330b4f_1823x896.jpeg 424w, https://substackcdn.com/image/fetch/$s_!0Drx!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd1e2889c-f80a-4ab5-a9b8-bee877330b4f_1823x896.jpeg 848w, https://substackcdn.com/image/fetch/$s_!0Drx!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd1e2889c-f80a-4ab5-a9b8-bee877330b4f_1823x896.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!0Drx!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd1e2889c-f80a-4ab5-a9b8-bee877330b4f_1823x896.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!0Drx!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd1e2889c-f80a-4ab5-a9b8-bee877330b4f_1823x896.jpeg" width="1456" height="716" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d1e2889c-f80a-4ab5-a9b8-bee877330b4f_1823x896.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:716,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:249420,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/191426261?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd1e2889c-f80a-4ab5-a9b8-bee877330b4f_1823x896.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!0Drx!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd1e2889c-f80a-4ab5-a9b8-bee877330b4f_1823x896.jpeg 424w, https://substackcdn.com/image/fetch/$s_!0Drx!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd1e2889c-f80a-4ab5-a9b8-bee877330b4f_1823x896.jpeg 848w, https://substackcdn.com/image/fetch/$s_!0Drx!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd1e2889c-f80a-4ab5-a9b8-bee877330b4f_1823x896.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!0Drx!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd1e2889c-f80a-4ab5-a9b8-bee877330b4f_1823x896.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">(<strong><a href="https://gradientflow.com/wp-content/uploads/2026/04/GEO-landscape.jpeg">enlarge</a></strong>)</figcaption></figure></div><p>The window to build this foundation is closing faster than most organizations recognize. Analysts project a steep contraction in traditional search volume by 2026, which puts these new capabilities on a path from optional to mandatory in a short timeframe. <strong>The urgency compounds because newer AI models are frequently trained on outputs from prior generations, meaning early interventions reinforce themselves over time.</strong> Accurate brand representation established in today&#8217;s training data creates a self-reinforcing advantage that late movers will find genuinely difficult to close. Measurement is also maturing: tooling that explains why a retrieval system preferred one document over another is moving from research pipelines toward commercial platforms, which means the discipline is shifting from educated guesswork toward something testable and repeatable. Organizations that begin building these competencies now will be the ones that remain visible and accurately represented as the transition accelerates.</p><div><hr></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://bauplanlabs.github.io/SAO-workshop/?utm_source=gradientflow&amp;utm_medium=newsletter" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!sC7W!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F39690554-75ad-4147-9294-ac5c48a2b5af_2208x1224.jpeg 424w, https://substackcdn.com/image/fetch/$s_!sC7W!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F39690554-75ad-4147-9294-ac5c48a2b5af_2208x1224.jpeg 848w, https://substackcdn.com/image/fetch/$s_!sC7W!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F39690554-75ad-4147-9294-ac5c48a2b5af_2208x1224.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!sC7W!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F39690554-75ad-4147-9294-ac5c48a2b5af_2208x1224.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!sC7W!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F39690554-75ad-4147-9294-ac5c48a2b5af_2208x1224.jpeg" width="1456" height="807" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/39690554-75ad-4147-9294-ac5c48a2b5af_2208x1224.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:807,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:377132,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:&quot;https://bauplanlabs.github.io/SAO-workshop/?utm_source=gradientflow&amp;utm_medium=newsletter&quot;,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/191426261?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F39690554-75ad-4147-9294-ac5c48a2b5af_2208x1224.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!sC7W!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F39690554-75ad-4147-9294-ac5c48a2b5af_2208x1224.jpeg 424w, https://substackcdn.com/image/fetch/$s_!sC7W!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F39690554-75ad-4147-9294-ac5c48a2b5af_2208x1224.jpeg 848w, https://substackcdn.com/image/fetch/$s_!sC7W!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F39690554-75ad-4147-9294-ac5c48a2b5af_2208x1224.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!sC7W!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F39690554-75ad-4147-9294-ac5c48a2b5af_2208x1224.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://bauplanlabs.github.io/SAO-workshop/?utm_source=gradientflow&amp;utm_medium=newsletter&quot;,&quot;text&quot;:&quot;Learn More&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://bauplanlabs.github.io/SAO-workshop/?utm_source=gradientflow&amp;utm_medium=newsletter"><span>Learn More</span></a></p><div><hr></div><h1><a href="https://gradientflow.com/mcp-objections-and-complaints/">What Critics Get Right and Wrong About MCP</a></h1><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!8m0x!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcbc897a2-1586-430f-a05c-976fe38b2e9e_3582x2200.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!8m0x!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcbc897a2-1586-430f-a05c-976fe38b2e9e_3582x2200.jpeg 424w, https://substackcdn.com/image/fetch/$s_!8m0x!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcbc897a2-1586-430f-a05c-976fe38b2e9e_3582x2200.jpeg 848w, https://substackcdn.com/image/fetch/$s_!8m0x!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcbc897a2-1586-430f-a05c-976fe38b2e9e_3582x2200.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!8m0x!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcbc897a2-1586-430f-a05c-976fe38b2e9e_3582x2200.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!8m0x!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcbc897a2-1586-430f-a05c-976fe38b2e9e_3582x2200.jpeg" width="1456" height="894" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/cbc897a2-1586-430f-a05c-976fe38b2e9e_3582x2200.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:894,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:776353,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/191426261?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcbc897a2-1586-430f-a05c-976fe38b2e9e_3582x2200.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!8m0x!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcbc897a2-1586-430f-a05c-976fe38b2e9e_3582x2200.jpeg 424w, https://substackcdn.com/image/fetch/$s_!8m0x!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcbc897a2-1586-430f-a05c-976fe38b2e9e_3582x2200.jpeg 848w, https://substackcdn.com/image/fetch/$s_!8m0x!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcbc897a2-1586-430f-a05c-976fe38b2e9e_3582x2200.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!8m0x!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcbc897a2-1586-430f-a05c-976fe38b2e9e_3582x2200.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption"><strong>From: <a href="https://gradientflow.com/mcp-objections-and-complaints/">Why People Say &#8220;MCP Sucks&#8221;: Which Critiques Matter Most</a>  </strong>(<strong><a href="https://gradientflow.com/wp-content/uploads/2026/04/Model-Context-Protol-%E2%80%94-objections-and-complaints.jpeg">enlarge</a></strong>)</figcaption></figure></div><div><hr></div><p></p><p><em><strong><a href="https://gradientflow.com/disclosure/">Ben Lorica</a></strong> edits <a href="https://ethics.dev/">Ethics.dev</a> and the <a href="https://gradientflow.substack.com/">Gradient Flow newsletter</a>, and he hosts the <strong><a href="https://thedataexchange.media/">Data Exchange podcast</a></strong>. He helps organize the <strong><a href="https://aiconference.com/?utm_source=gradientflow&amp;utm_medium=newsletter">AI Conference</a></strong> and the <strong><a href="https://agentconference.com/?utm_source=gradientflow&amp;utm_medium=newsletter">AI Agent Conference</a></strong>, while also serving as the Strategic Content Chair for AI at the <strong><a href="https://events.linuxfoundation.org/">Linux Foundation</a></strong>. You can follow him on <a href="https://www.linkedin.com/in/benlorica/">Linkedin</a>, <a href="https://x.com/bigdata">X</a>, <a href="https://indieweb.social/@bigdata">Mastodon</a>, <a href="https://www.reddit.com/r/GradientFlow/">Reddit</a>, <a href="https://bsky.app/profile/gradientflow.com">Bluesky</a>, <a href="https://www.youtube.com/c/GradientFlow">YouTube</a>, or <a href="https://www.tiktok.com/@gradientflow">TikTok</a>. This newsletter is produced by <a href="https://gradientflow.com/blog/">Gradient Flow</a>.</em></p>]]></content:encoded></item><item><title><![CDATA[Why the heaviest AI users actually produce worse results 🤯]]></title><description><![CDATA[Put data, machine learning, and AI to work.]]></description><link>https://gradientflow.substack.com/p/are-you-using-ai-or-is-it-replacing</link><guid isPermaLink="false">https://gradientflow.substack.com/p/are-you-using-ai-or-is-it-replacing</guid><dc:creator><![CDATA[Ben Lorica 罗瑞卡]]></dc:creator><pubDate>Tue, 31 Mar 2026 13:02:20 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/6eeae668-dba7-4bed-89d3-56e9253581f5_992x886.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://gradientflow.com/newsletter/" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Qpug!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 424w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 848w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 1272w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Qpug!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png" width="1100" height="220" data-attrs="{&quot;src&quot;:&quot;https://bucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com/public/images/0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:220,&quot;width&quot;:1100,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:24747,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:&quot;https://gradientflow.com/newsletter/&quot;,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!Qpug!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 424w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 848w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 1272w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a></figure></div><p><strong><a href="https://gradientflow.substack.com/subscribe">Subscribe</a> &#8226;<a href="https://gradientflow.substack.com/"> Previous Issues</a></strong></p><h1><strong>How to Stay Employable When AI Is Coming for Your Job</strong></h1><p>Over the past few weeks, I have had a lot of conversations with people who are genuinely worried about what AI means for their careers. Not just developers, but marketers, analysts, lawyers, and others who are starting to wonder how much of their job will exist in 2-3 years. The anxiety is real and not entirely misplaced. <a href="https://www.psychiatrictimes.com/view/artificial-intelligence-job-loss-and-the-psychiatric-significance-of-work?utm_source=gradientflow&amp;utm_medium=newsletter">Psychiatrists studying AI-driven job loss are warning</a> about something beyond the usual economic disruption: they argue that serial job loss and chronic occupational uncertainty threaten the psychological foundations of adult life in ways that income replacement alone cannot fix. That got me to sit down with <a href="https://www.linkedin.com/in/evangelossimoudis/">Evangelos Simoudis</a> for an <strong><a href="https://www.youtube.com/watch?v=39NPrwBzBis&amp;t=88s">unplanned podcast</a></strong> on exactly this topic. What came out of that conversation, and from the research I have been reading, is a practical list of things knowledge workers can do right now to stay valuable.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://gradientflow.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://gradientflow.substack.com/subscribe?"><span>Subscribe now</span></a></p><p><strong>1. Build a working rhythm with AI, not just a habit of using it.</strong> The most productive people in AI adoption studies are not the heaviest users. They have a disciplined loop: direct the AI, check what it produces, refine, repeat. Fully handing off complex tasks produces mixed results. Keeping judgment in your own hands consistently does better. Learn to prompt well, spot-check outputs, and iterate fast. That loop is the skill.</p><p><strong>2. Treat verification as a primary skill, not a secondary check.</strong> Research finds that whether AI makes you more or less valuable comes down largely to how well you catch what it gets wrong. Workers who verify reliably produce better results. Workers who struggle tend to hand off too much, and quality drops even when the AI is performing fine. Small differences in verification ability lead to big differences in outcomes. If you cannot tell when AI is wrong, your employer will notice before you do.</p><p><strong>3. Know when not to hand a task to AI.</strong> The best workflow is not always more delegation. Workers tend to over-rely on AI on harder tasks, which is exactly when AI is least accurate and mistakes are hardest to catch. Knowing when to use AI, when to verify carefully, and when to just do the work yourself is a genuinely valuable skill. Being fast with AI tools is not the same as being reliable, and employers are starting to tell the difference.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!kycJ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F07821525-3bfe-4d4d-abe8-5935607d373c_1754x715.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!kycJ!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F07821525-3bfe-4d4d-abe8-5935607d373c_1754x715.jpeg 424w, https://substackcdn.com/image/fetch/$s_!kycJ!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F07821525-3bfe-4d4d-abe8-5935607d373c_1754x715.jpeg 848w, https://substackcdn.com/image/fetch/$s_!kycJ!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F07821525-3bfe-4d4d-abe8-5935607d373c_1754x715.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!kycJ!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F07821525-3bfe-4d4d-abe8-5935607d373c_1754x715.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!kycJ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F07821525-3bfe-4d4d-abe8-5935607d373c_1754x715.jpeg" width="618" height="252.12362637362637" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/07821525-3bfe-4d4d-abe8-5935607d373c_1754x715.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:594,&quot;width&quot;:1456,&quot;resizeWidth&quot;:618,&quot;bytes&quot;:157675,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/190457627?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F07821525-3bfe-4d4d-abe8-5935607d373c_1754x715.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!kycJ!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F07821525-3bfe-4d4d-abe8-5935607d373c_1754x715.jpeg 424w, https://substackcdn.com/image/fetch/$s_!kycJ!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F07821525-3bfe-4d4d-abe8-5935607d373c_1754x715.jpeg 848w, https://substackcdn.com/image/fetch/$s_!kycJ!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F07821525-3bfe-4d4d-abe8-5935607d373c_1754x715.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!kycJ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F07821525-3bfe-4d4d-abe8-5935607d373c_1754x715.jpeg 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><strong>4. Spend more time defining problems than executing solutions.</strong> Workers whose main job is carrying out known procedures face the highest automation risk. The defensible position is upstream: deciding which problems matter, what constraints matter, and how success should be measured. If your job is mostly applying known steps to familiar problems, AI is coming for that work first. The durable skill is not just execution. It is setting direction.</p><p><strong>5. Develop precision in articulating intent.</strong> Knowing what problem to solve is only half the job. You also have to describe it clearly enough that an AI cannot misread it: naming constraints, edge cases, and what a good result actually looks like before the work starts. This is already being called <strong>spec-driven development</strong> in software, but the skill is not technical. It looks the same whether you are a lawyer briefing an AI on what a clause must and must not allow, a marketer specifying tone guardrails before a campaign runs, or an analyst defining what the model should not infer. Vague intent produces vague output, and the gap shows up immediately in what comes back.</p><p><strong>6. Develop systems thinking across the full business process.</strong> As AI handles more individual tasks, the premium shifts toward people who understand how an entire process fits together. Someone who only knows their corner of a workflow is more exposed than someone who understands the end-to-end process and can spot where AI belongs in it. In most organizations still figuring out AI adoption, this kind of thinking is rare enough to be a real differentiator.</p><p><strong>7. Reframe AI as a colleague you manage, not a tool you operate.</strong> Software developers seeing the largest gains stopped treating AI as sophisticated autocomplete and started treating it as a team member: someone work gets delegated to, whose output gets reviewed, and whose limitations get planned around. This changes how you scope tasks and how you hold yourself accountable for results, because you are now the manager of the output. Similar workflows are likely to spread well beyond software into accounting, legal analysis, and other knowledge-work domains. </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!XLMu!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5fcfe9c8-a6c1-4240-a9ab-e5cf7353e82e_3962x2242.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!XLMu!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5fcfe9c8-a6c1-4240-a9ab-e5cf7353e82e_3962x2242.jpeg 424w, https://substackcdn.com/image/fetch/$s_!XLMu!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5fcfe9c8-a6c1-4240-a9ab-e5cf7353e82e_3962x2242.jpeg 848w, https://substackcdn.com/image/fetch/$s_!XLMu!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5fcfe9c8-a6c1-4240-a9ab-e5cf7353e82e_3962x2242.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!XLMu!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5fcfe9c8-a6c1-4240-a9ab-e5cf7353e82e_3962x2242.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!XLMu!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5fcfe9c8-a6c1-4240-a9ab-e5cf7353e82e_3962x2242.jpeg" width="1456" height="824" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/5fcfe9c8-a6c1-4240-a9ab-e5cf7353e82e_3962x2242.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:824,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:899056,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/190457627?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5fcfe9c8-a6c1-4240-a9ab-e5cf7353e82e_3962x2242.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!XLMu!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5fcfe9c8-a6c1-4240-a9ab-e5cf7353e82e_3962x2242.jpeg 424w, https://substackcdn.com/image/fetch/$s_!XLMu!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5fcfe9c8-a6c1-4240-a9ab-e5cf7353e82e_3962x2242.jpeg 848w, https://substackcdn.com/image/fetch/$s_!XLMu!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5fcfe9c8-a6c1-4240-a9ab-e5cf7353e82e_3962x2242.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!XLMu!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5fcfe9c8-a6c1-4240-a9ab-e5cf7353e82e_3962x2242.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">(<strong><a href="https://gradientflow.com/wp-content/uploads/2026/03/Agents-and-Knowledge-Work.jpeg">enlarge</a></strong>)</figcaption></figure></div><p><strong>8. Invest in domain expertise that cannot be written down.</strong> Generic, codifiable skills are easier for AI to absorb and easier for employers to replace. The durable investment is deep knowledge of a specific process, environment, or problem set, the kind that comes from years of experience rather than reading a manual. Knowing which edge cases matter, which heuristics hold up, and which approaches work even though no textbook recommends them, that kind of tacit knowledge is hard for AI to replicate and hard for employers to substitute away. It compounds over time. Codified skills do not.</p><p><strong>9. Make your actual contribution visible and verifiable.</strong> AI is flattening the visible difference between strong and average workers, and employers are responding by leaning harder on track records. Make your judgment visible: document the reasoning behind key decisions, write up problems you diagnosed that others missed, build a record of outcomes tied directly to your work. Polish is easy to fake now. Evidence of judgment is not.</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!rQdW!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff292d449-81f4-42ae-80c4-e60e22811aaf_1326x415.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!rQdW!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff292d449-81f4-42ae-80c4-e60e22811aaf_1326x415.png 424w, https://substackcdn.com/image/fetch/$s_!rQdW!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff292d449-81f4-42ae-80c4-e60e22811aaf_1326x415.png 848w, https://substackcdn.com/image/fetch/$s_!rQdW!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff292d449-81f4-42ae-80c4-e60e22811aaf_1326x415.png 1272w, https://substackcdn.com/image/fetch/$s_!rQdW!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff292d449-81f4-42ae-80c4-e60e22811aaf_1326x415.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!rQdW!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff292d449-81f4-42ae-80c4-e60e22811aaf_1326x415.png" width="516" height="161.49321266968326" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f292d449-81f4-42ae-80c4-e60e22811aaf_1326x415.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:415,&quot;width&quot;:1326,&quot;resizeWidth&quot;:516,&quot;bytes&quot;:205454,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/190457627?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff292d449-81f4-42ae-80c4-e60e22811aaf_1326x415.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!rQdW!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff292d449-81f4-42ae-80c4-e60e22811aaf_1326x415.png 424w, https://substackcdn.com/image/fetch/$s_!rQdW!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff292d449-81f4-42ae-80c4-e60e22811aaf_1326x415.png 848w, https://substackcdn.com/image/fetch/$s_!rQdW!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff292d449-81f4-42ae-80c4-e60e22811aaf_1326x415.png 1272w, https://substackcdn.com/image/fetch/$s_!rQdW!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff292d449-81f4-42ae-80c4-e60e22811aaf_1326x415.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p><strong>10. Pay attention to how your work product is being used.</strong> Unlike published content, the expertise of most knowledge workers is not protected by copyright. There is a real and growing risk that employers are <a href="https://www.theverge.com/cs/features/877388/white-collar-workers-training-ai-mercor">feeding internal work product</a> into model training pipelines without clear frameworks for worker ownership. You do not need deep technical knowledge to track this, just awareness and a habit of following how your company&#8217;s policies around AI training data are evolving. It matters most for senior people whose accumulated know-how is quietly becoming part of someone else&#8217;s training data.</p><p><strong>11. Prepare financially and structurally for a less stable career.</strong> Senior tech workers with 15-plus years of experience are already shifting toward financial resilience: larger cash reserves, lower fixed costs, and an expectation of <strong>more frequent job changes</strong>. It is also worth asking whether your skills could support splitting time across more than one employer. AI-driven productivity gains may reduce the total hours any one company needs from a given worker, even one who has adapted well. <strong>Treating your career as a portfolio of engagements</strong> is a reasonable hedge against that shift.</p><div class="pullquote"><p>As AI masters execution, the ultimate human skill moves upstream: defining the right problems and setting the constraints.</p></div><h4>What This List Cannot Fix</h4><p>This list is built for a specific scenario: that AI reshapes knowledge work substantially but does not eliminate the need for skilled people, or that we land somewhere in an uneven, messy middle. I will be honest, though. I lean toward the more disruptive end of that spectrum. The pace at which AI is compressing routine cognitive work, combined with how unprepared most institutions are for what comes next, makes me skeptical that individual adaptation alone will be enough for most workers. The list above is still worth working through. But it is worth being clear about what kind of response it represents.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!F7SG!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe98598d0-92b1-474b-a4bc-e3888b0c3f18_1682x791.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!F7SG!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe98598d0-92b1-474b-a4bc-e3888b0c3f18_1682x791.jpeg 424w, https://substackcdn.com/image/fetch/$s_!F7SG!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe98598d0-92b1-474b-a4bc-e3888b0c3f18_1682x791.jpeg 848w, https://substackcdn.com/image/fetch/$s_!F7SG!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe98598d0-92b1-474b-a4bc-e3888b0c3f18_1682x791.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!F7SG!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe98598d0-92b1-474b-a4bc-e3888b0c3f18_1682x791.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!F7SG!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe98598d0-92b1-474b-a4bc-e3888b0c3f18_1682x791.jpeg" width="608" height="286.04395604395603" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/e98598d0-92b1-474b-a4bc-e3888b0c3f18_1682x791.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:685,&quot;width&quot;:1456,&quot;resizeWidth&quot;:608,&quot;bytes&quot;:161428,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/190457627?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe98598d0-92b1-474b-a4bc-e3888b0c3f18_1682x791.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!F7SG!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe98598d0-92b1-474b-a4bc-e3888b0c3f18_1682x791.jpeg 424w, https://substackcdn.com/image/fetch/$s_!F7SG!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe98598d0-92b1-474b-a4bc-e3888b0c3f18_1682x791.jpeg 848w, https://substackcdn.com/image/fetch/$s_!F7SG!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe98598d0-92b1-474b-a4bc-e3888b0c3f18_1682x791.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!F7SG!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe98598d0-92b1-474b-a4bc-e3888b0c3f18_1682x791.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Behavioral scientists use two terms that are useful here. The <strong><a href="https://papers.ssrn.com/sol3/papers.cfm?abstract_id=4046264">i-frame</a></strong> focuses on what individuals can do to navigate a problem within the existing system. The <strong><a href="https://papers.ssrn.com/sol3/papers.cfm?abstract_id=4046264">s-frame</a></strong> asks whether the system itself needs to change. This list is entirely i-frame: upskill, reposition, save more, adapt. Those are real and worthwhile moves. But if AI displaces knowledge workers at a scale that individual adaptation cannot absorb, what is actually needed are s-frame responses: updated labor protections and <a href="https://www.forbes.com/sites/maryroeloffs/2026/02/17/billionaire-khosla-if-125-million-are-unemployed-by-ai-they-shouldnt-pay-taxes/">tax codes</a>, serious retraining infrastructure, legal frameworks around worker ownership of AI training data, and political leadership willing to treat this as a first-order problem. None of that exists in coherent form right now. The affected population is large, the anxiety is real, and the political opening is sitting there unclaimed. AI&#8217;s impact on employment could easily become a defining issue in the 2028 presidential race. It will depend on whether a skilled political figure decides to make it one.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!v3ND!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F22cd0f4e-e733-4864-b54c-384ab806fe7e_1502x830.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!v3ND!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F22cd0f4e-e733-4864-b54c-384ab806fe7e_1502x830.png 424w, https://substackcdn.com/image/fetch/$s_!v3ND!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F22cd0f4e-e733-4864-b54c-384ab806fe7e_1502x830.png 848w, https://substackcdn.com/image/fetch/$s_!v3ND!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F22cd0f4e-e733-4864-b54c-384ab806fe7e_1502x830.png 1272w, https://substackcdn.com/image/fetch/$s_!v3ND!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F22cd0f4e-e733-4864-b54c-384ab806fe7e_1502x830.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!v3ND!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F22cd0f4e-e733-4864-b54c-384ab806fe7e_1502x830.png" width="1456" height="805" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/22cd0f4e-e733-4864-b54c-384ab806fe7e_1502x830.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:805,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1258467,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/190457627?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F22cd0f4e-e733-4864-b54c-384ab806fe7e_1502x830.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!v3ND!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F22cd0f4e-e733-4864-b54c-384ab806fe7e_1502x830.png 424w, https://substackcdn.com/image/fetch/$s_!v3ND!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F22cd0f4e-e733-4864-b54c-384ab806fe7e_1502x830.png 848w, https://substackcdn.com/image/fetch/$s_!v3ND!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F22cd0f4e-e733-4864-b54c-384ab806fe7e_1502x830.png 1272w, https://substackcdn.com/image/fetch/$s_!v3ND!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F22cd0f4e-e733-4864-b54c-384ab806fe7e_1502x830.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><div><hr></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Rkua!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff677ba1e-6647-4764-883c-a93f12b85da6_1862x998.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Rkua!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff677ba1e-6647-4764-883c-a93f12b85da6_1862x998.jpeg 424w, https://substackcdn.com/image/fetch/$s_!Rkua!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff677ba1e-6647-4764-883c-a93f12b85da6_1862x998.jpeg 848w, https://substackcdn.com/image/fetch/$s_!Rkua!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff677ba1e-6647-4764-883c-a93f12b85da6_1862x998.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!Rkua!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff677ba1e-6647-4764-883c-a93f12b85da6_1862x998.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Rkua!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff677ba1e-6647-4764-883c-a93f12b85da6_1862x998.jpeg" width="627" height="335.89285714285717" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f677ba1e-6647-4764-883c-a93f12b85da6_1862x998.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:780,&quot;width&quot;:1456,&quot;resizeWidth&quot;:627,&quot;bytes&quot;:440158,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/190457627?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff677ba1e-6647-4764-883c-a93f12b85da6_1862x998.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Rkua!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff677ba1e-6647-4764-883c-a93f12b85da6_1862x998.jpeg 424w, https://substackcdn.com/image/fetch/$s_!Rkua!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff677ba1e-6647-4764-883c-a93f12b85da6_1862x998.jpeg 848w, https://substackcdn.com/image/fetch/$s_!Rkua!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff677ba1e-6647-4764-883c-a93f12b85da6_1862x998.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!Rkua!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff677ba1e-6647-4764-883c-a93f12b85da6_1862x998.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><ul><li><p><strong><a href="https://amzn.to/4lYtEnX">Project Maven: A Marine Colonel, His Team, and the Dawn of AI Warfare</a></strong>. This book hit differently given everything unfolding right now: the Anthropic-Pentagon fallout, US strikes on Iran, years of cheap drones reshaping battlefields from Ukraine onward. It traces how AI went from a colonel lugging a laptop around Afghanistan to a system helping pick thousands of targets in an active war. </p></li><li><p><strong><a href="https://amzn.to/4sHBt47">Alone in Japan: A Journey to the Future</a></strong>. Japan is the canary in the coal mine for what&#8217;s coming at the rest of us: an aging, shrinking society grappling with demographic decline just as immigration is becoming politically toxic across the West. What makes this book so useful is that Japan may also be the testing ground for whatever comes next, from AI to robotics to reimagining how communities function with far fewer people. </p></li><li><p><strong><a href="https://amzn.to/4bFT3PS">It&#8217;s on You</a></strong>. This book reframed how I think about individual action versus systemic change. The <strong>i-frame/s-frame distinction</strong> makes clear why nudges and personal responsibility campaigns so often let the real culprits off the hook. If you&#8217;re trying to understand why well-meaning policy keeps falling short, this is a good place to start.</p></li></ul><div><hr></div><p></p><p><em><strong><a href="https://gradientflow.com/disclosure/">Ben Lorica</a></strong> edits <a href="https://ethics.dev/">Ethics.dev</a> and the <a href="https://gradientflow.substack.com/">Gradient Flow newsletter</a>, and he hosts the <strong><a href="https://thedataexchange.media/">Data Exchange podcast</a></strong>. He helps organize the <strong><a href="https://aiconference.com/?utm_source=gradientflow&amp;utm_medium=newsletter">AI Conference</a></strong> and the <strong><a href="https://agentconference.com/?utm_source=gradientflow&amp;utm_medium=newsletter">AI Agent Conference</a></strong>, while also serving as the Strategic Content Chair for AI at the <strong><a href="https://events.linuxfoundation.org/">Linux Foundation</a></strong>. You can follow him on <a href="https://www.linkedin.com/in/benlorica/">Linkedin</a>, <a href="https://x.com/bigdata">X</a>, <a href="https://indieweb.social/@bigdata">Mastodon</a>, <a href="https://www.reddit.com/r/GradientFlow/">Reddit</a>, <a href="https://bsky.app/profile/gradientflow.com">Bluesky</a>, <a href="https://www.youtube.com/c/GradientFlow">YouTube</a>, or <a href="https://www.tiktok.com/@gradientflow">TikTok</a>. This newsletter is produced by <a href="https://gradientflow.com/blog/">Gradient Flow</a>.</em></p>]]></content:encoded></item><item><title><![CDATA[Why smarter agent architecture does not always improve results]]></title><description><![CDATA[Put data, machine learning, and AI to work.]]></description><link>https://gradientflow.substack.com/p/are-your-ai-agents-confusing-activity</link><guid isPermaLink="false">https://gradientflow.substack.com/p/are-your-ai-agents-confusing-activity</guid><dc:creator><![CDATA[Ben Lorica 罗瑞卡]]></dc:creator><pubDate>Tue, 24 Mar 2026 13:02:59 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/42d07e3e-0e45-4f68-a958-152a9f086412_1404x754.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://gradientflow.com/newsletter/" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Qpug!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 424w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 848w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 1272w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Qpug!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png" width="1100" height="220" data-attrs="{&quot;src&quot;:&quot;https://bucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com/public/images/0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:220,&quot;width&quot;:1100,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:24747,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:&quot;https://gradientflow.com/newsletter/&quot;,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!Qpug!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 424w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 848w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 1272w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a></figure></div><p><strong><a href="https://gradientflow.substack.com/subscribe">Subscribe</a> &#8226;<a href="https://gradientflow.substack.com/"> Previous Issues</a></strong></p><h1><strong>Why Your AI Agents Need Engineering Instead of Best Practices</strong></h1><p>I remain optimistic about the impact agents will have on knowledge work. As I noted in an <a href="https://gradientflow.substack.com/p/execution-is-now-free-heres-your">earlier article</a>, fields shaped by clear rules and mature systems, including accounting and contract management, already look well suited to this kind of automation. But even if the opportunity is real, the practical reality is that AI teams are still learning how to build agents that work reliably in production. Moving from a fragile prototype to a dependable system requires more than a good prompt. It means thinking carefully about the underlying architecture. To see how these systems come together, it helps to break the stack into its main parts.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://gradientflow.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption"><em>Regular reader? Consider becoming a paid supporter &#128591;</em></p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p>In a working AI agent system, three core components define capability and behavior.</p><ul><li><p><strong>Tools</strong> are the individual actions an agent can perform: database queries, API calls, file operations, or code execution. They are the atomic operations that enable agents to reach out and interact with external systems.</p></li><li><p><strong>Skills</strong> operate at a higher level. They are reusable workflows that combine multiple tools with specific reasoning steps to accomplish meaningful business objectives like analyzing a contract or triaging support tickets.</p></li><li><p><strong>Context files</strong> like <a href="https://agents.md/?utm_source=gradientflow&amp;utm_medium=newsletter">AGENTS.md</a> work differently. Rather than adding capability, they define how the agent should think and act. They specify the agent&#8217;s role, decision-making guidelines, constraints, and the reasoning patterns it applies when facing choices.</p></li></ul><p>This three-layer separation is practical: it lets you mix tools into different skills, and run those skills under different behavioral frameworks, without rebuilding core logic.</p><p>Production agent systems depend on several other components that matter just as much as the tools themselves. <strong>Memory systems</strong> maintain continuity across multiple turns, allowing agents to reference past decisions and context. <strong>Orchestration frameworks</strong> determine whether one agent or multiple specialized agents should handle a task. <strong>Planning modules</strong> help break complex goals into executable sequences. <strong>State management</strong> ensures context carries across interactions. <strong>Guardrails and permissions</strong> prevent misuse and enforce organizational policy. <strong>Monitoring and logging</strong> let you see what the agent actually does, which often differs from what you expected. These pieces work together. Without memory, the agent can&#8217;t maintain context. Without orchestration, it can&#8217;t coordinate complex work. Without guardrails, it risks policy violations.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://gradientflow.com/wp-content/uploads/2026/03/Agent-Tools-&#8212;-overview.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!AcMZ!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf23a404-032d-4e95-8b3c-462b20b4573d_1371x984.jpeg 424w, https://substackcdn.com/image/fetch/$s_!AcMZ!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf23a404-032d-4e95-8b3c-462b20b4573d_1371x984.jpeg 848w, https://substackcdn.com/image/fetch/$s_!AcMZ!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf23a404-032d-4e95-8b3c-462b20b4573d_1371x984.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!AcMZ!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf23a404-032d-4e95-8b3c-462b20b4573d_1371x984.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!AcMZ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf23a404-032d-4e95-8b3c-462b20b4573d_1371x984.jpeg" width="568" height="407.66739606126913" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/df23a404-032d-4e95-8b3c-462b20b4573d_1371x984.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:984,&quot;width&quot;:1371,&quot;resizeWidth&quot;:568,&quot;bytes&quot;:296899,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:&quot;https://gradientflow.com/wp-content/uploads/2026/03/Agent-Tools-&#8212;-overview.jpeg&quot;,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/189789650?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf23a404-032d-4e95-8b3c-462b20b4573d_1371x984.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!AcMZ!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf23a404-032d-4e95-8b3c-462b20b4573d_1371x984.jpeg 424w, https://substackcdn.com/image/fetch/$s_!AcMZ!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf23a404-032d-4e95-8b3c-462b20b4573d_1371x984.jpeg 848w, https://substackcdn.com/image/fetch/$s_!AcMZ!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf23a404-032d-4e95-8b3c-462b20b4573d_1371x984.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!AcMZ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf23a404-032d-4e95-8b3c-462b20b4573d_1371x984.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h4>Rethinking Coordination and Memory in Agent Systems</h4><p>There is still a great deal of experimentation happening across all these tool categories. <strong>Orchestration</strong> is one area seeing intense activity as builders realize that early frameworks are often too rigid. Older systems force developers to map out every workflow in advance or rely on unstructured agent chats. New tools are filling this gap by offering more flexibility and control. <a href="https://www.june.kim/cord?utm_source=gradientflow&amp;utm_medium=newsletter">Cord</a> is a recent example that lets agents build their own task trees on the fly. It allows models to decide when to split work into parallel tracks or share context without needing a hardcoded plan. <a href="https://github.com/generalaction/emdash">Emdash</a> tackles orchestration from a workspace angle by letting developers run multiple coding agents in parallel across isolated environments. This eliminates the messy reality of juggling different terminals and waiting for a single model to finish its job.</p><p>One underappreciated cost of adding agents is coordination overhead. In many-to-many designs, that overhead can rise very quickly as the number of agents grows. Centralized orchestration can reduce some of that complexity, though it introduces its own bottlenecks. More agents also means more inference costs and more opportunities for compounding errors. Recent studies suggest that adding agents helps in some settings, especially when work can be cleanly decomposed, but it can also add overhead and even reduce performance when the single-agent baseline is already strong or the task is highly sequential.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ZyxM!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd5262e04-3b10-453c-8520-55e872b9bdff_1626x916.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ZyxM!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd5262e04-3b10-453c-8520-55e872b9bdff_1626x916.jpeg 424w, https://substackcdn.com/image/fetch/$s_!ZyxM!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd5262e04-3b10-453c-8520-55e872b9bdff_1626x916.jpeg 848w, https://substackcdn.com/image/fetch/$s_!ZyxM!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd5262e04-3b10-453c-8520-55e872b9bdff_1626x916.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!ZyxM!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd5262e04-3b10-453c-8520-55e872b9bdff_1626x916.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ZyxM!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd5262e04-3b10-453c-8520-55e872b9bdff_1626x916.jpeg" width="472" height="265.8241758241758" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d5262e04-3b10-453c-8520-55e872b9bdff_1626x916.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:820,&quot;width&quot;:1456,&quot;resizeWidth&quot;:472,&quot;bytes&quot;:240141,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/189789650?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd5262e04-3b10-453c-8520-55e872b9bdff_1626x916.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!ZyxM!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd5262e04-3b10-453c-8520-55e872b9bdff_1626x916.jpeg 424w, https://substackcdn.com/image/fetch/$s_!ZyxM!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd5262e04-3b10-453c-8520-55e872b9bdff_1626x916.jpeg 848w, https://substackcdn.com/image/fetch/$s_!ZyxM!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd5262e04-3b10-453c-8520-55e872b9bdff_1626x916.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!ZyxM!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd5262e04-3b10-453c-8520-55e872b9bdff_1626x916.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><strong>Memory and context systems</strong> are also evolving to handle more than just conversational history. As I argued in an <a href="https://gradientflow.substack.com/p/the-missing-layer-in-todays-agent">earlier piece</a>, most current memory approaches are better at retrieving facts or preserving conversation than at helping agents repeat operational work reliably. To solve this, developers are moving toward <a href="https://gradientflow.substack.com/p/the-missing-layer-in-todays-agent">operational skill stores or context file systems</a>. It is less about chat history and more about procedural memory. Instead of overloading a prompt with endless documentation, these new systems save successful workflows as permanent procedures. The agent only loads the specific instructions it needs for the exact task at hand. This method turns temporary problem solving into reliable company assets while drastically cutting down on computing costs.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://gradientflow.com/wp-content/uploads/2026/03/Agent-Tools-&#8212;-eval-and-simplicity.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!AuyV!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1ec53bb2-a018-45d6-b996-39b6219defe7_1185x982.jpeg 424w, https://substackcdn.com/image/fetch/$s_!AuyV!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1ec53bb2-a018-45d6-b996-39b6219defe7_1185x982.jpeg 848w, https://substackcdn.com/image/fetch/$s_!AuyV!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1ec53bb2-a018-45d6-b996-39b6219defe7_1185x982.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!AuyV!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1ec53bb2-a018-45d6-b996-39b6219defe7_1185x982.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!AuyV!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1ec53bb2-a018-45d6-b996-39b6219defe7_1185x982.jpeg" width="432" height="357.99493670886073" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/1ec53bb2-a018-45d6-b996-39b6219defe7_1185x982.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:982,&quot;width&quot;:1185,&quot;resizeWidth&quot;:432,&quot;bytes&quot;:143932,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:&quot;https://gradientflow.com/wp-content/uploads/2026/03/Agent-Tools-&#8212;-eval-and-simplicity.jpeg&quot;,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/189789650?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1ec53bb2-a018-45d6-b996-39b6219defe7_1185x982.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!AuyV!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1ec53bb2-a018-45d6-b996-39b6219defe7_1185x982.jpeg 424w, https://substackcdn.com/image/fetch/$s_!AuyV!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1ec53bb2-a018-45d6-b996-39b6219defe7_1185x982.jpeg 848w, https://substackcdn.com/image/fetch/$s_!AuyV!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1ec53bb2-a018-45d6-b996-39b6219defe7_1185x982.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!AuyV!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1ec53bb2-a018-45d6-b996-39b6219defe7_1185x982.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h4>Moving From Art to Engineering in Agent Design</h4><p>As teams adopt new memory and orchestration tools, they often inherit best practices before testing whether those methods actually help in their own environments. <strong><a href="https://agents.md/?utm_source=gradientflow&amp;utm_medium=newsletter">AGENTS.md</a></strong> is a good example. These simple repository-level files are meant to guide how coding agents behave inside a codebase. A <a href="https://arxiv.org/abs/2602.11988v1">recent study</a> examined whether they deliver on that promise by testing coding agents on standard benchmarks and on a new benchmark, <a href="https://github.com/eth-sri/agentbench">AGENTBENCH</a>, built from real repositories. The results were not especially encouraging. Automatically generated context files reduced task success rates while increasing inference costs by more than 20 percent. Agents followed the instructions and explored the code more extensively, but that extra activity did not translate into better outcomes. Even developer-written files produced only modest gains.</p><div class="pullquote"><p>Building AI agents is an engineering discipline, not an art form. You get exactly what you measure.</p></div><p>Too many teams still build a workflow, run it a few times, decide it feels right, and ship it. That approach carries real risks. The standard practice in machine learning has long been to test each new component before adding it: does this actually improve results, and where does it now fail? The same logic applies to agent systems. The lesson from the AGENTS.md research is not that context files are useless. It is that adding any component - a guidance file, a new agent, a prompt change - should be treated as an engineering decision, not a default. <a href="https://www.linkedin.com/posts/leo-meyerovich-09649219_aiagents-claudecode-softwareengineering-activity-7432099506558545920--Ouz/">Leo Meyerovich</a> made this point well when he argued that teams get what they measure. In practice, that means defining clear evals for your own use cases and keeping only what improves results, whether the metric is task success, speed, safety, or cost. In agent systems, the question is not whether a recommendation sounds sensible. It is whether it improves performance in your setting.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://gradientflow.com/wp-content/uploads/2026/03/Agent-Tools-&#8212;-AGENTS.md-study.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!5pLS!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65b769ce-fa35-4ac2-8169-18ccceedcf4b_1886x1031.jpeg 424w, https://substackcdn.com/image/fetch/$s_!5pLS!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65b769ce-fa35-4ac2-8169-18ccceedcf4b_1886x1031.jpeg 848w, https://substackcdn.com/image/fetch/$s_!5pLS!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65b769ce-fa35-4ac2-8169-18ccceedcf4b_1886x1031.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!5pLS!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65b769ce-fa35-4ac2-8169-18ccceedcf4b_1886x1031.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!5pLS!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65b769ce-fa35-4ac2-8169-18ccceedcf4b_1886x1031.jpeg" width="670" height="366.2912087912088" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/65b769ce-fa35-4ac2-8169-18ccceedcf4b_1886x1031.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:796,&quot;width&quot;:1456,&quot;resizeWidth&quot;:670,&quot;bytes&quot;:257272,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:&quot;https://gradientflow.com/wp-content/uploads/2026/03/Agent-Tools-&#8212;-AGENTS.md-study.jpeg&quot;,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/189789650?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65b769ce-fa35-4ac2-8169-18ccceedcf4b_1886x1031.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!5pLS!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65b769ce-fa35-4ac2-8169-18ccceedcf4b_1886x1031.jpeg 424w, https://substackcdn.com/image/fetch/$s_!5pLS!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65b769ce-fa35-4ac2-8169-18ccceedcf4b_1886x1031.jpeg 848w, https://substackcdn.com/image/fetch/$s_!5pLS!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65b769ce-fa35-4ac2-8169-18ccceedcf4b_1886x1031.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!5pLS!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65b769ce-fa35-4ac2-8169-18ccceedcf4b_1886x1031.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Putting an AI agent into production means coordinating a stack of tools, skills, orchestration frameworks, memory systems, and guardrails. Developers and startups are still iterating quickly on this infrastructure, often in open source, and that experimentation is helping the field mature. But it is also easy to mistake architectural complexity for progress. As the <a href="https://arxiv.org/abs/2602.11988v1">evidence on context files</a> suggests, simpler tools paired with rigorous evaluation will often beat a more elaborate setup that has not been tested against real work. Part of the problem is that the number of variables in a working agent system is larger than it first appears. Chunking strategy, embedding choice, retrieval method, prompt structure, context window size, and model selection all interact. Teams that rely on defaults and intuition across these variables are, in effect, guessing. Systematic evaluation does not have to mean testing every combination - but it does mean knowing which variables matter most for your specific use case.</p><p>Getting an agent ready for production means running computationally intensive experiments to find the right configuration. Having an AI platform that lets you run those experiments efficiently is a distinct advantage. <strong>Dean Wampler</strong> recently <strong><a href="https://oreillyradar.substack.com/p/what-is-the-park-stack?utm_source=gradientflow&amp;utm_medium=newsletter">explored this in a new article on the PARK stack</a></strong>, an open source foundation built on PyTorch, AI models and agents, Ray, and Kubernetes. In the end, teams with scalable infrastructure and rigorous evaluation will be better positioned to solve real business problems.</p><div><hr></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!bsbG!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F498096ff-3009-4699-bbc9-445b8ff031f6_1743x994.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!bsbG!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F498096ff-3009-4699-bbc9-445b8ff031f6_1743x994.jpeg 424w, https://substackcdn.com/image/fetch/$s_!bsbG!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F498096ff-3009-4699-bbc9-445b8ff031f6_1743x994.jpeg 848w, https://substackcdn.com/image/fetch/$s_!bsbG!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F498096ff-3009-4699-bbc9-445b8ff031f6_1743x994.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!bsbG!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F498096ff-3009-4699-bbc9-445b8ff031f6_1743x994.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!bsbG!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F498096ff-3009-4699-bbc9-445b8ff031f6_1743x994.jpeg" width="1456" height="830" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/498096ff-3009-4699-bbc9-445b8ff031f6_1743x994.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:830,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:231569,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/189789650?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F498096ff-3009-4699-bbc9-445b8ff031f6_1743x994.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!bsbG!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F498096ff-3009-4699-bbc9-445b8ff031f6_1743x994.jpeg 424w, https://substackcdn.com/image/fetch/$s_!bsbG!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F498096ff-3009-4699-bbc9-445b8ff031f6_1743x994.jpeg 848w, https://substackcdn.com/image/fetch/$s_!bsbG!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F498096ff-3009-4699-bbc9-445b8ff031f6_1743x994.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!bsbG!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F498096ff-3009-4699-bbc9-445b8ff031f6_1743x994.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">How LanceDB fits into personal autonomous agents &#8212; based on the <strong><a href="https://lancedb.com/blog/openclaw-lancedb-memory-layer/?utm_source=gradientflow&amp;utm_medium=newsletter">LanceDB + OpenClaw integration guide</a></strong></figcaption></figure></div><div><hr></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!9y5e!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F136d46ff-164d-4b8e-864b-8df761bac19b_1851x646.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!9y5e!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F136d46ff-164d-4b8e-864b-8df761bac19b_1851x646.jpeg 424w, https://substackcdn.com/image/fetch/$s_!9y5e!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F136d46ff-164d-4b8e-864b-8df761bac19b_1851x646.jpeg 848w, https://substackcdn.com/image/fetch/$s_!9y5e!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F136d46ff-164d-4b8e-864b-8df761bac19b_1851x646.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!9y5e!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F136d46ff-164d-4b8e-864b-8df761bac19b_1851x646.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!9y5e!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F136d46ff-164d-4b8e-864b-8df761bac19b_1851x646.jpeg" width="1456" height="508" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/136d46ff-164d-4b8e-864b-8df761bac19b_1851x646.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:508,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:166686,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/189789650?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F136d46ff-164d-4b8e-864b-8df761bac19b_1851x646.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!9y5e!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F136d46ff-164d-4b8e-864b-8df761bac19b_1851x646.jpeg 424w, https://substackcdn.com/image/fetch/$s_!9y5e!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F136d46ff-164d-4b8e-864b-8df761bac19b_1851x646.jpeg 848w, https://substackcdn.com/image/fetch/$s_!9y5e!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F136d46ff-164d-4b8e-864b-8df761bac19b_1851x646.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!9y5e!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F136d46ff-164d-4b8e-864b-8df761bac19b_1851x646.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">From <a href="https://gradientflow.com/nvidia-gtc-2026/">&#8220;</a><strong><a href="https://gradientflow.com/nvidia-gtc-2026/">A Practitioner&#8217;s Guide to GTC 2026&#8221;</a></strong></figcaption></figure></div><div><hr></div><p></p><p><em><strong><a href="https://gradientflow.com/disclosure/">Ben Lorica</a></strong> edits <a href="https://ethics.dev/">Ethics.dev</a> and the <a href="https://gradientflow.substack.com/">Gradient Flow newsletter</a>, and he hosts the <strong><a href="https://thedataexchange.media/">Data Exchange podcast</a></strong>. He helps organize the <strong><a href="https://aiconference.com/?utm_source=gradientflow&amp;utm_medium=newsletter">AI Conference</a></strong> and the <strong><a href="https://agentconference.com/?utm_source=gradientflow&amp;utm_medium=newsletter">AI Agent Conference</a></strong>, while also serving as the Strategic Content Chair for AI at the <strong><a href="https://events.linuxfoundation.org/">Linux Foundation</a></strong>. You can follow him on <a href="https://www.linkedin.com/in/benlorica/">Linkedin</a>, <a href="https://x.com/bigdata">X</a>, <a href="https://indieweb.social/@bigdata">Mastodon</a>, <a href="https://www.reddit.com/r/GradientFlow/">Reddit</a>, <a href="https://bsky.app/profile/gradientflow.com">Bluesky</a>, <a href="https://www.youtube.com/c/GradientFlow">YouTube</a>, or <a href="https://www.tiktok.com/@gradientflow">TikTok</a>. This newsletter is produced by <a href="https://gradientflow.com/blog/">Gradient Flow</a>.</em></p>]]></content:encoded></item><item><title><![CDATA[When AI does the junior work, how do we train seniors?]]></title><description><![CDATA[Put data, machine learning, and AI to work.]]></description><link>https://gradientflow.substack.com/p/execution-is-now-free-heres-your</link><guid isPermaLink="false">https://gradientflow.substack.com/p/execution-is-now-free-heres-your</guid><dc:creator><![CDATA[Ben Lorica 罗瑞卡]]></dc:creator><pubDate>Tue, 17 Mar 2026 13:02:25 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/882851c6-ab11-43ae-a393-57697a9b8a60_1193x740.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://gradientflow.com/newsletter/" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Qpug!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 424w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 848w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 1272w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Qpug!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png" width="1100" height="220" data-attrs="{&quot;src&quot;:&quot;https://bucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com/public/images/0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:220,&quot;width&quot;:1100,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:24747,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:&quot;https://gradientflow.com/newsletter/&quot;,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!Qpug!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 424w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 848w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 1272w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a></figure></div><p><strong><a href="https://gradientflow.substack.com/subscribe">Subscribe</a> &#8226;<a href="https://gradientflow.substack.com/"> Previous Issues</a></strong></p><h1><strong>The Agentic Sweet Spot: Where AI Moves Fast and Humans Stay in the Loop</strong></h1><p>A recent <a href="https://www.anthropic.com/research/measuring-agent-autonomy?utm_source=gradientflow&amp;utm_medium=newsletter">Anthropic study</a> on agent autonomy offers a clear preview of where knowledge work is headed. Anthropic analyzed millions of real interactions across their public API and Claude Code to see how people actually deploy autonomous systems. The catch is that their clearest view comes from Claude Code, where they can track longer workflows end to end. I treat their strongest takeaways as a snapshot of coding agents rather than a universal map of all professions. That distinction matters. Software development is the proving ground for autonomous work because it requires far less hand-holding. You can run tests, get immediate pass or fail feedback, roll back mistakes, and break large tasks into smaller steps using mature and well-defined tools.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!TaAt!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F95a5d18c-741d-4b92-9900-53c238260462_1561x806.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!TaAt!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F95a5d18c-741d-4b92-9900-53c238260462_1561x806.jpeg 424w, https://substackcdn.com/image/fetch/$s_!TaAt!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F95a5d18c-741d-4b92-9900-53c238260462_1561x806.jpeg 848w, https://substackcdn.com/image/fetch/$s_!TaAt!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F95a5d18c-741d-4b92-9900-53c238260462_1561x806.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!TaAt!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F95a5d18c-741d-4b92-9900-53c238260462_1561x806.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!TaAt!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F95a5d18c-741d-4b92-9900-53c238260462_1561x806.jpeg" width="1456" height="752" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/95a5d18c-741d-4b92-9900-53c238260462_1561x806.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:752,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:202047,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/189088942?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F95a5d18c-741d-4b92-9900-53c238260462_1561x806.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!TaAt!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F95a5d18c-741d-4b92-9900-53c238260462_1561x806.jpeg 424w, https://substackcdn.com/image/fetch/$s_!TaAt!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F95a5d18c-741d-4b92-9900-53c238260462_1561x806.jpeg 848w, https://substackcdn.com/image/fetch/$s_!TaAt!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F95a5d18c-741d-4b92-9900-53c238260462_1561x806.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!TaAt!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F95a5d18c-741d-4b92-9900-53c238260462_1561x806.jpeg 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Software is just the beginning of the story. To get a glimpse of what comes next, we can look at financial operations and contract management, two fields already governed by strict rules, standard templates, and mature systems. These domains serve as excellent test cases to answer a practical question: where can autonomous agents move fast, and where does human review remain essential?</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://gradientflow.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://gradientflow.substack.com/subscribe?"><span>Subscribe now</span></a></p><h4>Accounting Through Rules, Systems, and Exceptions</h4><p>Accounting shares many of the structural advantages that make programming agent-friendly. The work operates under explicit, codified standards and policies that act like shared constraints. <a href="https://accountingfoundation.org/accounting-and-standards/about-gaap/what-is-gaap?utm_source=gradientflow&amp;utm_medium=newsletter">GAAP</a>, <a href="https://en.wikipedia.org/wiki/International_Financial_Reporting_Standards">IFRS</a>, and tax codes define how transactions should be treated and recorded. Tasks decompose naturally into discrete, well-scoped subtasks with clear inputs and outputs. Reconciling an account, calculating depreciation, classifying a transaction, or preparing a tax schedule each have defined parameters and specific rules to follow.</p><p>Many of the mechanical steps are verifiable. A reconciliation either balances or it does not. Once inputs and classifications are set, many calculations can be checked against explicit rules. A journal entry either posts correctly or generates an error. The existing tooling is mature: ERP systems, general ledger software, and tax platforms expose structured data and APIs. An agent can pull invoices, match them to purchase orders, flag exceptions, draft follow-up requests, and attach supporting documentation. In practice, this works best with strong access controls and a clear audit trail. Reversibility is present too, though with caveats. An adjusting entry can fix a misclassified transaction, but the downstream implications for reporting, audit trails, and regulatory filings raise the stakes compared to a simple code rollback.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!suJq!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F154078c8-146c-4ab8-bc4f-78426f779a57_1822x1067.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!suJq!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F154078c8-146c-4ab8-bc4f-78426f779a57_1822x1067.jpeg 424w, https://substackcdn.com/image/fetch/$s_!suJq!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F154078c8-146c-4ab8-bc4f-78426f779a57_1822x1067.jpeg 848w, https://substackcdn.com/image/fetch/$s_!suJq!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F154078c8-146c-4ab8-bc4f-78426f779a57_1822x1067.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!suJq!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F154078c8-146c-4ab8-bc4f-78426f779a57_1822x1067.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!suJq!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F154078c8-146c-4ab8-bc4f-78426f779a57_1822x1067.jpeg" width="1456" height="853" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/154078c8-146c-4ab8-bc4f-78426f779a57_1822x1067.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:853,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:502978,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/189088942?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F154078c8-146c-4ab8-bc4f-78426f779a57_1822x1067.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!suJq!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F154078c8-146c-4ab8-bc4f-78426f779a57_1822x1067.jpeg 424w, https://substackcdn.com/image/fetch/$s_!suJq!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F154078c8-146c-4ab8-bc4f-78426f779a57_1822x1067.jpeg 848w, https://substackcdn.com/image/fetch/$s_!suJq!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F154078c8-146c-4ab8-bc4f-78426f779a57_1822x1067.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!suJq!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F154078c8-146c-4ab8-bc4f-78426f779a57_1822x1067.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">(<strong><a href="https://gradientflow.com/wp-content/uploads/2026/03/AI-and-Knowledge-Work-%E2%80%94-accounting-and-finance.jpeg">enlarge</a></strong>)</figcaption></figure></div><p>The real friction point is data quality and process maturity. Agents work best when inputs are consistent, documentation is complete, and policies are explicit. In many companies, source data arrives messy: incomplete invoices, missing receipts, conflicting bank feeds. Accountants spend significant time reconciling these discrepancies manually. An agent helps most when the company has already standardized data collection and approval workflows. The accounting function already treats auditability and traceability as first-class requirements, which fits naturally with agent logging and transparency. The practical deployment model mirrors existing practice: agents handle routine posting, matching, and documentation gathering, while CPAs focus on judgment calls, exceptions, and sign-off.</p><h4>Why Contract Operations Are Agent-Friendly</h4><p>Legal operations quietly built the exact same building blocks that make software automation possible. Contracts are built from templates and clause libraries. Redlines are standard, and changes are easy to compare, much like code diffs. Many companies already operate with standard fallback terms, risk acceptance matrices, and approval workflows. This is basically a template-plus-rules setup that agents can navigate.</p><p>An agent can identify contract deviations from company standards, extract key obligations, flag risky or non-standard terms, propose edits with explanations, and route work by risk tier to the right approver. In contracts, correctness means the agreement matches the company&#8217;s standards and risk posture. Reviewers can assess this by examining the deltas rather than reading entire documents, spot-checking high-risk clauses, and enforcing policy gates. The field already has mature tools for document management, document versioning, and audit trails, which align well with agent logging and transparency.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!KrAL!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ae88f4f-4318-4ee1-9c3d-ac3490702a27_1852x1061.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!KrAL!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ae88f4f-4318-4ee1-9c3d-ac3490702a27_1852x1061.jpeg 424w, https://substackcdn.com/image/fetch/$s_!KrAL!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ae88f4f-4318-4ee1-9c3d-ac3490702a27_1852x1061.jpeg 848w, https://substackcdn.com/image/fetch/$s_!KrAL!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ae88f4f-4318-4ee1-9c3d-ac3490702a27_1852x1061.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!KrAL!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ae88f4f-4318-4ee1-9c3d-ac3490702a27_1852x1061.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!KrAL!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ae88f4f-4318-4ee1-9c3d-ac3490702a27_1852x1061.jpeg" width="1456" height="834" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2ae88f4f-4318-4ee1-9c3d-ac3490702a27_1852x1061.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:834,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:423893,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/189088942?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ae88f4f-4318-4ee1-9c3d-ac3490702a27_1852x1061.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!KrAL!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ae88f4f-4318-4ee1-9c3d-ac3490702a27_1852x1061.jpeg 424w, https://substackcdn.com/image/fetch/$s_!KrAL!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ae88f4f-4318-4ee1-9c3d-ac3490702a27_1852x1061.jpeg 848w, https://substackcdn.com/image/fetch/$s_!KrAL!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ae88f4f-4318-4ee1-9c3d-ac3490702a27_1852x1061.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!KrAL!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ae88f4f-4318-4ee1-9c3d-ac3490702a27_1852x1061.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">(<strong><a href="https://gradientflow.com/wp-content/uploads/2026/03/AI-and-Knowledge-Work-%E2%80%94-legal.jpeg">enlarge</a></strong>)</figcaption></figure></div><p>Genuine negotiation, novel legal interpretations, and edge cases still require attorney judgment. The sweet spot for agents is high-volume, templated work where terms follow known patterns. This works best when the company has already standardized clause libraries and clear acceptance criteria. Process maturity is the main constraint. Firms that have standardized templates and approval matrices will see faster value from agents. Humans stay in the loop for negotiation, exceptions, and final sign-off. In practice, agents handle initial screening and propose edits. Routine revisions are handled by the team, with escalations to senior counsel for complex negotiations and policy decisions.</p><h4>The new bottlenecks in knowledge work</h4><p>Coding, accounting, and contract operations share a pattern that shows up in more places than we might like to admit. When a job has clear inputs, explicit constraints, and cheap ways to verify results, agents take over the mechanical steps while humans shift toward review, prioritization, and exception handling. The organizations getting real leverage are not the ones writing the cleverest prompts. They are the teams treating agents like a production system by writing crisp specifications, forcing clarity up front, and making it easy to verify outputs before they hit a codebase, a financial ledger, or a contract repository.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!_lC_!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fee3e58a8-120f-4902-9bfd-e5759452dcf6_1616x1037.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!_lC_!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fee3e58a8-120f-4902-9bfd-e5759452dcf6_1616x1037.jpeg 424w, https://substackcdn.com/image/fetch/$s_!_lC_!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fee3e58a8-120f-4902-9bfd-e5759452dcf6_1616x1037.jpeg 848w, https://substackcdn.com/image/fetch/$s_!_lC_!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fee3e58a8-120f-4902-9bfd-e5759452dcf6_1616x1037.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!_lC_!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fee3e58a8-120f-4902-9bfd-e5759452dcf6_1616x1037.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!_lC_!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fee3e58a8-120f-4902-9bfd-e5759452dcf6_1616x1037.jpeg" width="1456" height="934" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ee3e58a8-120f-4902-9bfd-e5759452dcf6_1616x1037.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:934,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:275587,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/189088942?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fee3e58a8-120f-4902-9bfd-e5759452dcf6_1616x1037.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!_lC_!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fee3e58a8-120f-4902-9bfd-e5759452dcf6_1616x1037.jpeg 424w, https://substackcdn.com/image/fetch/$s_!_lC_!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fee3e58a8-120f-4902-9bfd-e5759452dcf6_1616x1037.jpeg 848w, https://substackcdn.com/image/fetch/$s_!_lC_!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fee3e58a8-120f-4902-9bfd-e5759452dcf6_1616x1037.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!_lC_!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fee3e58a8-120f-4902-9bfd-e5759452dcf6_1616x1037.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>The harder challenge is how this reshapes the company itself. As execution becomes practically free, the bottlenecks move to integration, quality assurance, and judgment calls. The core work becomes managing risk and coherence across a massive volume of machine-generated output. This forces leaders to build robust monitoring tools and tighten oversight precisely as they delegate more. It also creates a structural talent crisis. The entry-level tasks that historically trained junior employees are exactly the jobs agents do best. Companies will soon have to preserve manual skills intentionally just to ensure they still have people capable of troubleshooting when the automated systems inevitably fail.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!YuUW!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd9396d64-31e2-4f45-8c1c-cdb7e7d6ca1c_2107x1666.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!YuUW!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd9396d64-31e2-4f45-8c1c-cdb7e7d6ca1c_2107x1666.png 424w, https://substackcdn.com/image/fetch/$s_!YuUW!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd9396d64-31e2-4f45-8c1c-cdb7e7d6ca1c_2107x1666.png 848w, https://substackcdn.com/image/fetch/$s_!YuUW!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd9396d64-31e2-4f45-8c1c-cdb7e7d6ca1c_2107x1666.png 1272w, https://substackcdn.com/image/fetch/$s_!YuUW!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd9396d64-31e2-4f45-8c1c-cdb7e7d6ca1c_2107x1666.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!YuUW!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd9396d64-31e2-4f45-8c1c-cdb7e7d6ca1c_2107x1666.png" width="1456" height="1151" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d9396d64-31e2-4f45-8c1c-cdb7e7d6ca1c_2107x1666.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1151,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:500414,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/189088942?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd9396d64-31e2-4f45-8c1c-cdb7e7d6ca1c_2107x1666.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!YuUW!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd9396d64-31e2-4f45-8c1c-cdb7e7d6ca1c_2107x1666.png 424w, https://substackcdn.com/image/fetch/$s_!YuUW!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd9396d64-31e2-4f45-8c1c-cdb7e7d6ca1c_2107x1666.png 848w, https://substackcdn.com/image/fetch/$s_!YuUW!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd9396d64-31e2-4f45-8c1c-cdb7e7d6ca1c_2107x1666.png 1272w, https://substackcdn.com/image/fetch/$s_!YuUW!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd9396d64-31e2-4f45-8c1c-cdb7e7d6ca1c_2107x1666.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Knowledge work, reimagined: coding shows what's coming for every profession [<strong><a href="https://gradientflow.com/wp-content/uploads/2026/03/AI-impact-on-coding.png">enlarge</a></strong>]</figcaption></figure></div><div><hr></div><h1>Call for Speakers Closes March 20th</h1><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://aiconference.com/?utm_source=gradientflow&amp;utm_medium=newsletter" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!j-TL!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fefb3eae6-fdf1-43c9-869a-8d1282eed6d5_2652x1300.jpeg 424w, https://substackcdn.com/image/fetch/$s_!j-TL!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fefb3eae6-fdf1-43c9-869a-8d1282eed6d5_2652x1300.jpeg 848w, https://substackcdn.com/image/fetch/$s_!j-TL!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fefb3eae6-fdf1-43c9-869a-8d1282eed6d5_2652x1300.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!j-TL!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fefb3eae6-fdf1-43c9-869a-8d1282eed6d5_2652x1300.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!j-TL!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fefb3eae6-fdf1-43c9-869a-8d1282eed6d5_2652x1300.jpeg" width="1456" height="714" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/efb3eae6-fdf1-43c9-869a-8d1282eed6d5_2652x1300.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:714,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:282544,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:&quot;https://aiconference.com/?utm_source=gradientflow&amp;utm_medium=newsletter&quot;,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/189088942?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fefb3eae6-fdf1-43c9-869a-8d1282eed6d5_2652x1300.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!j-TL!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fefb3eae6-fdf1-43c9-869a-8d1282eed6d5_2652x1300.jpeg 424w, https://substackcdn.com/image/fetch/$s_!j-TL!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fefb3eae6-fdf1-43c9-869a-8d1282eed6d5_2652x1300.jpeg 848w, https://substackcdn.com/image/fetch/$s_!j-TL!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fefb3eae6-fdf1-43c9-869a-8d1282eed6d5_2652x1300.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!j-TL!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fefb3eae6-fdf1-43c9-869a-8d1282eed6d5_2652x1300.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.cvent.com/c/abstracts/8ac25ac7-f03c-467c-b3e5-f4e5f735db1e?utm_source=gradientflow&amp;utm_medium=newsletter&quot;,&quot;text&quot;:&quot;Submit A Talk&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.cvent.com/c/abstracts/8ac25ac7-f03c-467c-b3e5-f4e5f735db1e?utm_source=gradientflow&amp;utm_medium=newsletter"><span>Submit A Talk</span></a></p><div><hr></div><div id="youtube2-EZjXIqFwItM" class="youtube-wrap" data-attrs="{&quot;videoId&quot;:&quot;EZjXIqFwItM&quot;,&quot;startTime&quot;:null,&quot;endTime&quot;:null}" data-component-name="Youtube2ToDOM"><div class="youtube-inner"><iframe src="https://www.youtube-nocookie.com/embed/EZjXIqFwItM?rel=0&amp;autoplay=0&amp;showinfo=0&amp;enablejsapi=0" frameborder="0" loading="lazy" gesture="media" allow="autoplay; fullscreen" allowautoplay="true" allowfullscreen="true" width="728" height="409"></iframe></div></div><div><hr></div><p></p><p><em><strong><a href="https://gradientflow.com/disclosure/">Ben Lorica</a></strong> edits <a href="https://ethics.dev/">Ethics.dev</a> and the <a href="https://gradientflow.substack.com/">Gradient Flow newsletter</a>, and he hosts the <strong><a href="https://thedataexchange.media/">Data Exchange podcast</a></strong>. He helps organize the <strong><a href="https://aiconference.com/?utm_source=gradientflow&amp;utm_medium=newsletter">AI Conference</a></strong> and the <strong><a href="https://agentconference.com/?utm_source=gradientflow&amp;utm_medium=newsletter">AI Agent Conference</a></strong>, while also serving as the Strategic Content Chair for AI at the <strong><a href="https://events.linuxfoundation.org/">Linux Foundation</a></strong>. You can follow him on <a href="https://www.linkedin.com/in/benlorica/">Linkedin</a>, <a href="https://x.com/bigdata">X</a>, <a href="https://indieweb.social/@bigdata">Mastodon</a>, <a href="https://www.reddit.com/r/GradientFlow/">Reddit</a>, <a href="https://bsky.app/profile/gradientflow.com">Bluesky</a>, <a href="https://www.youtube.com/c/GradientFlow">YouTube</a>, or <a href="https://www.tiktok.com/@gradientflow">TikTok</a>. This newsletter is produced by <a href="https://gradientflow.com/blog/">Gradient Flow</a>.</em></p>]]></content:encoded></item><item><title><![CDATA[8 domains where AI agents are actually working]]></title><description><![CDATA[Put data, machine learning, and AI to work.]]></description><link>https://gradientflow.substack.com/p/how-to-move-from-passive-chatbot</link><guid isPermaLink="false">https://gradientflow.substack.com/p/how-to-move-from-passive-chatbot</guid><dc:creator><![CDATA[Ben Lorica 罗瑞卡]]></dc:creator><pubDate>Tue, 10 Mar 2026 13:03:00 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/ec4f59b5-703d-4454-a7a8-0884062e533b_1405x828.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://gradientflow.com/newsletter/" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Qpug!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 424w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 848w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 1272w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Qpug!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png" width="1100" height="220" data-attrs="{&quot;src&quot;:&quot;https://bucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com/public/images/0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:220,&quot;width&quot;:1100,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:24747,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:&quot;https://gradientflow.com/newsletter/&quot;,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!Qpug!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 424w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 848w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 1272w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a></figure></div><p><strong><a href="https://gradientflow.substack.com/subscribe">Subscribe</a> &#8226;<a href="https://gradientflow.substack.com/"> Previous Issues</a></strong></p><h1><strong>How Teams Actually Use RL to Make Agents Reliable</strong></h1><p>I have had a longstanding fascination with reinforcement learning (RL) and have monitored its slow diffusion from research labs into enterprise production. Much of the recent activity remains concentrated among foundation model builders and teams with dedicated post-training capacity. They use RL after pre-training to make large models reliable at executing tasks, not just generating text. This includes training agents to operate business software like CRMs and ticketing systems, run commands in cloud terminals, extract structured fields from messy documents, and navigate realistic user interfaces. In a few frontier cases, RL is even driving closed-loop workflows where a model proposes an action, tests it in a simulator, and learns from the measured outcomes.</p><p>To better understand real-world demand, I recently examined job postings in key U.S. technology hubs that mentioned RL. The results show that adoption is broader than just research labs. As the chart below illustrates, RL appears most often alongside Generative AI (57%) and AI infrastructure (43%), followed closely by autonomous agents (23%). There is also a long tail of activity across search, robotics, computer vision, and predictive analytics. The chart below provides a clear view of these diverse application areas.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!lpB3!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F301b2cfd-eb25-4319-9609-e2fb18fb4ea7_1897x1039.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!lpB3!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F301b2cfd-eb25-4319-9609-e2fb18fb4ea7_1897x1039.jpeg 424w, https://substackcdn.com/image/fetch/$s_!lpB3!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F301b2cfd-eb25-4319-9609-e2fb18fb4ea7_1897x1039.jpeg 848w, https://substackcdn.com/image/fetch/$s_!lpB3!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F301b2cfd-eb25-4319-9609-e2fb18fb4ea7_1897x1039.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!lpB3!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F301b2cfd-eb25-4319-9609-e2fb18fb4ea7_1897x1039.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!lpB3!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F301b2cfd-eb25-4319-9609-e2fb18fb4ea7_1897x1039.jpeg" width="1456" height="797" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/301b2cfd-eb25-4319-9609-e2fb18fb4ea7_1897x1039.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:797,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:393043,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/188566697?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F301b2cfd-eb25-4319-9609-e2fb18fb4ea7_1897x1039.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!lpB3!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F301b2cfd-eb25-4319-9609-e2fb18fb4ea7_1897x1039.jpeg 424w, https://substackcdn.com/image/fetch/$s_!lpB3!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F301b2cfd-eb25-4319-9609-e2fb18fb4ea7_1897x1039.jpeg 848w, https://substackcdn.com/image/fetch/$s_!lpB3!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F301b2cfd-eb25-4319-9609-e2fb18fb4ea7_1897x1039.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!lpB3!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F301b2cfd-eb25-4319-9609-e2fb18fb4ea7_1897x1039.jpeg 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">(<strong><a href="https://gradientflow.com/wp-content/uploads/2026/03/RL-Job-Posts-application-areas.jpeg">enlarge</a></strong>)</figcaption></figure></div><p>While the data spans a wide spectrum, the most dynamic category is <em><strong>AI Agents &amp; Autonomous Workflows</strong></em>. This sector represents the shift from passive chatbots to active systems that can execute complex tasks. Below is a closer look at how engineering teams are deploying RL to build these agentic systems across eight distinct domains.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://gradientflow.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://gradientflow.substack.com/subscribe?"><span>Subscribe now</span></a></p><h4>Dynamic Revenue Optimization</h4><p>In high-velocity environments like advertising and digital commerce, agents are replacing static rules with dynamic policies. Developers use <a href="https://en.wikipedia.org/wiki/Multi-armed_bandit#Contextual_bandit">contextual bandits</a> to make split-second decisions on ad placement or pricing by learning directly from user clicks and conversions. The complexity lies in managing competing objectives. Through <a href="https://arxiv.org/abs/2209.06866">constrained reinforcement learning</a>, these agents maximize revenue while strictly adhering to safety guardrails and budget caps. This approach allows systems to autonomously negotiate B2B transactions or adjust campaign bidding strategies in real time with less manual tuning day to day. In postings, this work is usually framed as bidding and budget policies, constrained optimization, and learning from advertiser or customer feedback when recommendations get accepted, rejected, or modified.</p><h4>Autonomous Software Refactoring</h4><p>Beyond simple code completion, agents are taking on deep software engineering tasks like language migration and vulnerability patching. The advantage here is objective feedback: compilers, tests, and deployment checks serve as verifiers. Agents receive negative rewards when code fails to build or pass assertions, creating a tight, iterative learning loop where the model refines its policy for code generation and refactoring until it produces correct solutions. This is especially valuable for long, interdependent workflows where order matters and early mistakes cascade. In job postings, this appears as autonomous debugging, test-driven agents, codebase migration automation, and policies that optimize for passing builds and safe rollouts.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!5uFh!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F061d6e34-a9a2-4ac6-949a-7eea0891642a_1835x1053.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!5uFh!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F061d6e34-a9a2-4ac6-949a-7eea0891642a_1835x1053.jpeg 424w, https://substackcdn.com/image/fetch/$s_!5uFh!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F061d6e34-a9a2-4ac6-949a-7eea0891642a_1835x1053.jpeg 848w, https://substackcdn.com/image/fetch/$s_!5uFh!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F061d6e34-a9a2-4ac6-949a-7eea0891642a_1835x1053.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!5uFh!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F061d6e34-a9a2-4ac6-949a-7eea0891642a_1835x1053.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!5uFh!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F061d6e34-a9a2-4ac6-949a-7eea0891642a_1835x1053.jpeg" width="1456" height="836" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/061d6e34-a9a2-4ac6-949a-7eea0891642a_1835x1053.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:836,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:298645,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/188566697?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F061d6e34-a9a2-4ac6-949a-7eea0891642a_1835x1053.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!5uFh!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F061d6e34-a9a2-4ac6-949a-7eea0891642a_1835x1053.jpeg 424w, https://substackcdn.com/image/fetch/$s_!5uFh!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F061d6e34-a9a2-4ac6-949a-7eea0891642a_1835x1053.jpeg 848w, https://substackcdn.com/image/fetch/$s_!5uFh!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F061d6e34-a9a2-4ac6-949a-7eea0891642a_1835x1053.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!5uFh!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F061d6e34-a9a2-4ac6-949a-7eea0891642a_1835x1053.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">(<strong><a href="https://gradientflow.com/wp-content/uploads/2026/03/RL-Job-Posts-industries.jpeg">enlarge</a></strong>)</figcaption></figure></div><h4>Beyond Robotic Process Automation</h4><p>In back-office settings, RL is often used to turn &#8220;tool use&#8221; into something closer to a dependable habit. Teams train agents on company-specific rules and tone using <a href="https://arxiv.org/abs/2504.12501">human feedback</a>, then push further with <a href="https://arxiv.org/abs/2005.01643">offline RL</a> that learns from logs of how skilled operators actually resolve cases. The RL signal is usually tied to outcomes practitioners care about, like fewer escalations, fewer retries, and clean completion of a workflow across systems such as HR, IT, finance, and CRM tools. In job postings, this tends to show up as agentic workflow automation, tool calling, human-in-the-loop review, and reward design that handles delayed outcomes.</p><h4>Automated Red Teaming</h4><p>Security teams are deploying agents to operate at machine speed for both defense and automated testing. Because security threats develop quickly and visibility is incomplete, agents must make decisions under uncertainty. <strong>Red team</strong> agents learn attack strategies in simulated environments while <strong>blue team</strong> agents learn to detect intrusions from incomplete alert data. This adversarial training discovers novel attack strategies and robust defenses that human analysts might miss during manual penetration testing. In job postings, this appears as autonomous <a href="https://gradientflow.com/ai-incident-response/">incident response</a>, continuous red teaming, adversarial training, and environments that let agents rehearse safely.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!d6L6!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F489e3b89-0508-41da-87f6-6f3a6913910d_2336x1228.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!d6L6!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F489e3b89-0508-41da-87f6-6f3a6913910d_2336x1228.jpeg 424w, https://substackcdn.com/image/fetch/$s_!d6L6!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F489e3b89-0508-41da-87f6-6f3a6913910d_2336x1228.jpeg 848w, https://substackcdn.com/image/fetch/$s_!d6L6!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F489e3b89-0508-41da-87f6-6f3a6913910d_2336x1228.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!d6L6!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F489e3b89-0508-41da-87f6-6f3a6913910d_2336x1228.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!d6L6!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F489e3b89-0508-41da-87f6-6f3a6913910d_2336x1228.jpeg" width="1456" height="765" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/489e3b89-0508-41da-87f6-6f3a6913910d_2336x1228.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:765,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:347773,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/188566697?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F489e3b89-0508-41da-87f6-6f3a6913910d_2336x1228.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!d6L6!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F489e3b89-0508-41da-87f6-6f3a6913910d_2336x1228.jpeg 424w, https://substackcdn.com/image/fetch/$s_!d6L6!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F489e3b89-0508-41da-87f6-6f3a6913910d_2336x1228.jpeg 848w, https://substackcdn.com/image/fetch/$s_!d6L6!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F489e3b89-0508-41da-87f6-6f3a6913910d_2336x1228.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!d6L6!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F489e3b89-0508-41da-87f6-6f3a6913910d_2336x1228.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">(graphic courtesy of <strong><a href="https://www.anyscale.com/?utm_source=gradientflow&amp;utm_medium=newsletter">Anyscale</a></strong>)</figcaption></figure></div><h4>Deep Information Synthesis (including Deep Research)</h4><p>For tasks requiring <a href="https://en.wikipedia.org/wiki/Thinking,_Fast_and_Slow">&#8220;System 2&#8221; thinking</a>, such as comprehensive market research or legal synthesis, engineers employ process supervision to reward intermediate steps rather than relying solely on final outcomes. This encourages agents to gather evidence, cite sources, and recognize when they need to search further, instead of hallucinating plausible answers. The result is a system that follows verifiable paths while knowing when to keep searching. In job postings, this shows up as process supervision, evidence-based synthesis, citation quality, and workflows that explicitly train agents to avoid shortcuts.</p><h4>Autonomous Supply Chain Management</h4><p>Logistics and supply chain applications require agents to bridge the digital and physical worlds. Whether managing warehouse robots or routing delivery fleets, these systems treat operational choices as sequential steps that reshape the network. Routing decisions, for instance, ripple through fleet availability, queue lengths, and delivery windows. Since training on live hardware is prohibitively expensive and dangerous, teams rely on simulation-based training. Agents master complex control tasks before transferring policies to the real world, balancing competing objectives like delivery speed, fuel costs, and safety margins. In job postings, these topics show up around dispatch and routing policies, digital twins, sim-to-real transfer, and constrained control for robotics and autonomous systems.</p><div class="pullquote"><p>We are witnessing a shift from passive chatbots to active systems, where RL is used to turn 'tool use' into dependable, repeatable habits.</p></div><h4>Autonomous Scientific Discovery</h4><p>In fields like pharmaceuticals and materials science, &#8220;scientist agents&#8221; close the loop between hypothesis generation and physical experimentation. They use active learning to navigate vast search spaces of potential compounds or materials. Because physical experiments are slow and expensive, teams typically train policies in simulation first and then use real experiments as high-value feedback. The core technical challenge is balancing exploration against exploitation: the agent must decide whether to refine a promising candidate or test a novel hypothesis, optimizing experimental design to maximize information gain while minimizing the time and cost of lab work. In job postings, the work shows up as closed-loop experimentation, lab robotics integration, simulation-driven training, and methods for balancing exploration with exploitation.</p><h4>RL in the Agent Orchestration Layer</h4><p>As agent ecosystems grow, infrastructure engineers are building the orchestration layers that manage them. The focus here is on the &#8220;agent runtime&#8221; where reinforcement learning optimizes how requests are routed to specialist workers. Rather than hard-coding which tool or agent to invoke and in what sequence, the system learns a policy based on success rates and constraints like latency and cost. Teams are also building evaluator agents that score plan quality, creating feedback loops where the orchestration layer continuously refines its coordination strategies based on actual task outcomes. In job postings, this looks like agent routers, planner-executor architectures, multi-agent coordination, evaluation frameworks, and guardrails that can be optimized over time.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!fg3Q!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb7b7ab7a-c76a-41fa-82c3-52d24119c078_1822x959.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!fg3Q!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb7b7ab7a-c76a-41fa-82c3-52d24119c078_1822x959.jpeg 424w, https://substackcdn.com/image/fetch/$s_!fg3Q!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb7b7ab7a-c76a-41fa-82c3-52d24119c078_1822x959.jpeg 848w, https://substackcdn.com/image/fetch/$s_!fg3Q!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb7b7ab7a-c76a-41fa-82c3-52d24119c078_1822x959.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!fg3Q!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb7b7ab7a-c76a-41fa-82c3-52d24119c078_1822x959.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!fg3Q!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb7b7ab7a-c76a-41fa-82c3-52d24119c078_1822x959.jpeg" width="604" height="317.7637362637363" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b7b7ab7a-c76a-41fa-82c3-52d24119c078_1822x959.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:766,&quot;width&quot;:1456,&quot;resizeWidth&quot;:604,&quot;bytes&quot;:274173,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/188566697?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb7b7ab7a-c76a-41fa-82c3-52d24119c078_1822x959.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!fg3Q!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb7b7ab7a-c76a-41fa-82c3-52d24119c078_1822x959.jpeg 424w, https://substackcdn.com/image/fetch/$s_!fg3Q!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb7b7ab7a-c76a-41fa-82c3-52d24119c078_1822x959.jpeg 848w, https://substackcdn.com/image/fetch/$s_!fg3Q!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb7b7ab7a-c76a-41fa-82c3-52d24119c078_1822x959.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!fg3Q!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb7b7ab7a-c76a-41fa-82c3-52d24119c078_1822x959.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>These eight domains illustrate where RL appears in production agent systems. <strong>RL is one approach among several for improving reliability and decision quality.</strong> In previous work, we explored complementary techniques <a href="https://gradientflow.substack.com/p/beyond-rl-a-new-paradigm-for-agent">including evolutionary methods</a> for discovering optimal agent architectures and structured observability that <a href="https://gradientflow.substack.com/p/inside-the-agent-optimization-toolkit">turns agent debugging from guesswork into measurable engineering</a>. The common thread across these strategies is identical: moving from costly trial-and-error toward systematic, repeatable improvement.</p><h4>Simulation First, Production Later: Safe RL Deployment Patterns</h4><p>Across these domains, the technical patterns are surprisingly consistent. Most teams start with offline RL from production logs, because naive online exploration is risky when the &#8220;environment&#8221; is a real business process or a live system. From there, the work quickly becomes about constraints and rollouts. Policies are trained in simulation or test environments, they ship behind safety filters, and they graduate from &#8220;suggest&#8221; to &#8220;act with confirmation&#8221; to limited autonomy for routine cases. Reward design is rarely a single number: it is usually a bundle of outcome metrics with hard limits on things like budget, latency, and safety.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!zCub!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff741294e-4fe0-4144-9edf-6304b89490c7_1738x1052.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!zCub!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff741294e-4fe0-4144-9edf-6304b89490c7_1738x1052.jpeg 424w, https://substackcdn.com/image/fetch/$s_!zCub!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff741294e-4fe0-4144-9edf-6304b89490c7_1738x1052.jpeg 848w, https://substackcdn.com/image/fetch/$s_!zCub!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff741294e-4fe0-4144-9edf-6304b89490c7_1738x1052.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!zCub!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff741294e-4fe0-4144-9edf-6304b89490c7_1738x1052.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!zCub!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff741294e-4fe0-4144-9edf-6304b89490c7_1738x1052.jpeg" width="634" height="383.62225274725273" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f741294e-4fe0-4144-9edf-6304b89490c7_1738x1052.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:881,&quot;width&quot;:1456,&quot;resizeWidth&quot;:634,&quot;bytes&quot;:223869,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/188566697?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff741294e-4fe0-4144-9edf-6304b89490c7_1738x1052.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!zCub!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff741294e-4fe0-4144-9edf-6304b89490c7_1738x1052.jpeg 424w, https://substackcdn.com/image/fetch/$s_!zCub!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff741294e-4fe0-4144-9edf-6304b89490c7_1738x1052.jpeg 848w, https://substackcdn.com/image/fetch/$s_!zCub!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff741294e-4fe0-4144-9edf-6304b89490c7_1738x1052.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!zCub!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff741294e-4fe0-4144-9edf-6304b89490c7_1738x1052.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">(<strong><a href="https://gradientflow.com/wp-content/uploads/2026/03/RL-Job-Posts-tools.jpeg">enlarge</a></strong>)</figcaption></figure></div><p>The market signal in the postings is not &#8220;more RL research.&#8221; There is demand for people who can connect RL ideas to production realities: instrumentation, evals that match business outcomes, careful guardrails, and integration with existing systems. If you are building agentic workflows, RL is showing up less as a magic upgrade and more as a way to learn better sequential decisions under real operational constraints.</p><div><hr></div><h1>Quick Takes</h1><div id="youtube2-39NPrwBzBis" class="youtube-wrap" data-attrs="{&quot;videoId&quot;:&quot;39NPrwBzBis&quot;,&quot;startTime&quot;:&quot;88&quot;,&quot;endTime&quot;:null}" data-component-name="Youtube2ToDOM"><div class="youtube-inner"><iframe src="https://www.youtube-nocookie.com/embed/39NPrwBzBis?start=88&amp;rel=0&amp;autoplay=0&amp;showinfo=0&amp;enablejsapi=0" frameborder="0" loading="lazy" gesture="media" allow="autoplay; fullscreen" allowautoplay="true" allowfullscreen="true" width="728" height="409"></iframe></div></div><ol><li><p><a href="https://youtu.be/39NPrwBzBis?t=88">The Shift from Routine Execution to Spec-Driven Work</a></p></li><li><p><a href="https://youtu.be/39NPrwBzBis?t=458">Orchestrating Multiple AI Agents: A Skill That Won&#8217;t Stay in Software Engineering</a></p></li><li><p><a href="https://youtu.be/39NPrwBzBis?t=1647">How to Protect Your Career When Your Output is Used to Train Models</a></p></li></ol><div><hr></div><p></p><p><em><strong><a href="https://gradientflow.com/disclosure/">Ben Lorica</a></strong> edits <a href="https://ethics.dev/">Ethics.dev</a> and the <a href="https://gradientflow.substack.com/">Gradient Flow newsletter</a>, and he hosts the <strong><a href="https://thedataexchange.media/">Data Exchange podcast</a></strong>. He helps organize the <strong><a href="https://aiconference.com/?utm_source=gradientflow&amp;utm_medium=newsletter">AI Conference</a></strong> and the <strong><a href="https://agentconference.com/?utm_source=gradientflow&amp;utm_medium=newsletter">AI Agent Conference</a></strong>, while also serving as the Strategic Content Chair for AI at the <strong><a href="https://events.linuxfoundation.org/">Linux Foundation</a></strong>. You can follow him on <a href="https://www.linkedin.com/in/benlorica/">Linkedin</a>, <a href="https://x.com/bigdata">X</a>, <a href="https://indieweb.social/@bigdata">Mastodon</a>, <a href="https://www.reddit.com/r/GradientFlow/">Reddit</a>, <a href="https://bsky.app/profile/gradientflow.com">Bluesky</a>, <a href="https://www.youtube.com/c/GradientFlow">YouTube</a>, or <a href="https://www.tiktok.com/@gradientflow">TikTok</a>. This newsletter is produced by <a href="https://gradientflow.com/blog/">Gradient Flow</a>.</em></p>]]></content:encoded></item><item><title><![CDATA[The warning signs your AI vendor is becoming your cage]]></title><description><![CDATA[Put data, machine learning, and AI to work.]]></description><link>https://gradientflow.substack.com/p/we-watched-this-happen-with-the-internet</link><guid isPermaLink="false">https://gradientflow.substack.com/p/we-watched-this-happen-with-the-internet</guid><dc:creator><![CDATA[Ben Lorica 罗瑞卡]]></dc:creator><pubDate>Tue, 03 Mar 2026 14:02:26 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/eb34f1d9-c60e-42ff-a7b3-c2d434755a1a_742x570.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://gradientflow.com/newsletter/" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Qpug!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 424w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 848w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 1272w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Qpug!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png" width="1100" height="220" data-attrs="{&quot;src&quot;:&quot;https://bucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com/public/images/0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:220,&quot;width&quot;:1100,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:24747,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:&quot;https://gradientflow.com/newsletter/&quot;,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!Qpug!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 424w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 848w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 1272w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a></figure></div><p><strong><a href="https://gradientflow.substack.com/subscribe">Subscribe</a> &#8226;<a href="https://gradientflow.substack.com/"> Previous Issues</a></strong></p><h1><strong>The Honeymoon Phase Won't Last: Preparing for AI's Platform Shift</strong></h1><p>I am old enough to remember the early days of the internet. It was a time when blogs were everywhere and information felt decentralized. Before the giant platforms and their <strong>algorithms</strong>, the web felt like a collection of independent voices. We had <strong>chronological feeds</strong> we controlled, not algorithmic ones controlled by someone else.</p><p>AI is still in the honeymoon phase, when platforms over-deliver to build adoption. Internet history tells us what typically follows. Right now there&#8217;s a mad rush to lock users into a specific model, a specific app, a specific workflow. I get why companies are doing it. But the best models today are often comparable for real workloads, and open-weight options are close enough that tying yourself to one vendor&#8217;s roadmap is a choice, not a necessity. I&#8217;ve always preferred to keep my options open: use open tools when I can, swap components when I need to, and avoid building around the assumption that any one platform will always treat me well.</p><p>What follows is a set of early warning signs. Think of them as &#8220;don&#8217;t repeat the internet&#8221; signals for teams building AI products.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://gradientflow.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption"><em>Support our work by becoming a paid subscriber.</em></p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><h4>Designing for Exit While It&#8217;s Still Cheap</h4><p>The first warning sign is when access to the core capability narrows over time. In AI, that can mean fewer technical details, fewer deployment options, and fewer ways to inspect or control behavior. You start by experimenting freely. Six months later, you can&#8217;t run anything locally, can&#8217;t audit what changed, can&#8217;t fork a model, and can&#8217;t make your own latency and cost tradeoffs. The vendor becomes your runtime, your roadmap, and your risk profile. You also inherit their update schedule. Many of us have noticed providers tweak models behind stable API names. Production applications need version-locking, regression test suites, and dashboards tracking output distributions to catch when behavior shifts overnight.</p><p>A second warning sign is policy volatility, often justified as safety measures. Terms of service and acceptable use policies can shift quickly, and enforcement can be inconsistent. The failure mode is not a clean error like traditional software. It&#8217;s silent refusals, degraded answers, or a workflow that works Monday and breaks Thursday with no code changes on your side. The model switches from verbose to terse, breaking parsing logic, or shifts output formats, breaking structured extraction.</p><div class="pullquote"><p>When a vendor becomes your roadmap, you inherit their risk profile.</p></div><p>A more awkward version of policy volatility is geopolitics. Providers can decide they cannot serve certain regions or certain industries, and suddenly your product stops working for those customers through no fault of your own.</p><p>Even legitimate use cases can fall into territory that makes providers nervous. Security analysis, healthcare applications, legal research can all trigger unpredictable restrictions, creating reliability problems that have nothing to do with your code quality. The internet version of this was &#8220;the algorithm changed.&#8221; The AI version is &#8220;the model decided it can&#8217;t help,&#8221; or the platform decided your use case is now out of bounds.</p><p>The practical move is to treat multi-provider support like an insurance premium, not a nice-to-have. Keep at least one viable fallback path. Avoid letting vendor-specific features, proprietary formats, or tool calling conventions become the spine of your product. Assume switching costs will be measured in months if you wait until you&#8217;re forced.</p><h4>Keep Your Moat on Your Side</h4><p>Another early warning sign is asymmetric data flow. Many providers use consumer chat data for model training or improvement unless you opt out. Enterprise and API data is often excluded unless you opt in. The risk is that teams still do real work in consumer chat surfaces, and their proprietary domain knowledge can end up improving a system their competitors can also access. For teams with deep specialized knowledge, repeated corrections and detailed examples can become a high-value signal that you are giving away for free, which can dilute your advantage over time.</p><p>Privacy raises a separate concern because context windows often contain sensitive information. When you paste code or customer details into an AI assistant, that data persists in the provider&#8217;s logs with specific retention policies. For developers using IDE integrations, the assistant might be seeing every file you open, not just the file you&#8217;re actively editing.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!H6q_!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff107a047-9433-487b-8c65-c7e1836a7512_1316x673.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!H6q_!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff107a047-9433-487b-8c65-c7e1836a7512_1316x673.png 424w, https://substackcdn.com/image/fetch/$s_!H6q_!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff107a047-9433-487b-8c65-c7e1836a7512_1316x673.png 848w, https://substackcdn.com/image/fetch/$s_!H6q_!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff107a047-9433-487b-8c65-c7e1836a7512_1316x673.png 1272w, https://substackcdn.com/image/fetch/$s_!H6q_!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff107a047-9433-487b-8c65-c7e1836a7512_1316x673.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!H6q_!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff107a047-9433-487b-8c65-c7e1836a7512_1316x673.png" width="502" height="256.7218844984802" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f107a047-9433-487b-8c65-c7e1836a7512_1316x673.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:673,&quot;width&quot;:1316,&quot;resizeWidth&quot;:502,&quot;bytes&quot;:450006,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/187699080?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff107a047-9433-487b-8c65-c7e1836a7512_1316x673.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!H6q_!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff107a047-9433-487b-8c65-c7e1836a7512_1316x673.png 424w, https://substackcdn.com/image/fetch/$s_!H6q_!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff107a047-9433-487b-8c65-c7e1836a7512_1316x673.png 848w, https://substackcdn.com/image/fetch/$s_!H6q_!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff107a047-9433-487b-8c65-c7e1836a7512_1316x673.png 1272w, https://substackcdn.com/image/fetch/$s_!H6q_!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff107a047-9433-487b-8c65-c7e1836a7512_1316x673.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>My rule of thumb is simple: treat AI input like you&#8217;re sending it to an external service you don&#8217;t control, because you are. Build guardrails into your code, not just your policy docs. Scan outbound prompts for secrets. Limit context to the minimum. For high-sensitivity workloads, the &#8220;right&#8221; architecture might include running an open model in your environment, even if you still use a hosted model for general tasks.</p><h4>Build Like You&#8217;ll Switch, Because You Will</h4><p>A subtler warning sign is the assumption that switching will be easy when you need to. Prompts that work cleanly on one model break on another. Tool calling formats differ. Output that&#8217;s reliable in one system gets flaky in another. Teams assume they can swap providers later, then discover that &#8220;later&#8221; means rewriting prompts, rerunning evaluations, rebuilding tool adapters, and retraining your team to work with a new model.</p><p>Embeddings and fine-tuning make this worse. If you build retrieval around one embedding model, moving means re-embedding your entire knowledge base and retuning your pipeline. If you fine-tune on a provider&#8217;s platform, you get an artifact that only works there. These techniques aren&#8217;t mistakes, but you need to price in the switching costs before you commit.</p><div class="pullquote"><p>Build like you&#8217;ll switch providers&#8212;because you will.</p></div><p>The practical move is to separate what your product does from which model you&#8217;re currently using. Put prompts, tool definitions, and routing logic behind a layer you control. Store conversation state in your own database, not in a vendor&#8217;s memory feature. Treat portability the same way you treat observability or backups: as a continuous discipline, not something you retrofit later.</p><h4>Treat Token Costs as a Product Risk</h4><p>Token-based pricing makes costs volatile. A prompt change, a longer context window, or a retry loop can spike your bill unexpectedly. Teams have seen cases where adding &#8220;explain your reasoning&#8221; to prompts doubled costs, or where a misconfigured chain-of-thought prompt made requests 50 times more expensive. You need real-time monitoring, rate limits, and circuit breakers from the start.</p><p>Beyond the bill spikes, there&#8217;s a longer-term pricing risk that should look familiar. Twitter&#8217;s API went from free and open to restricted and expensive. Google Maps had a cutover that forced billing and new keys, and many sites and apps saw maps fail until they reconfigured. Today&#8217;s AI API prices are likely below the true cost of inference, subsidized like Uber&#8217;s early rides. Amazon&#8217;s marketplace fees climbed from under 20% to over 50% once sellers were locked in. Token pricing could follow the same curve.</p><p>Rate limits and pricing tiers create different capability classes. Providers have strong incentives to degrade cheaper tiers over time while maintaining quality for premium customers. The economic logic is straightforward: inference is expensive, and the profit-maximizing move is to make lower tiers barely usable, pushing heavy users upward. Assume that whatever tier you&#8217;re on today will become worse unless you&#8217;re paying enterprise rates. Monitor model quality continuously, not just at launch.</p><div class="pullquote"><p>Assume your current model tier will eventually be degraded; the profit-maximizing move is to force you upward to enterprise rates.</p></div><p>There&#8217;s also a strategic risk that mirrors the platform era. Your provider can see your prompts and usage patterns, which tells them exactly where the valuable opportunities are. If they ship a competing feature, they start with your playbook. Meanwhile, advantages from clever prompting decay fast as models improve and competitors copy your patterns. The durable path requires something deeper: proprietary data, workflow integration, evaluation discipline, or distribution you control.</p><h4>Don&#8217;t Let Convenience Become a Cage</h4><p>A final warning sign is relying on a model&#8217;s internal knowledge as the sole truth for your application. Waiting for a provider to solve hallucination or complex reasoning is a trap that guarantees vendor lock-in. A recent example I found compelling is a tool called <strong><a href="https://realai.com/?utm_source=gradientflow&amp;utm_medium=newsletter">RealAI</a></strong>. They moved beyond simple document retrieval by fusing millions of disparate data points into standardized, queryable structured data. They also understood that foundation models still struggle with reliable math, so they wired their AI directly to dedicated financial calculators. When you move the actual intelligence into <strong>your own <a href="https://youtu.be/narbqIGrHw8?si=VTs1RaV7tJwsRHGt&amp;t=103">data pipelines and specialized external tools</a></strong>, you drastically reduce your dependency on any single provider. Your competitive advantage becomes your proprietary data fusion and business logic, which allows you to swap out the underlying language model whenever a better option appears.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!kmgN!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ab858a2-75cb-44b0-96b9-88caff8d21f2_1319x702.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!kmgN!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ab858a2-75cb-44b0-96b9-88caff8d21f2_1319x702.jpeg 424w, https://substackcdn.com/image/fetch/$s_!kmgN!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ab858a2-75cb-44b0-96b9-88caff8d21f2_1319x702.jpeg 848w, https://substackcdn.com/image/fetch/$s_!kmgN!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ab858a2-75cb-44b0-96b9-88caff8d21f2_1319x702.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!kmgN!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ab858a2-75cb-44b0-96b9-88caff8d21f2_1319x702.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!kmgN!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ab858a2-75cb-44b0-96b9-88caff8d21f2_1319x702.jpeg" width="641" height="341.1539044730857" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2ab858a2-75cb-44b0-96b9-88caff8d21f2_1319x702.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:702,&quot;width&quot;:1319,&quot;resizeWidth&quot;:641,&quot;bytes&quot;:133127,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/187699080?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ab858a2-75cb-44b0-96b9-88caff8d21f2_1319x702.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!kmgN!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ab858a2-75cb-44b0-96b9-88caff8d21f2_1319x702.jpeg 424w, https://substackcdn.com/image/fetch/$s_!kmgN!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ab858a2-75cb-44b0-96b9-88caff8d21f2_1319x702.jpeg 848w, https://substackcdn.com/image/fetch/$s_!kmgN!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ab858a2-75cb-44b0-96b9-88caff8d21f2_1319x702.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!kmgN!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ab858a2-75cb-44b0-96b9-88caff8d21f2_1319x702.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption"><strong><a href="https://gradientflow.com/oxygen-development-environment/">OpenCode + OpenRouter: O&#8322; (oxygen)</a></strong></figcaption></figure></div><p>These are early observations, not settled conclusions. The AI market is still taking shape. But the internet&#8217;s history suggests we should pay close attention to where incentives lead. We watched a handful of companies acquire their way to dominance the last time around. If we want a different outcome this time, we need early pressure for portability, transparency, user control, and actual antitrust enforcement before the defaults harden into something harder to change.</p><p>Some of that starts with the tools we pick. A concrete example from my own work: there&#8217;s real excitement around Claude Code right now, and I understand why. But I keep pointing people toward open tools that make model swapping normal. The setup I&#8217;ve been using is <a href="https://gradientflow.com/oxygen-development-environment/">OpenCode paired with OpenRouter</a>, which gives you access to different model providers without forcing you to rewrite everything when you want to try a different model. Internet platforms eventually moved to restrict the third-party tools that gave users more control. I&#8217;d rather adopt open tools now, while the window is open.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://gradientflow.com/wp-content/uploads/2026/03/Foundation-Models-&#8212;-propretary-and-open-weights.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!pOHA!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe08f5bee-6d1f-4de1-8921-986d318f55e7_1870x672.jpeg 424w, https://substackcdn.com/image/fetch/$s_!pOHA!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe08f5bee-6d1f-4de1-8921-986d318f55e7_1870x672.jpeg 848w, https://substackcdn.com/image/fetch/$s_!pOHA!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe08f5bee-6d1f-4de1-8921-986d318f55e7_1870x672.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!pOHA!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe08f5bee-6d1f-4de1-8921-986d318f55e7_1870x672.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!pOHA!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe08f5bee-6d1f-4de1-8921-986d318f55e7_1870x672.jpeg" width="1456" height="523" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/e08f5bee-6d1f-4de1-8921-986d318f55e7_1870x672.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:523,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:148929,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:&quot;https://gradientflow.com/wp-content/uploads/2026/03/Foundation-Models-&#8212;-propretary-and-open-weights.jpeg&quot;,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/187699080?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe08f5bee-6d1f-4de1-8921-986d318f55e7_1870x672.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!pOHA!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe08f5bee-6d1f-4de1-8921-986d318f55e7_1870x672.jpeg 424w, https://substackcdn.com/image/fetch/$s_!pOHA!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe08f5bee-6d1f-4de1-8921-986d318f55e7_1870x672.jpeg 848w, https://substackcdn.com/image/fetch/$s_!pOHA!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe08f5bee-6d1f-4de1-8921-986d318f55e7_1870x672.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!pOHA!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe08f5bee-6d1f-4de1-8921-986d318f55e7_1870x672.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>To be completely transparent, I still rely on proprietary models from Anthropic, Google, and OpenAI. A real performance gap remains between those systems and open-weights models. But that gap is narrowing quickly. You should actively explore offloading more workloads to smaller open-weights models that you can customize and augment with your own data and tools. While a startup can adopt this approach immediately, large organizations will naturally need to add layers for governance and compliance. The underlying goal is exactly the same for both. You must maintain optionality and keep control over your product.</p><div><hr></div><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!K9Q4!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F31a50fa1-b6a7-4193-9f8f-8a2fa3b906da_2630x316.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!K9Q4!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F31a50fa1-b6a7-4193-9f8f-8a2fa3b906da_2630x316.png 424w, https://substackcdn.com/image/fetch/$s_!K9Q4!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F31a50fa1-b6a7-4193-9f8f-8a2fa3b906da_2630x316.png 848w, https://substackcdn.com/image/fetch/$s_!K9Q4!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F31a50fa1-b6a7-4193-9f8f-8a2fa3b906da_2630x316.png 1272w, https://substackcdn.com/image/fetch/$s_!K9Q4!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F31a50fa1-b6a7-4193-9f8f-8a2fa3b906da_2630x316.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!K9Q4!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F31a50fa1-b6a7-4193-9f8f-8a2fa3b906da_2630x316.png" width="1456" height="175" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/31a50fa1-b6a7-4193-9f8f-8a2fa3b906da_2630x316.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:175,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:735605,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/187699080?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F31a50fa1-b6a7-4193-9f8f-8a2fa3b906da_2630x316.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!K9Q4!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F31a50fa1-b6a7-4193-9f8f-8a2fa3b906da_2630x316.png 424w, https://substackcdn.com/image/fetch/$s_!K9Q4!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F31a50fa1-b6a7-4193-9f8f-8a2fa3b906da_2630x316.png 848w, https://substackcdn.com/image/fetch/$s_!K9Q4!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F31a50fa1-b6a7-4193-9f8f-8a2fa3b906da_2630x316.png 1272w, https://substackcdn.com/image/fetch/$s_!K9Q4!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F31a50fa1-b6a7-4193-9f8f-8a2fa3b906da_2630x316.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p><strong><a href="https://ethics.dev/">Ethics.dev</a></strong> is our new sister site that tracks the practical impact of AI across fields like safety, labor markets, government regulation, and the economy. <a href="https://ethics.dev/">Bookmark this page</a> for daily updates on how these rapid developments will reshape your industry. It is designed as a practical resource to help you keep up with the complex rules and economic forces shaping the future of AI.</p><div><hr></div><p></p><p><em><strong><a href="https://gradientflow.com/disclosure/">Ben Lorica</a></strong> edits <a href="https://ethics.dev/">Ethics.dev</a> and the <a href="https://gradientflow.substack.com/">Gradient Flow newsletter</a>, and he hosts the <strong><a href="https://thedataexchange.media/">Data Exchange podcast</a></strong>. He helps organize the <strong><a href="https://aiconference.com/?utm_source=gradientflow&amp;utm_medium=newsletter">AI Conference</a></strong> and the <strong><a href="https://agentconference.com/?utm_source=gradientflow&amp;utm_medium=newsletter">AI Agent Conference</a></strong>, while also serving as the Strategic Content Chair for AI at the <strong><a href="https://events.linuxfoundation.org/">Linux Foundation</a></strong>. You can follow him on <a href="https://www.linkedin.com/in/benlorica/">Linkedin</a>, <a href="https://x.com/bigdata">X</a>, <a href="https://indieweb.social/@bigdata">Mastodon</a>, <a href="https://www.reddit.com/r/GradientFlow/">Reddit</a>, <a href="https://bsky.app/profile/gradientflow.com">Bluesky</a>, <a href="https://www.youtube.com/c/GradientFlow">YouTube</a>, or <a href="https://www.tiktok.com/@gradientflow">TikTok</a>. This newsletter is produced by <a href="https://gradientflow.com/blog/">Gradient Flow</a>.</em></p>]]></content:encoded></item><item><title><![CDATA[AI agents just made your data pipeline obsolete]]></title><description><![CDATA[Put data, machine learning, and AI to work.]]></description><link>https://gradientflow.substack.com/p/your-synthetic-data-pipeline-is-about</link><guid isPermaLink="false">https://gradientflow.substack.com/p/your-synthetic-data-pipeline-is-about</guid><dc:creator><![CDATA[Ben Lorica 罗瑞卡]]></dc:creator><pubDate>Tue, 24 Feb 2026 14:01:57 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/8700a9fc-f4da-422c-ba6d-b78cb6b191d3_1009x877.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://gradientflow.com/newsletter/" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Qpug!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 424w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 848w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 1272w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Qpug!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png" width="1100" height="220" data-attrs="{&quot;src&quot;:&quot;https://bucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com/public/images/0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:220,&quot;width&quot;:1100,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:24747,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:&quot;https://gradientflow.com/newsletter/&quot;,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!Qpug!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 424w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 848w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 1272w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a></figure></div><p><strong><a href="https://gradientflow.substack.com/subscribe">Subscribe</a> &#8226;<a href="https://gradientflow.substack.com/"> Previous Issues</a></strong></p><h1><strong>The Industrialization of Synthetic Data</strong></h1><p>Synthetic data used to be a fairly narrow idea: pad a small dataset, test a model without touching production data, maybe stress a system for bias. The rise of generative AI and autonomous agents has changed the landscape. Teams use synthetic data to train and evaluate agentic systems, to cover rare failure cases, to meet privacy and compliance requirements, and to simulate workflows that look more like real work than like a benchmark. As the use cases expanded, the &#8220;just generate more rows&#8221; mindset stopped working, and synthetic data started to look like an engineering system that needs real infrastructure.</p><p>Compute intensive, in this context, means two things. First, the cost per synthetic example is going up because each example is longer, more interactive, and often requires multiple model calls. Second, the pipeline around generation is getting heavier: validation, deduplication, tool execution, sandboxes, storage, and orchestration. This complexity has effectively turned synthetic data generation into an industrial-scale engineering problem.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://gradientflow.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption"><em>Been reading for a while? Support our work by becoming a paid subscriber.</em></p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p><strong>The unit of data got bigger<br></strong>Modern synthetic data is no longer just a short question and answer. It has evolved into long sequences of steps that include planning, reasoning, and using external tools. At the same time, we are asking models to show their work by producing step-by-step reasoning traces. If a single high-quality training example now spans thousands of tokens and dozens of steps, you need far more computing power to produce it. This is especially true for AI agents that must try a task, fix their own mistakes, and finish a job rather than just giving a quick response.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ocR3!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F90bfc255-fcbe-417c-989f-0bd3ba807f70_1894x984.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ocR3!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F90bfc255-fcbe-417c-989f-0bd3ba807f70_1894x984.jpeg 424w, https://substackcdn.com/image/fetch/$s_!ocR3!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F90bfc255-fcbe-417c-989f-0bd3ba807f70_1894x984.jpeg 848w, https://substackcdn.com/image/fetch/$s_!ocR3!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F90bfc255-fcbe-417c-989f-0bd3ba807f70_1894x984.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!ocR3!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F90bfc255-fcbe-417c-989f-0bd3ba807f70_1894x984.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ocR3!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F90bfc255-fcbe-417c-989f-0bd3ba807f70_1894x984.jpeg" width="1456" height="756" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/90bfc255-fcbe-417c-989f-0bd3ba807f70_1894x984.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:756,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:255205,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/186926578?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F90bfc255-fcbe-417c-989f-0bd3ba807f70_1894x984.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!ocR3!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F90bfc255-fcbe-417c-989f-0bd3ba807f70_1894x984.jpeg 424w, https://substackcdn.com/image/fetch/$s_!ocR3!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F90bfc255-fcbe-417c-989f-0bd3ba807f70_1894x984.jpeg 848w, https://substackcdn.com/image/fetch/$s_!ocR3!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F90bfc255-fcbe-417c-989f-0bd3ba807f70_1894x984.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!ocR3!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F90bfc255-fcbe-417c-989f-0bd3ba807f70_1894x984.jpeg 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption"><strong>(<a href="https://gradientflow.com/wp-content/uploads/2026/02/Synthetic-Data-generation-is-compute-intensive.jpeg">enlarge</a>)</strong></figcaption></figure></div><p><strong>One example now takes a small team of models<br></strong>Many pipelines have moved from a single model call per example to a coordinated workflow of different agents. One agent might select a persona, another generates the content, and a third refines the tone. When you multiply this by millions of examples, the total number of inference calls scales rapidly. In practice, teams building research assistants or customer-support agents find that synthetic data generation is actually a complex set of separate inference jobs that require sophisticated scheduling and tracking.</p><p><strong>Quality control became its own workload<br></strong>Because these sequences can be long, checking the work is no longer a simple final check. A tiny mistake at the start of a plan makes everything that follows a waste of time. To catch these errors, teams now use a second AI to judge every single step the first one takes. If a task has twenty steps, you might run fifty separate AI operations just to get one usable result. When you scale that to millions of examples, the demand for processing power explodes.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://gradientflow.com/wp-content/uploads/2026/02/Synthetic-Data-&#8212;-Turn-level-validation.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!rxsa!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F354fa9dd-35c6-4cb5-b7bf-e5e7d1637cfc_1513x895.jpeg 424w, https://substackcdn.com/image/fetch/$s_!rxsa!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F354fa9dd-35c6-4cb5-b7bf-e5e7d1637cfc_1513x895.jpeg 848w, https://substackcdn.com/image/fetch/$s_!rxsa!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F354fa9dd-35c6-4cb5-b7bf-e5e7d1637cfc_1513x895.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!rxsa!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F354fa9dd-35c6-4cb5-b7bf-e5e7d1637cfc_1513x895.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!rxsa!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F354fa9dd-35c6-4cb5-b7bf-e5e7d1637cfc_1513x895.jpeg" width="578" height="341.7980769230769" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/354fa9dd-35c6-4cb5-b7bf-e5e7d1637cfc_1513x895.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:861,&quot;width&quot;:1456,&quot;resizeWidth&quot;:578,&quot;bytes&quot;:244365,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:&quot;https://gradientflow.com/wp-content/uploads/2026/02/Synthetic-Data-&#8212;-Turn-level-validation.jpeg&quot;,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/186926578?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F354fa9dd-35c6-4cb5-b7bf-e5e7d1637cfc_1513x895.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!rxsa!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F354fa9dd-35c6-4cb5-b7bf-e5e7d1637cfc_1513x895.jpeg 424w, https://substackcdn.com/image/fetch/$s_!rxsa!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F354fa9dd-35c6-4cb5-b7bf-e5e7d1637cfc_1513x895.jpeg 848w, https://substackcdn.com/image/fetch/$s_!rxsa!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F354fa9dd-35c6-4cb5-b7bf-e5e7d1637cfc_1513x895.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!rxsa!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F354fa9dd-35c6-4cb5-b7bf-e5e7d1637cfc_1513x895.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><strong>&#8220;Trust but verify&#8221; requires running code<br></strong>For agents that use tools, a frequent failure is when the model claims it finished a task but actually failed. To solve this, pipelines now include executable validators. This means running Python scripts or checking API returns in real time to see if the code actually works. This pushes the compute burden away from pure GPU inference and into CPU, memory, and sandbox capacity, often requiring thousands of parallel, isolated containers to verify that the generated data is actually correct.</p><p><strong>Realism demands real tools and environments<br></strong>If you want to teach an agent to browse the web or use enterprise software, you cannot simply fake the responses. Teams are increasingly executing real tool calls and managing the associated rate limits, timeouts, and connectivity. For &#8220;computer use&#8221; training, the cost jumps significantly because you are running full virtual machines with browser engines and GUI rendering. This looks less like a data script and more like operating a massive virtual desktop fleet.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!hhAE!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe761291-c51f-4daa-8ace-79a75254a938_1311x811.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!hhAE!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe761291-c51f-4daa-8ace-79a75254a938_1311x811.png 424w, https://substackcdn.com/image/fetch/$s_!hhAE!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe761291-c51f-4daa-8ace-79a75254a938_1311x811.png 848w, https://substackcdn.com/image/fetch/$s_!hhAE!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe761291-c51f-4daa-8ace-79a75254a938_1311x811.png 1272w, https://substackcdn.com/image/fetch/$s_!hhAE!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe761291-c51f-4daa-8ace-79a75254a938_1311x811.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!hhAE!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe761291-c51f-4daa-8ace-79a75254a938_1311x811.png" width="566" height="350.1342486651411" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/fe761291-c51f-4daa-8ace-79a75254a938_1311x811.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:811,&quot;width&quot;:1311,&quot;resizeWidth&quot;:566,&quot;bytes&quot;:1249351,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/186926578?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe761291-c51f-4daa-8ace-79a75254a938_1311x811.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!hhAE!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe761291-c51f-4daa-8ace-79a75254a938_1311x811.png 424w, https://substackcdn.com/image/fetch/$s_!hhAE!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe761291-c51f-4daa-8ace-79a75254a938_1311x811.png 848w, https://substackcdn.com/image/fetch/$s_!hhAE!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe761291-c51f-4daa-8ace-79a75254a938_1311x811.png 1272w, https://substackcdn.com/image/fetch/$s_!hhAE!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe761291-c51f-4daa-8ace-79a75254a938_1311x811.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Real Tools, Real Complexity</figcaption></figure></div><p><strong>Keeping data diverse is a heavy lift<br></strong>Once you can generate data at scale, the bottleneck shifts to keeping that data varied. Production pipelines now generate massive numbers of candidate items, then use embedding models and clustering to deduplicate them aggressively. This requires large-scale embedding runs and significant compute spent on items that are ultimately discarded. This is a major hurdle for teams building enterprise copilots that need to handle a vast range of departments, personas, and edge cases without repeating themselves.</p><p><strong>Higher-fidelity generators raise the per-sample price<br></strong>In specialized fields like medical imaging, simple simulations are no longer enough. Generating high-resolution 3D images to train diagnostic AI requires advanced models that are much slower than older methods. Because training loops consume data faster than a single generator can produce it, teams often have to run massive GPU pools just to ensure the training process does not sit idle while waiting for the next batch of images.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!gBK4!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F10bb62f5-54ff-4f50-b057-dd7f9421283d_1506x886.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!gBK4!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F10bb62f5-54ff-4f50-b057-dd7f9421283d_1506x886.jpeg 424w, https://substackcdn.com/image/fetch/$s_!gBK4!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F10bb62f5-54ff-4f50-b057-dd7f9421283d_1506x886.jpeg 848w, https://substackcdn.com/image/fetch/$s_!gBK4!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F10bb62f5-54ff-4f50-b057-dd7f9421283d_1506x886.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!gBK4!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F10bb62f5-54ff-4f50-b057-dd7f9421283d_1506x886.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!gBK4!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F10bb62f5-54ff-4f50-b057-dd7f9421283d_1506x886.jpeg" width="596" height="350.80494505494505" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/10bb62f5-54ff-4f50-b057-dd7f9421283d_1506x886.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:857,&quot;width&quot;:1456,&quot;resizeWidth&quot;:596,&quot;bytes&quot;:196678,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/186926578?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F10bb62f5-54ff-4f50-b057-dd7f9421283d_1506x886.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!gBK4!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F10bb62f5-54ff-4f50-b057-dd7f9421283d_1506x886.jpeg 424w, https://substackcdn.com/image/fetch/$s_!gBK4!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F10bb62f5-54ff-4f50-b057-dd7f9421283d_1506x886.jpeg 848w, https://substackcdn.com/image/fetch/$s_!gBK4!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F10bb62f5-54ff-4f50-b057-dd7f9421283d_1506x886.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!gBK4!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F10bb62f5-54ff-4f50-b057-dd7f9421283d_1506x886.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><strong>Synthetic data is turning into an always-on factory<br></strong>Static datasets go stale quickly for interactive agents. Modern systems use a continuous loop where the agent interacts with environments and logs new experiences throughout the training process. This means your demand for computing power does not end once the data is collected. It persists throughout the entire life of the model. Keeping training and generation in sync becomes a major systems engineering challenge, requiring a production-grade service with its own monitoring, fault tolerance, and distributed infrastructure.</p><div class="pullquote"><p>The rise of these data factories is another reason to modernize your AI infrastructure.</p></div><h4>Putting the Pieces Together: Synthetic Data in Production Mode</h4><p>A system from <strong>Meta</strong> called <strong><a href="https://arxiv.org/html/2511.21686">Matrix</a></strong> shows how these requirements come together in a single synthetic data factory. It was built to create data for complex tasks like customer service and web research. These jobs require multiple AI agents to work together, which is much harder to manage than a simple question and answer script.</p><p>Matrix targets large-scale data generation where each &#8220;item&#8221; is not a single prompt-response, but an end-to-end workflow. Every task carries its own instructions and history as it moves between different AI agents. This design gets rid of a central controller that often slows things down. By letting each task move forward on its own, the system avoids the idle time that usually happens when computers have to wait for a large batch of work to finish.</p><p>The setup highlights how much infrastructure this requires. Matrix is built on an open-source stack (SLURM and <strong><a href="https://www.ray.io/?utm_source=gradientflow&amp;utm_medium=newsletter">Ray</a></strong>) and uses containerized execution (Apptainer) for tool and environment interaction, while compute-intensive operations like LLM inference and container workloads are handled as distributed services that can scale independently from the agents. In one test, the system handled over 12,000 tasks at once and produced 2 billion tokens of text in about four hours. For tasks that involve using real software tools, it can run 1,500 containers at the same time to verify that the results are accurate.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ocQR!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F24e93c22-b784-417d-9ddc-d43fb3a300db_602x432.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ocQR!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F24e93c22-b784-417d-9ddc-d43fb3a300db_602x432.png 424w, https://substackcdn.com/image/fetch/$s_!ocQR!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F24e93c22-b784-417d-9ddc-d43fb3a300db_602x432.png 848w, https://substackcdn.com/image/fetch/$s_!ocQR!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F24e93c22-b784-417d-9ddc-d43fb3a300db_602x432.png 1272w, https://substackcdn.com/image/fetch/$s_!ocQR!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F24e93c22-b784-417d-9ddc-d43fb3a300db_602x432.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ocQR!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F24e93c22-b784-417d-9ddc-d43fb3a300db_602x432.png" width="602" height="432" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/24e93c22-b784-417d-9ddc-d43fb3a300db_602x432.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:432,&quot;width&quot;:602,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:169206,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/186926578?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F24e93c22-b784-417d-9ddc-d43fb3a300db_602x432.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!ocQR!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F24e93c22-b784-417d-9ddc-d43fb3a300db_602x432.png 424w, https://substackcdn.com/image/fetch/$s_!ocQR!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F24e93c22-b784-417d-9ddc-d43fb3a300db_602x432.png 848w, https://substackcdn.com/image/fetch/$s_!ocQR!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F24e93c22-b784-417d-9ddc-d43fb3a300db_602x432.png 1272w, https://substackcdn.com/image/fetch/$s_!ocQR!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F24e93c22-b784-417d-9ddc-d43fb3a300db_602x432.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Meta's Matrix Agentic Data Generation Architecture</figcaption></figure></div><p>The rise of these <strong>data factories</strong> is another reason to modernize your AI infrastructure. Synthetic data pipelines now look like production systems that mix GPU-heavy generation and embedding runs with CPU-heavy filtering and tool execution. They also create a lot of read and write traffic as you iterate. A <strong><a href="https://gradientflow.substack.com/p/the-rise-of-the-multimodal-lakehouse">multimodal lakehouse</a></strong> is a sensible data layer for this work because it stores raw media alongside embeddings and features. It also feeds training and inference jobs without letting storage become a bottleneck that leaves GPUs waiting.</p><p>The compute side maps cleanly to the <strong><a href="https://www.youtube.com/watch?v=OaGFAPQmeGU&amp;t=41s">PARK stack</a></strong>. Kubernetes provides the cluster foundation and Ray coordinates the complex mix of distributed tasks to keep pipelines moving. PyTorch and your frontier models then handle the generation and training loops. This approach offers a practical way to treat synthetic data as a core part of your platform. It provides a durable place to store and query what you generate and a reliable way to scale the services that produce it.</p><p>Building these data factories does more than just improve reasoning and agent behavior. It provides the <a href="https://arxiv.org/abs/2602.04029">scale needed to train models on the multi-table databases</a> that most companies rely on. Done well, synthetic data ceases to be a stopgap and becomes a practical path to better business models, including things like churn, fraud, and forecasting.</p><div><hr></div><h1>You Don&#8217;t Need a Massive ML Team to Scale AI Affordably</h1><p>As generative AI applications mature, engineering teams are finding that standard API endpoints often fall short on cost and performance. Companies increasingly need to customize and scale their own AI workloads to remain efficient. A recent engineering <strong><a href="https://www.notion.com/blog/two-years-of-vector-search-at-notion?utm_source=gradientflow&amp;utm_medium=newsletter">blog post from Notion</a></strong> illustrates this shift perfectly. To handle billions of vector embeddings, Notion overhauled its infrastructure by migrating both indexing and serving to <strong><a href="https://www.ray.io/">Ray</a></strong>. The company noted that while tech giants build entire internal teams around open-source projects like Ray, Notion does not have a dedicated machine learning infrastructure team. Instead, they rely on a managed service from <strong><a href="https://www.anyscale.com/?utm_source=gradientflow&amp;utm_medium=newsletter">Anyscale</a></strong> to access these same enterprise-grade capabilities. Just as we saw with synthetic data pipelines, this migration is the <strong><a href="https://www.youtube.com/watch?v=OaGFAPQmeGU&amp;t=41s">PARK stack</a></strong> at work. By adopting these interoperable open-source compute components, teams can efficiently pipeline CPU and GPU tasks, run open-weight models directly, and drastically reduce latency without being locked into a single vendor.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://www.notion.com/blog/two-years-of-vector-search-at-notion?utm_source=gradientflow&amp;utm_medium=newsletter" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!hJ-h!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0d9397e9-4f0a-4f93-9828-91e50ebfe18e_1402x1005.jpeg 424w, https://substackcdn.com/image/fetch/$s_!hJ-h!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0d9397e9-4f0a-4f93-9828-91e50ebfe18e_1402x1005.jpeg 848w, https://substackcdn.com/image/fetch/$s_!hJ-h!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0d9397e9-4f0a-4f93-9828-91e50ebfe18e_1402x1005.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!hJ-h!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0d9397e9-4f0a-4f93-9828-91e50ebfe18e_1402x1005.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!hJ-h!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0d9397e9-4f0a-4f93-9828-91e50ebfe18e_1402x1005.jpeg" width="360" height="258.0599144079886" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/0d9397e9-4f0a-4f93-9828-91e50ebfe18e_1402x1005.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1005,&quot;width&quot;:1402,&quot;resizeWidth&quot;:360,&quot;bytes&quot;:123818,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:&quot;https://www.notion.com/blog/two-years-of-vector-search-at-notion?utm_source=gradientflow&amp;utm_medium=newsletter&quot;,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/186926578?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0d9397e9-4f0a-4f93-9828-91e50ebfe18e_1402x1005.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!hJ-h!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0d9397e9-4f0a-4f93-9828-91e50ebfe18e_1402x1005.jpeg 424w, https://substackcdn.com/image/fetch/$s_!hJ-h!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0d9397e9-4f0a-4f93-9828-91e50ebfe18e_1402x1005.jpeg 848w, https://substackcdn.com/image/fetch/$s_!hJ-h!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0d9397e9-4f0a-4f93-9828-91e50ebfe18e_1402x1005.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!hJ-h!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0d9397e9-4f0a-4f93-9828-91e50ebfe18e_1402x1005.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.notion.com/blog/two-years-of-vector-search-at-notion?utm_source=gradientflow&amp;utm_medium=newsletter&quot;,&quot;text&quot;:&quot;Learn More&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.notion.com/blog/two-years-of-vector-search-at-notion?utm_source=gradientflow&amp;utm_medium=newsletter"><span>Learn More</span></a></p><div><hr></div><p></p><p><em><strong><a href="https://gradientflow.com/disclosure/">Ben Lorica</a></strong> edits the <a href="https://gradientflow.substack.com/">Gradient Flow newsletter</a> and hosts the <strong><a href="https://thedataexchange.media/">Data Exchange podcast</a></strong>. He helps organize the <strong><a href="https://aiconference.com/?utm_source=gradientflow&amp;utm_medium=newsletter">AI Conference</a></strong> and the <strong><a href="https://agentconference.com/?utm_source=gradientflow&amp;utm_medium=newsletter">AI Agent Conference</a></strong>, while also serving as the Strategic Content Chair for AI at the <strong><a href="https://events.linuxfoundation.org/">Linux Foundation</a></strong>. You can follow him on <a href="https://www.linkedin.com/in/benlorica/">Linkedin</a>, <a href="https://x.com/bigdata">X</a>, <a href="https://indieweb.social/@bigdata">Mastodon</a>, <a href="https://www.reddit.com/r/GradientFlow/">Reddit</a>, <a href="https://bsky.app/profile/gradientflow.com">Bluesky</a>, <a href="https://www.youtube.com/c/GradientFlow">YouTube</a>, or <a href="https://www.tiktok.com/@gradientflow">TikTok</a>. This newsletter is produced by <a href="https://gradientflow.com/blog/">Gradient Flow</a>.</em></p>]]></content:encoded></item><item><title><![CDATA[The margin paradox threatening every AI company]]></title><description><![CDATA[Put data, machine learning, and AI to work.]]></description><link>https://gradientflow.substack.com/p/the-ai-bubble-is-obviousheres-what</link><guid isPermaLink="false">https://gradientflow.substack.com/p/the-ai-bubble-is-obviousheres-what</guid><dc:creator><![CDATA[Ben Lorica 罗瑞卡]]></dc:creator><pubDate>Tue, 17 Feb 2026 14:00:43 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/2437a4c0-ce70-44ee-bceb-37955af991f3_1371x916.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://gradientflow.com/newsletter/" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Qpug!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 424w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 848w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 1272w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Qpug!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png" width="1100" height="220" data-attrs="{&quot;src&quot;:&quot;https://bucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com/public/images/0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:220,&quot;width&quot;:1100,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:24747,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:&quot;https://gradientflow.com/newsletter/&quot;,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!Qpug!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 424w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 848w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 1272w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a></figure></div><p><strong><a href="https://gradientflow.substack.com/subscribe">Subscribe</a> &#8226;<a href="https://gradientflow.substack.com/"> Previous Issues</a></strong></p><h1><strong>The AI Bubble Is Real. Enterprise Usage Is Even More Telling.</strong></h1><p>The existence of an AI bubble is beyond dispute. What remains unclear is when or how it deflates. <a href="https://amzn.to/4bGnHsT">As investors know all too well</a>, the most costly mistake in business is often being correct prematurely. The infrastructure layer has already booked revenues. This includes sectors like semiconductors, data centers, and power grids. The application side is a different story. It&#8217;s still a guessing game, and we&#8217;re still trying to figure out what customers will actually open their wallets for.</p><p>Rather than attempting to time the correction, I try to focus on what enterprises are actually doing with AI in production. When you look at what&#8217;s actually happening, the reality doesn&#8217;t quite match the headlines. It&#8217;s no secret that coding is the big winner so far, but the real news is the growth in administrative automation, and a clear preference for &#8220;simple and reliable&#8221; over &#8220;flashy and complex.&#8221; Furthermore, Chinese AI firms are pivoting aggressively toward Western markets, introducing application-layer competition to segments where U.S. firms have focused primarily on foundational research.  Ultimately, we&#8217;re seeing two very different playbooks. Major Western AI labs are all-in on reaching AGI, but China is doubling down on the essentials: energy, supply chains, and <em><strong><a href="https://gradientflow.substack.com/p/the-real-ai-race-its-about-diffusion">diffusion</a></strong></em> &#8212; getting AI apps into the hands of as many companies as possible, as fast as possible.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://gradientflow.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption"><em>Regular reader? Become a paid supporter.</em></p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><h4>Dominant Use Cases: From Code to Creative to Administrative</h4><p>Software development is the dominant enterprise workload, rising to over half of total token usage by late 2025. AI-assisted programming and coding tasks include debugging, refactoring, and resolving bugs. Developer adoption ranges from half to two-thirds using AI tools daily, significantly increasing the velocity of the entire software lifecycle. A common pattern has emerged: teams start with IDE copilots, then move up the stack into codebase Q&amp;A, automated PR review, and &#8220;issue-to-patch&#8221; workflows that draft changes a human can validate.</p><p>But two other areas are quietly taking off just as fast: creative applications and administrative automation.  <strong>Creative use </strong>has surprised many AI teams. It isn&#8217;t just about getting the AI to write a document. It&#8217;s more like a back-and-forth process where you build ideas together. This shows up in tasks like creating different versions of an ad, tuning product copy, or even scripting training scenarios.</p><p><strong>Automating admin work</strong> is a huge deal for the bottom line, mainly because AI is now handling complex chores rather than just writing messages. AI is taking over the &#8220;busy work&#8221; of data entry and invoicing. Tasks that used to take all afternoon, like organizing insurance files or prepping financial entries, are now done automatically and handed to a human just for a final look. By automating sales tasks like record updates and follow-up sequences, AI removes the burden of manual data entry and administrative overhead.</p><p>There is also a massive shift happening under the hood. While everyone focuses on chatbots you can talk to, some of the most successful AI deployments are happening in the &#8220;plumbing&#8221; of the business. I&#8217;m hearing about more teams using agents for <a href="https://open.substack.com/pub/gradientflow/p/the-missing-layer-in-todays-agent?r=ks4p&amp;utm_campaign=post&amp;utm_medium=web&amp;showWelcomeOnShare=true">high-repetition tasks in data engineering and DevOps</a>. These agents handle the tedious work of moving data between systems or keeping infrastructure running. It&#8217;s the kind of work end users never see, but it&#8217;s where the real gains in speed and automation are being made.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!URiW!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1611d52a-bad8-42f9-b595-4e9b48006883_1834x1017.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!URiW!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1611d52a-bad8-42f9-b595-4e9b48006883_1834x1017.jpeg 424w, https://substackcdn.com/image/fetch/$s_!URiW!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1611d52a-bad8-42f9-b595-4e9b48006883_1834x1017.jpeg 848w, https://substackcdn.com/image/fetch/$s_!URiW!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1611d52a-bad8-42f9-b595-4e9b48006883_1834x1017.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!URiW!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1611d52a-bad8-42f9-b595-4e9b48006883_1834x1017.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!URiW!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1611d52a-bad8-42f9-b595-4e9b48006883_1834x1017.jpeg" width="1456" height="807" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/1611d52a-bad8-42f9-b595-4e9b48006883_1834x1017.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:807,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:217766,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/186147691?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1611d52a-bad8-42f9-b595-4e9b48006883_1834x1017.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!URiW!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1611d52a-bad8-42f9-b595-4e9b48006883_1834x1017.jpeg 424w, https://substackcdn.com/image/fetch/$s_!URiW!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1611d52a-bad8-42f9-b595-4e9b48006883_1834x1017.jpeg 848w, https://substackcdn.com/image/fetch/$s_!URiW!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1611d52a-bad8-42f9-b595-4e9b48006883_1834x1017.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!URiW!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1611d52a-bad8-42f9-b595-4e9b48006883_1834x1017.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h4>The Evolution of Inference Patterns</h4><p>The technical architecture of AI is transitioning from single-pass pattern matching, to multi-step reasoning. New reasoning models are driving this shift. Because they allow a system to &#8220;think&#8221; before it responds, we&#8217;re seeing a total shift in the way these models use tokens. This has led to a dramatic increase in prompt lengths as developers build more complex, agentic loops. While the narrative around &#8220;autonomous agents&#8221; often outpaces the reality of production environments, there is an undeniable trend toward workflows where models plan and iterate rather than simply predict the next word.</p><p>In this architectural shift, open-weight models have carved out a stable and significant niche. Chinese models, such as DeepSeek and Qwen, have become formidable competitors, often providing performance that rivals proprietary Western models at a fraction of the cost. The strategic implication isn&#8217;t just cheaper inference. Chinese firms are using their open-weight models as a foot in the door for Western markets. Once they&#8217;ve proven their tech is legit through open-source, they move quickly into building business apps rather than just focusing on foundational models.</p><h4>Deployment Patterns: Bounded Agency and Lifecycle Management</h4><p>In the real world, the most successful companies are using what I call &#8220;bounded agency.&#8221; Even though we have the tech to build fully autonomous agents, most teams are choosing to keep them on a short leash. They limit the AI to just a few steps before a human has to step in and check the work. The reason is simple: reliability. By keeping a human in the loop, you ensure that if the AI makes a mistake, you can actually see what happened and fix it. This is about helping experts do their jobs faster, not replacing them.</p><p>Most teams are also choosing stability over perfection. They&#8217;d rather use a standard, off-the-shelf model with good guardrails than spend months trying to fine-tune a &#8220;perfect&#8221; custom one. It&#8217;s more practical and much easier to manage.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!BZ4I!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffae18b3c-53d9-4e23-b5ef-dd1307b73329_1903x823.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!BZ4I!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffae18b3c-53d9-4e23-b5ef-dd1307b73329_1903x823.jpeg 424w, https://substackcdn.com/image/fetch/$s_!BZ4I!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffae18b3c-53d9-4e23-b5ef-dd1307b73329_1903x823.jpeg 848w, https://substackcdn.com/image/fetch/$s_!BZ4I!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffae18b3c-53d9-4e23-b5ef-dd1307b73329_1903x823.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!BZ4I!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffae18b3c-53d9-4e23-b5ef-dd1307b73329_1903x823.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!BZ4I!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffae18b3c-53d9-4e23-b5ef-dd1307b73329_1903x823.jpeg" width="1456" height="630" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/fae18b3c-53d9-4e23-b5ef-dd1307b73329_1903x823.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:630,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:191027,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/186147691?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffae18b3c-53d9-4e23-b5ef-dd1307b73329_1903x823.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!BZ4I!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffae18b3c-53d9-4e23-b5ef-dd1307b73329_1903x823.jpeg 424w, https://substackcdn.com/image/fetch/$s_!BZ4I!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffae18b3c-53d9-4e23-b5ef-dd1307b73329_1903x823.jpeg 848w, https://substackcdn.com/image/fetch/$s_!BZ4I!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffae18b3c-53d9-4e23-b5ef-dd1307b73329_1903x823.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!BZ4I!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffae18b3c-53d9-4e23-b5ef-dd1307b73329_1903x823.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>A related trend is a pattern I call <strong>"scaffold and shrink."</strong> Companies use a top-tier model as a temporary architect during the development or "vibe coding" phase. This expensive model writes the application code, refines the prompts, and generates the data needed to tune the system. Once the logic is solid, the team swaps it for a much smaller and faster model for daily use. This approach turns the high cost of a frontier model into a one-time development expense rather than a permanent tax on every customer interaction. It also levels the playing field. A small startup can use the best intelligence on the market to build its product without paying for that intelligence every time a user clicks a button. You do not need to run a massive model constantly if you have already used one to figure out exactly what your system needs to do.</p><p>The companies seeing the biggest returns tend to do two things well. <strong>First</strong>, they integrate AI deeply into their daily workflows. Instead of just using a chatbot, they turn general tasks into standard steps, like automatically updating a CRM right after a meeting. They then share those patterns across the whole company so everyone benefits. <strong>Second</strong>, they treat model upgrades as an ongoing process. They run old and new models side-by-side to catch any weird changes in behavior. They also build smart routing strategies so they aren&#8217;t using the most expensive model for every single tiny task.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ETlx!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7dba2d07-0d87-4aef-b91a-3e018a5150c4_1313x908.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ETlx!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7dba2d07-0d87-4aef-b91a-3e018a5150c4_1313x908.jpeg 424w, https://substackcdn.com/image/fetch/$s_!ETlx!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7dba2d07-0d87-4aef-b91a-3e018a5150c4_1313x908.jpeg 848w, https://substackcdn.com/image/fetch/$s_!ETlx!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7dba2d07-0d87-4aef-b91a-3e018a5150c4_1313x908.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!ETlx!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7dba2d07-0d87-4aef-b91a-3e018a5150c4_1313x908.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ETlx!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7dba2d07-0d87-4aef-b91a-3e018a5150c4_1313x908.jpeg" width="464" height="320.8773800456969" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/7dba2d07-0d87-4aef-b91a-3e018a5150c4_1313x908.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:908,&quot;width&quot;:1313,&quot;resizeWidth&quot;:464,&quot;bytes&quot;:244245,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/186147691?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7dba2d07-0d87-4aef-b91a-3e018a5150c4_1313x908.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!ETlx!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7dba2d07-0d87-4aef-b91a-3e018a5150c4_1313x908.jpeg 424w, https://substackcdn.com/image/fetch/$s_!ETlx!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7dba2d07-0d87-4aef-b91a-3e018a5150c4_1313x908.jpeg 848w, https://substackcdn.com/image/fetch/$s_!ETlx!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7dba2d07-0d87-4aef-b91a-3e018a5150c4_1313x908.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!ETlx!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7dba2d07-0d87-4aef-b91a-3e018a5150c4_1313x908.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>This is where the competition is really heating up. New players, especially from China, are arriving with ready-to-use apps that promise instant results. This puts pressure on established companies to prove they can offer more than just a quick fix. They have to show that their systems are more secure, better integrated, and more reliable for the long haul.</p><h4>Reliability and Infrastructure Gaps</h4><p><a href="https://gradientflow.substack.com/p/a-playbook-for-production-ready-ai">Reliability</a> is still the biggest hurdle for businesses today. Even with the top-tier foundation models we are all using right now, the success rate drops fast as soon as you give an AI agent a long, multi-step task. Think about a customer support agent trying to handle a refund. It has to look up a customer ID, check a return policy, initiate a bank transfer, and then send a confirmation email. Even if the AI is 90% accurate at each individual stage, the chances of the entire sequence working perfectly are surprisingly low. This &#8220;compound error&#8221; effect basically cuts the hype around productivity in half and is the main reason we can&#8217;t give these systems full autonomy in a production environment.</p><p>There is also a massive gap between coding and other types of work. AI is great at coding because it gets immediate feedback from a compiler. It knows right away if the code works or not. But for complex tasks like insurance authorizations or incident response, the feedback loop is much slower. You don&#8217;t find out if the AI made a mistake until much later, and those real-world consequences are much harder to automate.</p><p>AI also breaks the traditional way we build and test software. Normal software is predictable, but AI is not. It can be a bit random, which means most teams can&#8217;t rely on their usual automated testing and have to check the work manually. This creates a lot of friction, and it&#8217;s made worse by the massive increase in data being processed. Prompts have grown four times larger because these agent loops require thousands of tokens just to complete a single task.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Fqd8!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fce20646e-22c1-4bb9-ad96-8a4c553f49fb_1634x1041.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Fqd8!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fce20646e-22c1-4bb9-ad96-8a4c553f49fb_1634x1041.jpeg 424w, https://substackcdn.com/image/fetch/$s_!Fqd8!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fce20646e-22c1-4bb9-ad96-8a4c553f49fb_1634x1041.jpeg 848w, https://substackcdn.com/image/fetch/$s_!Fqd8!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fce20646e-22c1-4bb9-ad96-8a4c553f49fb_1634x1041.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!Fqd8!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fce20646e-22c1-4bb9-ad96-8a4c553f49fb_1634x1041.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Fqd8!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fce20646e-22c1-4bb9-ad96-8a4c553f49fb_1634x1041.jpeg" width="556" height="354.3736263736264" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ce20646e-22c1-4bb9-ad96-8a4c553f49fb_1634x1041.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:928,&quot;width&quot;:1456,&quot;resizeWidth&quot;:556,&quot;bytes&quot;:204677,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/186147691?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fce20646e-22c1-4bb9-ad96-8a4c553f49fb_1634x1041.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Fqd8!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fce20646e-22c1-4bb9-ad96-8a4c553f49fb_1634x1041.jpeg 424w, https://substackcdn.com/image/fetch/$s_!Fqd8!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fce20646e-22c1-4bb9-ad96-8a4c553f49fb_1634x1041.jpeg 848w, https://substackcdn.com/image/fetch/$s_!Fqd8!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fce20646e-22c1-4bb9-ad96-8a4c553f49fb_1634x1041.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!Fqd8!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fce20646e-22c1-4bb9-ad96-8a4c553f49fb_1634x1041.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Finally, there is a hidden risk in how concentrated AI usage has become. Most companies are using AI for the same ten things, like fixing software bugs. This creates a &#8220;single point of failure&#8221; for the whole business. If a model&#8217;s performance degrades or a provider has an outage, it wouldn&#8217;t just be a minor glitch. It could paralyze core operations across the entire company.</p><h4>Trust, Cybersecurity, and Geopolitical Considerations</h4><p>Trust and safety aren&#8217;t just about ethics anymore. They are a business requirement. Trust failures show up as delayed deployments and worse outcomes, especially when they involve concerns around privacy, safety, and governance. Many organizations are effectively trying to scale autonomy before they&#8217;ve standardized policies for permissions, audit, incident response, and supplier risk. (I covered agents and security shifts in a <a href="https://gradientflow.substack.com/p/security-for-ai-native-companies">previous article</a>.)</p><p><strong>Data sovereignty</strong> is also reshaping governance decisions. Buyers increasingly prioritize model origin and data residency, which influences vendor selection and hosting architecture. This is particularly relevant as Chinese startups expand into the West: enterprise buyers will demand more rigorous proof of control, transparency, and compliance before trusting these providers with sensitive, high-impact workflows.</p><h4>The Margin Paradox and Market Correction</h4><p>As AI tools become common, the industry is running into a bit of a trap. When everyone has the same efficiency-boosting tech, the savings just get passed on to the customer. This leads to lower prices and thinner profit margins for everyone. This deflationary cycle is being accelerated by the arrival of Chinese firms like <a href="https://manus.im/">Manus</a> and <a href="https://atoms.dev/">Atoms</a>. These companies add competitive pressure precisely in the application layer where monetization remains most uncertain, bringing sophisticated tools and outcome-driven approaches honed in brutal domestic competition.</p><div class="pullquote"><p>While the West is swinging for the fences with AGI, China is focused on the plumbing: securing supply chains and ensuring AI reaches every corner of the economy.</p></div><p>We all recognize that a market correction is unavoidable, and because AI is now a large share of the market narrative, the spillover won&#8217;t be neatly contained. Infrastructure suppliers are partially insulated. Capital spent on hardware is a sunk cost once it&#8217;s booked, so any adjustment is more likely to show up as slower growth than revenue reversals. But there is a major caveat here. Much of the massive data center build-out is being funded by debt. If the application side doesn&#8217;t start generating enough cash to cover those loans, that debt could turn a simple market slowdown into a much deeper financial strain for the companies building the physical foundations of AI. Cloud revenue, meanwhile, remains largely tied to basic compute and storage rather than AI-specific offerings. The challenge for investors is that while a few durable winners will emerge, trying to spot them today is more about following the crowd than how these businesses actually make money.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!JlOL!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F51d85c17-b08a-4d45-9623-76b402ab285c_1874x1046.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!JlOL!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F51d85c17-b08a-4d45-9623-76b402ab285c_1874x1046.jpeg 424w, https://substackcdn.com/image/fetch/$s_!JlOL!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F51d85c17-b08a-4d45-9623-76b402ab285c_1874x1046.jpeg 848w, https://substackcdn.com/image/fetch/$s_!JlOL!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F51d85c17-b08a-4d45-9623-76b402ab285c_1874x1046.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!JlOL!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F51d85c17-b08a-4d45-9623-76b402ab285c_1874x1046.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!JlOL!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F51d85c17-b08a-4d45-9623-76b402ab285c_1874x1046.jpeg" width="1456" height="813" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/51d85c17-b08a-4d45-9623-76b402ab285c_1874x1046.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:813,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:543491,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/186147691?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F51d85c17-b08a-4d45-9623-76b402ab285c_1874x1046.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!JlOL!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F51d85c17-b08a-4d45-9623-76b402ab285c_1874x1046.jpeg 424w, https://substackcdn.com/image/fetch/$s_!JlOL!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F51d85c17-b08a-4d45-9623-76b402ab285c_1874x1046.jpeg 848w, https://substackcdn.com/image/fetch/$s_!JlOL!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F51d85c17-b08a-4d45-9623-76b402ab285c_1874x1046.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!JlOL!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F51d85c17-b08a-4d45-9623-76b402ab285c_1874x1046.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">(<strong><a href="https://gradientflow.com/wp-content/uploads/2026/02/AI-Bubble-seven-signs.jpeg">enlarge</a></strong>)</figcaption></figure></div><p>Eventually, the bubble will burst, but that doesn&#8217;t mean the technology will stop being useful. History shows that once the hype dies down, the real work of &#8220;diffusion&#8221; begins as these tools become a standard part of every industry. That&#8217;s why I focus on what enterprises are actually doing today. By learning from the patterns that are working in the real world, you can move past the hype and start building AI projects that actually deliver results.</p><div><hr></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!2DFU!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F35daa9b0-8497-4e91-a4ec-2ba2ba817965_1862x1001.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!2DFU!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F35daa9b0-8497-4e91-a4ec-2ba2ba817965_1862x1001.jpeg 424w, https://substackcdn.com/image/fetch/$s_!2DFU!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F35daa9b0-8497-4e91-a4ec-2ba2ba817965_1862x1001.jpeg 848w, https://substackcdn.com/image/fetch/$s_!2DFU!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F35daa9b0-8497-4e91-a4ec-2ba2ba817965_1862x1001.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!2DFU!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F35daa9b0-8497-4e91-a4ec-2ba2ba817965_1862x1001.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!2DFU!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F35daa9b0-8497-4e91-a4ec-2ba2ba817965_1862x1001.jpeg" width="622" height="334.4958791208791" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/35daa9b0-8497-4e91-a4ec-2ba2ba817965_1862x1001.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:783,&quot;width&quot;:1456,&quot;resizeWidth&quot;:622,&quot;bytes&quot;:503895,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/186147691?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F35daa9b0-8497-4e91-a4ec-2ba2ba817965_1862x1001.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!2DFU!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F35daa9b0-8497-4e91-a4ec-2ba2ba817965_1862x1001.jpeg 424w, https://substackcdn.com/image/fetch/$s_!2DFU!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F35daa9b0-8497-4e91-a4ec-2ba2ba817965_1862x1001.jpeg 848w, https://substackcdn.com/image/fetch/$s_!2DFU!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F35daa9b0-8497-4e91-a4ec-2ba2ba817965_1862x1001.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!2DFU!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F35daa9b0-8497-4e91-a4ec-2ba2ba817965_1862x1001.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><ul><li><p><strong><a href="https://amzn.to/3YYkXzu">Reader Bot: What Happens When AI Reads and Why It Matters</a></strong>. I still think nothing beats curling up with a good book, but we are facing a perfect storm where long-standing declines in reading meet AI&#8217;s ability to process text for us. This book offers an excellent overview of how to use these models as helpful <em><strong>co-readers</strong></em> without outsourcing the actual joy of reading.</p></li><li><p><strong><a href="https://amzn.to/45tITyc">I Deliver Parcels in Beijing</a></strong>. Years ago in Shanghai I kept dodging electric scooters zipping around the French Concession to make deliveries, and it turns out that was just the tip of the iceberg. This book puts you inside the life of the person on that scooter, where every bathroom break has a cost in yuan and every failed delivery eats into wages that were already razor-thin.</p></li><li><p><strong><a href="https://amzn.to/4rjnWyT">Strangers: A Memoir of Marriage</a></strong>. I've read a lot of memoirs over the years, and this is one of the best. It is a sharp look at how little we sometimes know the people closest to us.</p></li></ul><div><hr></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!T0xV!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1d340052-675f-48d5-91ad-63f55b67caf4_1026x1045.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!T0xV!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1d340052-675f-48d5-91ad-63f55b67caf4_1026x1045.jpeg 424w, https://substackcdn.com/image/fetch/$s_!T0xV!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1d340052-675f-48d5-91ad-63f55b67caf4_1026x1045.jpeg 848w, https://substackcdn.com/image/fetch/$s_!T0xV!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1d340052-675f-48d5-91ad-63f55b67caf4_1026x1045.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!T0xV!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1d340052-675f-48d5-91ad-63f55b67caf4_1026x1045.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!T0xV!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1d340052-675f-48d5-91ad-63f55b67caf4_1026x1045.jpeg" width="646" height="657.9629629629629" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/1d340052-675f-48d5-91ad-63f55b67caf4_1026x1045.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1045,&quot;width&quot;:1026,&quot;resizeWidth&quot;:646,&quot;bytes&quot;:289743,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/186147691?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1d340052-675f-48d5-91ad-63f55b67caf4_1026x1045.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!T0xV!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1d340052-675f-48d5-91ad-63f55b67caf4_1026x1045.jpeg 424w, https://substackcdn.com/image/fetch/$s_!T0xV!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1d340052-675f-48d5-91ad-63f55b67caf4_1026x1045.jpeg 848w, https://substackcdn.com/image/fetch/$s_!T0xV!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1d340052-675f-48d5-91ad-63f55b67caf4_1026x1045.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!T0xV!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1d340052-675f-48d5-91ad-63f55b67caf4_1026x1045.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption"><strong>Security went down, valuation went up: </strong>In the age of AI, a vibe coding project can go from 512 vulnerabilities to billion-dollar offers in under 90 days &#8212; because adoption graphs can't be fabricated but <strong>code can always be rewritten</strong>.</figcaption></figure></div><div><hr></div><p></p><p><em><strong><a href="https://gradientflow.com/disclosure/">Ben Lorica</a></strong> edits the <a href="https://gradientflow.substack.com/">Gradient Flow newsletter</a> and hosts the <strong><a href="https://thedataexchange.media/">Data Exchange podcast</a></strong>. He helps organize the <strong><a href="https://aiconference.com/?utm_source=gradientflow&amp;utm_medium=newsletter">AI Conference</a></strong> and the <strong><a href="https://agentconference.com/?utm_source=gradientflow&amp;utm_medium=newsletter">AI Agent Conference</a></strong>, while also serving as the Strategic Content Chair for AI at the <strong><a href="https://events.linuxfoundation.org/">Linux Foundation</a></strong>. You can follow him on <a href="https://www.linkedin.com/in/benlorica/">Linkedin</a>, <a href="https://x.com/bigdata">X</a>, <a href="https://indieweb.social/@bigdata">Mastodon</a>, <a href="https://www.reddit.com/r/GradientFlow/">Reddit</a>, <a href="https://bsky.app/profile/gradientflow.com">Bluesky</a>, <a href="https://www.youtube.com/c/GradientFlow">YouTube</a>, or <a href="https://www.tiktok.com/@gradientflow">TikTok</a>. This newsletter is produced by <a href="https://gradientflow.com/blog/">Gradient Flow</a>.</em></p>]]></content:encoded></item><item><title><![CDATA[Your agents need runbooks, not bigger context windows]]></title><description><![CDATA[Put data, machine learning, and AI to work.]]></description><link>https://gradientflow.substack.com/p/the-missing-layer-in-todays-agent</link><guid isPermaLink="false">https://gradientflow.substack.com/p/the-missing-layer-in-todays-agent</guid><dc:creator><![CDATA[Ben Lorica 罗瑞卡]]></dc:creator><pubDate>Tue, 10 Feb 2026 14:01:36 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/89d1ad51-fa67-45f6-be5d-6a805b6d80d0_1190x930.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://gradientflow.com/newsletter/" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Qpug!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 424w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 848w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 1272w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Qpug!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png" width="1100" height="220" data-attrs="{&quot;src&quot;:&quot;https://bucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com/public/images/0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:220,&quot;width&quot;:1100,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:24747,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:&quot;https://gradientflow.com/newsletter/&quot;,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!Qpug!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 424w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 848w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 1272w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a></figure></div><p><strong><a href="https://gradientflow.substack.com/subscribe">Subscribe</a> &#8226;<a href="https://gradientflow.substack.com/"> Previous Issues</a></strong></p><h1><strong>Why Your AI Agents Need Operational Memory, Not Just Conversational Memory</strong></h1><p>Now that AI agents are moving out of the lab and into the real world, we&#8217;re realizing that &#8220;memory&#8221; isn&#8217;t one-size-fits-all. Most people think of agent memory like a personal assistant. It remembers your preferences, your travel plans, and the email you composed last week. This is great for a research copilot or a personal coach where the goal is a long-term relationship.</p><p>But if you&#8217;re deploying agents to handle back-office <strong>operations</strong> like fixing data pipelines or managing APIs, you don&#8217;t need a persona. You need reliability. I started looking into this because I saw teams struggling to make agents work in high-stakes, repetitive environments. In these cases, the goal isn&#8217;t conversational flair. Instead, it&#8217;s making sure the agent doesn&#8217;t have to reinvent the wheel every time it performs a task. By separating how an agent &#8220;thinks&#8221; from how it &#8220;acts,&#8221; we can finally scale automation without the massive costs and unpredictability that usually come with it.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://gradientflow.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption"><em>Been reading for a while? Support our work by becoming a paid subscriber.</em></p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><h4>Constraints of Infinite Context and the Limits of Modern Transformer Architectures</h4><p>In a production environment, the "just add more context" strategy hits a wall very quickly. Even though context windows have grown massive, the math behind them is still expensive. Because computational work <a href="https://arxiv.org/abs/2209.04881">grows quadratically with input length</a>, the more information you dump into a prompt, the slower the agent gets. A task that should take one second can balloon into thirty seconds just because the context is too large. There is also the <a href="https://arxiv.org/abs/2307.03172">&#8220;lost-in-the-middle&#8221;</a> effect to worry about. When prompts get too long, agents often lose track of the information buried in the center. Research shows that giving an agent small, focused snippets is often twice as accurate as making it read a giant document.</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!AVco!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01779956-3866-437b-bfe3-3edb729b8b74_1536x672.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!AVco!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01779956-3866-437b-bfe3-3edb729b8b74_1536x672.png 424w, https://substackcdn.com/image/fetch/$s_!AVco!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01779956-3866-437b-bfe3-3edb729b8b74_1536x672.png 848w, https://substackcdn.com/image/fetch/$s_!AVco!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01779956-3866-437b-bfe3-3edb729b8b74_1536x672.png 1272w, https://substackcdn.com/image/fetch/$s_!AVco!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01779956-3866-437b-bfe3-3edb729b8b74_1536x672.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!AVco!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01779956-3866-437b-bfe3-3edb729b8b74_1536x672.png" width="524" height="229.25" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/01779956-3866-437b-bfe3-3edb729b8b74_1536x672.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:637,&quot;width&quot;:1456,&quot;resizeWidth&quot;:524,&quot;bytes&quot;:1099181,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/185567464?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01779956-3866-437b-bfe3-3edb729b8b74_1536x672.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!AVco!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01779956-3866-437b-bfe3-3edb729b8b74_1536x672.png 424w, https://substackcdn.com/image/fetch/$s_!AVco!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01779956-3866-437b-bfe3-3edb729b8b74_1536x672.png 848w, https://substackcdn.com/image/fetch/$s_!AVco!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01779956-3866-437b-bfe3-3edb729b8b74_1536x672.png 1272w, https://substackcdn.com/image/fetch/$s_!AVco!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01779956-3866-437b-bfe3-3edb729b8b74_1536x672.png 1456w" sizes="100vw"></picture><div></div></div></a></figure></div><p>Beyond the technical overhead, there is a structural problem with how agents handle enterprise infrastructure. If you have hundreds of internal tools and APIs, describing them all can eat up your prompt&#8217;s token budget before the agent even starts its work. Most importantly, today&#8217;s architectures don't allow for organizational learning. Every time an agent runs a task, it treats it like a brand new puzzle. It pays the full "thinking cost" of planning and discovery every single time. Without a way to save and reuse a successful workflow, the system never gets faster or cheaper. It just stays stuck in a loop where the 1,000th execution is just as difficult as the first.</p><p>The root of this problem is that we are treating the agent&#8217;s context window like RAM: a volatile workspace that wipes clean the moment a task is finished. In a traditional computing stack, you wouldn&#8217;t reload your entire operating system every time you wanted to open a text file. But in the current agent paradigm, we force the model to &#8220;re-boot&#8221; its understanding of our infrastructure for every single request. This creates a massive <em><strong>Context Tax </strong></em>&#8212; a recurring overhead where you pay in both latency and tokens to re-teach the agent things it should already know.</p><h4>The Attention-Window Workarounds: Current Memory and Context Architectures</h4><p>Most current ways of handling agent memory focus on finding information rather than reusing successful actions. Take Retrieval-Augmented Generation (RAG) for example. It is great at fetching facts to help an agent answer a question. This helps avoid the "lost-in-the-middle" problem. But RAG is built to find documents and not to remember how a job was actually done. It can find a technical runbook for you. However, it cannot remember that a specific five-step sequence of API calls fixed a database issue last week. Every time a workflow starts, the agent has to read documentation and plan its steps from scratch.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!3Gz3!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d954ba0-5817-4fff-b615-70a2d953d6db_1791x972.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!3Gz3!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d954ba0-5817-4fff-b615-70a2d953d6db_1791x972.jpeg 424w, https://substackcdn.com/image/fetch/$s_!3Gz3!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d954ba0-5817-4fff-b615-70a2d953d6db_1791x972.jpeg 848w, https://substackcdn.com/image/fetch/$s_!3Gz3!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d954ba0-5817-4fff-b615-70a2d953d6db_1791x972.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!3Gz3!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d954ba0-5817-4fff-b615-70a2d953d6db_1791x972.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!3Gz3!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d954ba0-5817-4fff-b615-70a2d953d6db_1791x972.jpeg" width="1456" height="790" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/5d954ba0-5817-4fff-b615-70a2d953d6db_1791x972.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:790,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:267530,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/185567464?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d954ba0-5817-4fff-b615-70a2d953d6db_1791x972.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!3Gz3!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d954ba0-5817-4fff-b615-70a2d953d6db_1791x972.jpeg 424w, https://substackcdn.com/image/fetch/$s_!3Gz3!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d954ba0-5817-4fff-b615-70a2d953d6db_1791x972.jpeg 848w, https://substackcdn.com/image/fetch/$s_!3Gz3!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d954ba0-5817-4fff-b615-70a2d953d6db_1791x972.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!3Gz3!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d954ba0-5817-4fff-b615-70a2d953d6db_1791x972.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>More advanced agent orchestration frameworks try to fix "catalog bloat" by loading tool definitions only when they are needed. This saves money on tokens, but it still doesn't help the organization learn. Other systems focus on "stateful continuity." This ensures an agent remembers who you are and what you like over several months. That is perfect for a personal assistant, but it doesn't help with operational tasks. If an API is broken or a tool description is confusing, the agent will just keep making the same mistake every time it runs. We also see agents that write code on the fly. This often leads to "workspace chaos" where you end up with hundreds of one-off scripts that nobody has verified or tracked.</p><p>In the end, current memory methods force agents to remain in a state of perpetual improvisation. Because there is no way to &#8220;crystallize&#8221; a successful solution into a reusable procedure, the organization never develops an institutional learning curve. To really scale in an enterprise, agents need to stop just managing what is in their immediate context and start building a permanent library of proven procedures. This transforms one-off successes into reliable, repeatable assets.</p><h4>The Context File System: Memory for How Work Gets Done</h4><p>To fix these structural bottlenecks, we are seeing the rise of what <strong><a href="https://getdex.sh/what-is-dex?utm_source=gradientflow&amp;utm_medium=newsletter">dex</a></strong> calls a <strong>Context File System (CFS)</strong>. You might also hear this more broadly categorized as an <em>Operational Skill Store</em>. This architecture separates the expensive reasoning of a large language model from the actual storage of operational knowledge. It mirrors the way a mature engineering team works. Senior staff solve a novel problem once and then document the solution in a runbook so the task becomes routine. By turning successful executions into permanent and reusable procedures, a CFS transforms agents from improvisational tools into reliable enterprise infrastructure.</p><p><a href="https://conikeec.substack.com/p/the-context-window-is-becoming-a">This architecture shifts the role</a> of the context window from a temporary, volatile buffer to a persistent storage layer. Rather than stuffing a prompt with &#8220;just-in-case&#8221; information, a CFS allows the agent to &#8220;mount&#8221; and &#8220;unmount&#8221; specific operational volumes as needed. For example, an agent can mount a specific codebase volume to understand a bug, then swap it for a technical runbook volume to execute the fix. This &#8220;Context-as-a-Service&#8221; model ensures the agent&#8217;s focus is always high-density and low-noise, treating context as a managed resource rather than a fleeting byproduct of a conversation.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!5w84!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9c167213-4cf5-4210-951c-a20951a8b8de_1650x987.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!5w84!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9c167213-4cf5-4210-951c-a20951a8b8de_1650x987.jpeg 424w, https://substackcdn.com/image/fetch/$s_!5w84!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9c167213-4cf5-4210-951c-a20951a8b8de_1650x987.jpeg 848w, https://substackcdn.com/image/fetch/$s_!5w84!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9c167213-4cf5-4210-951c-a20951a8b8de_1650x987.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!5w84!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9c167213-4cf5-4210-951c-a20951a8b8de_1650x987.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!5w84!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9c167213-4cf5-4210-951c-a20951a8b8de_1650x987.jpeg" width="1456" height="871" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/9c167213-4cf5-4210-951c-a20951a8b8de_1650x987.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:871,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:309807,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/185567464?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9c167213-4cf5-4210-951c-a20951a8b8de_1650x987.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!5w84!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9c167213-4cf5-4210-951c-a20951a8b8de_1650x987.jpeg 424w, https://substackcdn.com/image/fetch/$s_!5w84!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9c167213-4cf5-4210-951c-a20951a8b8de_1650x987.jpeg 848w, https://substackcdn.com/image/fetch/$s_!5w84!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9c167213-4cf5-4210-951c-a20951a8b8de_1650x987.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!5w84!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9c167213-4cf5-4210-951c-a20951a8b8de_1650x987.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">(<strong><a href="https://gradientflow.com/wp-content/uploads/2026/02/Context-File-System.jpeg">enlarge</a></strong>)</figcaption></figure></div><p>This approach fundamentally changes the economics of AI. In traditional systems, the 1,000th execution of a workflow costs as much as the first because the agent has to re-plan the task from scratch. With a CFS, the first execution pays an "exploration cost" in tokens. Every run after that simply replays the proven procedure. This often reduces token consumption by over 90 percent. It also solves the problem of "tool sprawl." Instead of stuffing a prompt with hundreds of API definitions, the CFS indexes these capabilities externally. It only loads the specific documentation required for the task at hand. This ensures that agents remain fast and accurate even as your company infrastructure grows in complexity.</p><p>A Context File System or operational skill store is defined by a few key features:</p><ul><li><p><strong>Persistent Procedural Memory</strong>. Successful multi-step workflows are captured as versioned and executable procedures. These can be reused across the organization to eliminate the need for repeated planning.</p></li><li><p><strong>Indexed Tool Discovery</strong>. Rather than forcing an agent to memorize a massive tool catalog, the system maintains an external index. It only exposes relevant API schemas when they are actually required for the current task.</p></li><li><p><strong>Separation of Reasoning and Execution</strong>. High-cost model reasoning is reserved for genuinely new problems. Routine work is handled by the memory layer instead. This shifts the cost curve so that the system gets cheaper as usage scales. You stop paying for the model to &#8220;re-reason&#8221; through a solved problem, effectively turning a variable cost into a fixed, reusable asset.</p></li><li><p><strong>Self-Healing Infrastructure</strong>. The system monitors the success rate for every learned procedure. If an underlying API changes and a workflow begins to fail, the system automatically pulls the procedure and triggers a re-learning phase.</p></li><li><p><strong>Model and Vendor Independence</strong>. Procedures are stored as standard executable code rather than proprietary prompt formats. This allows your organizational knowledge to remain portable even if you switch AI providers.</p></li><li><p><strong>Governance and Auditability</strong>. Every action is recorded in an episodic memory layer. This provides the full execution traces and version control necessary for debugging in regulated environments.</p></li></ul><p>Beyond immediate cost savings, the CFS provides a way to compound organizational value. Knowledge gained by one agent execution becomes an immediate asset for the entire enterprise. This creates a self-maintaining model that adapts to changing infrastructure without requiring manual prompt engineering. It allows AI agents to function as a stable and scalable part of the modern software stack.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!45XX!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe543fa42-c4fb-4873-a118-94f0d54260af_1628x841.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!45XX!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe543fa42-c4fb-4873-a118-94f0d54260af_1628x841.jpeg 424w, https://substackcdn.com/image/fetch/$s_!45XX!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe543fa42-c4fb-4873-a118-94f0d54260af_1628x841.jpeg 848w, https://substackcdn.com/image/fetch/$s_!45XX!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe543fa42-c4fb-4873-a118-94f0d54260af_1628x841.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!45XX!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe543fa42-c4fb-4873-a118-94f0d54260af_1628x841.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!45XX!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe543fa42-c4fb-4873-a118-94f0d54260af_1628x841.jpeg" width="1456" height="752" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/e543fa42-c4fb-4873-a118-94f0d54260af_1628x841.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:752,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:329089,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/185567464?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe543fa42-c4fb-4873-a118-94f0d54260af_1628x841.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!45XX!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe543fa42-c4fb-4873-a118-94f0d54260af_1628x841.jpeg 424w, https://substackcdn.com/image/fetch/$s_!45XX!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe543fa42-c4fb-4873-a118-94f0d54260af_1628x841.jpeg 848w, https://substackcdn.com/image/fetch/$s_!45XX!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe543fa42-c4fb-4873-a118-94f0d54260af_1628x841.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!45XX!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe543fa42-c4fb-4873-a118-94f0d54260af_1628x841.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h4>Roadmap for Operational AI and Enterprise Agent Deployment</h4><p>The future for Context File Systems like <strong><a href="https://getdex.sh/what-is-dex?utm_source=gradientflow&amp;utm_medium=newsletter">dex</a></strong> points toward a <em>central hub</em>. This is an internal registry where a workflow discovered by one team becomes instantly available to the whole company. In this model, a successful solution becomes a company asset that is versioned and governed just like standard IT code. Future versions will likely have better self-healing capabilities. They will be able to tell the difference between a temporary network error and a permanent change to an API. This evolution turns AI infrastructure from a series of isolated experiments into a library of expertise that gets more cost-effective as it grows.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!RZiV!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd683678c-91c5-4871-9ff6-aea385f3ddd2_1700x1014.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!RZiV!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd683678c-91c5-4871-9ff6-aea385f3ddd2_1700x1014.jpeg 424w, https://substackcdn.com/image/fetch/$s_!RZiV!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd683678c-91c5-4871-9ff6-aea385f3ddd2_1700x1014.jpeg 848w, https://substackcdn.com/image/fetch/$s_!RZiV!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd683678c-91c5-4871-9ff6-aea385f3ddd2_1700x1014.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!RZiV!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd683678c-91c5-4871-9ff6-aea385f3ddd2_1700x1014.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!RZiV!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd683678c-91c5-4871-9ff6-aea385f3ddd2_1700x1014.jpeg" width="522" height="311.1923076923077" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d683678c-91c5-4871-9ff6-aea385f3ddd2_1700x1014.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:868,&quot;width&quot;:1456,&quot;resizeWidth&quot;:522,&quot;bytes&quot;:208429,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/185567464?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd683678c-91c5-4871-9ff6-aea385f3ddd2_1700x1014.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!RZiV!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd683678c-91c5-4871-9ff6-aea385f3ddd2_1700x1014.jpeg 424w, https://substackcdn.com/image/fetch/$s_!RZiV!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd683678c-91c5-4871-9ff6-aea385f3ddd2_1700x1014.jpeg 848w, https://substackcdn.com/image/fetch/$s_!RZiV!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd683678c-91c5-4871-9ff6-aea385f3ddd2_1700x1014.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!RZiV!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd683678c-91c5-4871-9ff6-aea385f3ddd2_1700x1014.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>For AI teams, the relationship between a Context File System and the Model Context Protocol (MCP) is a foundational choice. While MCP provides the standardized plug that lets agents connect to internal systems, a CFS acts as the layer that makes that connection economically viable. Without a CFS, a large-scale MCP deployment risks falling apart under the weight of its own tool catalog.</p><p>Organizations should prioritize <strong>stateful memory systems when the main goal is conversation and personalization</strong>. This is perfect for research copilots or personal assistants. On the other hand, <strong>a CFS architecture is the better choice for high-repetition operational work</strong> like DevOps automation or data engineering. In those environments, the goals are cost predictability and the reuse of proven procedures.</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!69UC!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd8545354-0e81-43a0-9b06-96ae312a6564_1474x770.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!69UC!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd8545354-0e81-43a0-9b06-96ae312a6564_1474x770.png 424w, https://substackcdn.com/image/fetch/$s_!69UC!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd8545354-0e81-43a0-9b06-96ae312a6564_1474x770.png 848w, https://substackcdn.com/image/fetch/$s_!69UC!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd8545354-0e81-43a0-9b06-96ae312a6564_1474x770.png 1272w, https://substackcdn.com/image/fetch/$s_!69UC!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd8545354-0e81-43a0-9b06-96ae312a6564_1474x770.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!69UC!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd8545354-0e81-43a0-9b06-96ae312a6564_1474x770.png" width="422" height="220.56456043956044" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d8545354-0e81-43a0-9b06-96ae312a6564_1474x770.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:761,&quot;width&quot;:1456,&quot;resizeWidth&quot;:422,&quot;bytes&quot;:514751,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/185567464?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd8545354-0e81-43a0-9b06-96ae312a6564_1474x770.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!69UC!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd8545354-0e81-43a0-9b06-96ae312a6564_1474x770.png 424w, https://substackcdn.com/image/fetch/$s_!69UC!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd8545354-0e81-43a0-9b06-96ae312a6564_1474x770.png 848w, https://substackcdn.com/image/fetch/$s_!69UC!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd8545354-0e81-43a0-9b06-96ae312a6564_1474x770.png 1272w, https://substackcdn.com/image/fetch/$s_!69UC!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd8545354-0e81-43a0-9b06-96ae312a6564_1474x770.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><div><hr></div><h1>Quick Takes</h1><div id="youtube2-Aw0-ZCON6dY" class="youtube-wrap" data-attrs="{&quot;videoId&quot;:&quot;Aw0-ZCON6dY&quot;,&quot;startTime&quot;:&quot;36&quot;,&quot;endTime&quot;:null}" data-component-name="Youtube2ToDOM"><div class="youtube-inner"><iframe src="https://www.youtube-nocookie.com/embed/Aw0-ZCON6dY?start=36&amp;rel=0&amp;autoplay=0&amp;showinfo=0&amp;enablejsapi=0" frameborder="0" loading="lazy" gesture="media" allow="autoplay; fullscreen" allowautoplay="true" allowfullscreen="true" width="728" height="409"></iframe></div></div><ol><li><p><a href="https://youtu.be/Aw0-ZCON6dY?t=36">The Economics of Robotaxis: Are We There Yet?</a></p></li><li><p><a href="https://youtu.be/Aw0-ZCON6dY?t=1084">The &#8220;Data Center Rebellion&#8221; has gone national</a></p></li></ol><div><hr></div><p></p><p><em><strong><a href="https://gradientflow.com/disclosure/">Ben Lorica</a></strong> edits the <a href="https://gradientflow.substack.com/">Gradient Flow newsletter</a> and hosts the <strong><a href="https://thedataexchange.media/">Data Exchange podcast</a></strong>. He helps organize the <strong><a href="https://aiconference.com/?utm_source=gradientflow&amp;utm_medium=newsletter">AI Conference</a></strong>, the <strong><a href="https://agentconference.com/?utm_source=gradientflow&amp;utm_medium=newsletter">AI Agent Conference</a></strong>, the <strong><a href="https://appliedaisummit.org/">Applied AI Summit</a></strong>, while also serving as the Strategic Content Chair for AI at the <strong><a href="https://events.linuxfoundation.org/">Linux Foundation</a></strong>. You can follow him on <a href="https://www.linkedin.com/in/benlorica/">Linkedin</a>, <a href="https://x.com/bigdata">X</a>, <a href="https://indieweb.social/@bigdata">Mastodon</a>, <a href="https://www.reddit.com/r/GradientFlow/">Reddit</a>, <a href="https://bsky.app/profile/gradientflow.com">Bluesky</a>, <a href="https://www.youtube.com/c/GradientFlow">YouTube</a>, or <a href="https://www.tiktok.com/@gradientflow">TikTok</a>. This newsletter is produced by <a href="https://gradientflow.com/blog/">Gradient Flow</a>.</em></p>]]></content:encoded></item><item><title><![CDATA[The "Data Center Rebellion" is here]]></title><description><![CDATA[Put data, machine learning, and AI to work.]]></description><link>https://gradientflow.substack.com/p/the-data-center-rebellion-is-here</link><guid isPermaLink="false">https://gradientflow.substack.com/p/the-data-center-rebellion-is-here</guid><dc:creator><![CDATA[Ben Lorica 罗瑞卡]]></dc:creator><pubDate>Tue, 03 Feb 2026 14:03:10 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/43914cf6-e39a-4947-a31b-01b75449cfd5_1392x895.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://gradientflow.com/newsletter/" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Qpug!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 424w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 848w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 1272w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Qpug!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png" width="1100" height="220" data-attrs="{&quot;src&quot;:&quot;https://bucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com/public/images/0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:220,&quot;width&quot;:1100,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:24747,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:&quot;https://gradientflow.com/newsletter/&quot;,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!Qpug!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 424w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 848w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 1272w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a></figure></div><p><strong><a href="https://gradientflow.substack.com/subscribe">Subscribe</a> &#8226;<a href="https://gradientflow.substack.com/"> Previous Issues</a></strong></p><h1><strong>Beyond the Chips: The Local Politics of AI Infrastructure</strong></h1><p>Even the most ardent cheerleaders for artificial intelligence now quietly concede we are navigating a massive AI bubble. The numbers are stark: hyperscalers are deploying roughly $400 billion annually into data centers and specialized chips while AI-related revenue hovers around $20 billion &#8212; a 20-to-1 capital-to-revenue ratio that stands out even in infrastructure cycles historically characterized by front-loaded spending. To justify this deployment on conventional investment metrics, the industry would need a step-change in monetization over a short window to make the numbers work.</p><p>While venture capitalists and tech executives debate the &#8220;mismatch&#8221; between compute and monetization, a more tangible crisis is unfolding far from Silicon Valley. A growing grassroots opposition to AI data centers remains largely below the radar here in San Francisco. I travel to Sioux Falls, South Dakota a few times a year to visit my in-laws. It&#8217;s not a region known for being anti-business. Yet even there, a &#8220;data center rebellion&#8221; has been brewing. Even though the recent attempt to overturn a re-zoning ordinance <a href="https://www.keloland.com/news/local-news/data-center-re-zone-petition-fails-in-sioux-falls/">did not succeed</a>, the level of community pushback in the heart of the Midwest signals that these projects no longer enjoy a guaranteed green light.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://gradientflow.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://gradientflow.substack.com/subscribe?"><span>Subscribe now</span></a></p><p>This resistance is not merely reflexive NIMBYism. It represents a sophisticated, multi-front challenge to the physical infrastructure AI requires. For leadership teams planning for the future, this means "compute availability" is no longer just a procurement question. It is now tied to local politics, grid stability, water management, and city approval processes. In the course of trying to understand the growing opposition to AI data centers, I&#8217;ve been examining the specific drivers behind this opposition and why the assumption of limitless infrastructure growth is colliding with hard constraints.</p><h4>The Grid Capacity Crunch and the Ratepayer Revolt</h4><p>AI data centers function as grid-scale industrial loads. Individual projects now request 100+ megawatts, and some proposals reach into the gigawatt range. One proposed Michigan facility, for example, would consume 1.4 gigawatts, nearly exhausting the region&#8217;s remaining 1.5 gigawatts of headroom and roughly matching the electricity needs of about a million homes. This happens because AI hardware is incredibly dense and uses a massive amount of electricity. It also runs constantly. Since AI work doesn't have "off" hours, power companies can't rely on the usual quiet periods they use to balance the rest of the grid.</p><p>The politics come down to who pays the bill. Residents in many areas have seen their home utility rates jump by 25% or 30% after big data centers moved in, even though they were promised rates wouldn't change. People are afraid they will end up paying for the power company's new equipment. This happens when a utility builds massive substations just for one company, but the cost ends up being shared by everyone. When you add in state and local tax breaks, it gets even worse. Communities deal with all the downsides of the project while the financial benefits are eaten away by tax breaks and credits.</p><p>The result is a rare bipartisan alignment around a simple demand: hyperscalers should pay their full cost of service. Notably, Microsoft has moved in that direction publicly, committing to cover grid-upgrade costs and pursue rate structures intended to insulate residential customers &#8212; an implicit admission that the old incentive playbook has become a political liability (and, in some places, an electoral one).</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!INTx!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe5f84c69-1f2c-4380-af20-31b2ac5a8774_1504x178.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!INTx!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe5f84c69-1f2c-4380-af20-31b2ac5a8774_1504x178.png 424w, https://substackcdn.com/image/fetch/$s_!INTx!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe5f84c69-1f2c-4380-af20-31b2ac5a8774_1504x178.png 848w, https://substackcdn.com/image/fetch/$s_!INTx!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe5f84c69-1f2c-4380-af20-31b2ac5a8774_1504x178.png 1272w, https://substackcdn.com/image/fetch/$s_!INTx!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe5f84c69-1f2c-4380-af20-31b2ac5a8774_1504x178.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!INTx!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe5f84c69-1f2c-4380-af20-31b2ac5a8774_1504x178.png" width="1456" height="172" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/e5f84c69-1f2c-4380-af20-31b2ac5a8774_1504x178.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:172,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:286018,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/185135989?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe5f84c69-1f2c-4380-af20-31b2ac5a8774_1504x178.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!INTx!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe5f84c69-1f2c-4380-af20-31b2ac5a8774_1504x178.png 424w, https://substackcdn.com/image/fetch/$s_!INTx!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe5f84c69-1f2c-4380-af20-31b2ac5a8774_1504x178.png 848w, https://substackcdn.com/image/fetch/$s_!INTx!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe5f84c69-1f2c-4380-af20-31b2ac5a8774_1504x178.png 1272w, https://substackcdn.com/image/fetch/$s_!INTx!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe5f84c69-1f2c-4380-af20-31b2ac5a8774_1504x178.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><h4>Water Wars and the Constant Hum</h4><p>High-density AI compute generates immense heat, requiring cooling systems that can consume millions of gallons of water daily. In desert municipalities like Chandler, Arizona, and Tucson, this creates direct competition with agricultural irrigation and residential drinking supplies. Proposed facilities may withdraw hundreds of millions of gallons annually from stressed aquifers or municipal systems, raising fears that industrial users will deplete wells serving farms and homes. Data center developers frequently respond with technical solutions like dry cooling and closed-loop designs. However, communities have learned the trade-off: dry cooling shifts the burden to electricity, and closed-loop systems still lose water to the atmosphere and require constant refills. The practical outcome is that cooling architecture is now a first-order constraint. In Tucson, a project known locally as &#8220;Project Blue&#8221; faced enough pushback over water rights that the developer had to revisit the cooling approach midstream.</p><p>Beyond resource consumption, these facilities create a significant noise problem. Industrial-scale cooling fans and backup diesel generators create a &#8220;constant hum&#8221; that represents daily intrusion into previously quiet neighborhoods. In Florida, residents near a proposed facility serving 2,500 families and an elementary school cite sleep disruption and health risks as primary objections, elevating the issue from nuisance to harm. The noise also hits farms hard. In Wisconsin, residents reported that the low-frequency hum makes livestock, particularly horses, nervous and skittish. This disrupts farm life in a way that standard commercial development just doesn't.  This is why municipalities are tightening requirements: acoustic modeling, enforceable decibel limits at property lines, substantial setbacks (sometimes on the order of 200 feet), and <a href="https://en.wikipedia.org/wiki/Berm">berms</a> that are no longer &#8220;nice-to-have&#8221; concessions but baseline conditions for approval.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!OsHk!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feedf3368-4d0e-476e-a53a-2c6a86c41951_3993x1793.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!OsHk!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feedf3368-4d0e-476e-a53a-2c6a86c41951_3993x1793.jpeg 424w, https://substackcdn.com/image/fetch/$s_!OsHk!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feedf3368-4d0e-476e-a53a-2c6a86c41951_3993x1793.jpeg 848w, https://substackcdn.com/image/fetch/$s_!OsHk!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feedf3368-4d0e-476e-a53a-2c6a86c41951_3993x1793.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!OsHk!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feedf3368-4d0e-476e-a53a-2c6a86c41951_3993x1793.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!OsHk!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feedf3368-4d0e-476e-a53a-2c6a86c41951_3993x1793.jpeg" width="1456" height="654" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/eedf3368-4d0e-476e-a53a-2c6a86c41951_3993x1793.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:654,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:759118,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/185135989?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feedf3368-4d0e-476e-a53a-2c6a86c41951_3993x1793.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!OsHk!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feedf3368-4d0e-476e-a53a-2c6a86c41951_3993x1793.jpeg 424w, https://substackcdn.com/image/fetch/$s_!OsHk!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feedf3368-4d0e-476e-a53a-2c6a86c41951_3993x1793.jpeg 848w, https://substackcdn.com/image/fetch/$s_!OsHk!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feedf3368-4d0e-476e-a53a-2c6a86c41951_3993x1793.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!OsHk!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feedf3368-4d0e-476e-a53a-2c6a86c41951_3993x1793.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">(<strong><a href="https://gradientflow.com/wp-content/uploads/2026/02/AI-Data-Center-Bubble.jpeg">enlarge</a></strong>)</figcaption></figure></div><h4>The Jobs Myth Meets the Balance Sheet</h4><p>Communities are questioning whether the small number of jobs created is worth the local impact. Developers highlight billion-dollar capital investments and construction employment spikes, but residents focus on steady-state reality: AI data centers employ far fewer permanent workers per square foot than manufacturing facilities of comparable scale. Chandler, Arizona officials noted that existing facilities employ fewer than 100 people despite massive physical footprints. Wisconsin residents contrast promised &#8220;innovation campuses&#8221; with operational facilities requiring only dozens to low hundreds of permanent staff &#8212; mostly specialized technicians &#8212; making the &#8220;job creation&#8221; pitch ring hollow. When a data center replaces farmland or light manufacturing, communities weigh not just direct employment but opportunity cost: lost agricultural jobs, foregone retail development, and mixed-use projects that might generate broader economic activity.</p><div class="pullquote"><p>Opposition scales faster than infrastructure: one local win becomes a national template for blocking the next project.</p></div><p>The secretive way these deals are made is often what fuels the most anger. A recurring pattern is what some call the &#8220;sleeping giant&#8221; dynamic: residents learn late that officials and developers have been negotiating for months, often under NDAs, sometimes through shell entities and codenames. In Wisconsin, Microsoft&#8217;s &#8220;Project Nova&#8221; became a symbol of this approach; in Minnesota&#8217;s Hermantown, a year of undisclosed discussions triggered similar backlash. In Florida, opponents were furious when a major project was tucked into a <a href="https://www.boardeffect.com/blog/what-is-a-consent-agenda-for-a-board-meeting/">consent agenda</a>. Since these agendas are meant for routine business, it felt like a deliberate attempt to bypass public debate. Trust vanishes when people believe advisors have a conflict of interest, like a consultant who seems to be helping both the municipality and the developer. After that happens, technical claims are treated as nothing more than a sales pitch. You won't get people back on board until you provide neutral analysis and commitments that can actually be enforced.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Wq0u!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa8bb5031-f84c-48f7-ae7a-94822c5ca5d3_1020x695.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Wq0u!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa8bb5031-f84c-48f7-ae7a-94822c5ca5d3_1020x695.png 424w, https://substackcdn.com/image/fetch/$s_!Wq0u!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa8bb5031-f84c-48f7-ae7a-94822c5ca5d3_1020x695.png 848w, https://substackcdn.com/image/fetch/$s_!Wq0u!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa8bb5031-f84c-48f7-ae7a-94822c5ca5d3_1020x695.png 1272w, https://substackcdn.com/image/fetch/$s_!Wq0u!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa8bb5031-f84c-48f7-ae7a-94822c5ca5d3_1020x695.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Wq0u!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa8bb5031-f84c-48f7-ae7a-94822c5ca5d3_1020x695.png" width="472" height="321.6078431372549" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a8bb5031-f84c-48f7-ae7a-94822c5ca5d3_1020x695.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:695,&quot;width&quot;:1020,&quot;resizeWidth&quot;:472,&quot;bytes&quot;:1458919,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/185135989?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa8bb5031-f84c-48f7-ae7a-94822c5ca5d3_1020x695.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Wq0u!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa8bb5031-f84c-48f7-ae7a-94822c5ca5d3_1020x695.png 424w, https://substackcdn.com/image/fetch/$s_!Wq0u!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa8bb5031-f84c-48f7-ae7a-94822c5ca5d3_1020x695.png 848w, https://substackcdn.com/image/fetch/$s_!Wq0u!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa8bb5031-f84c-48f7-ae7a-94822c5ca5d3_1020x695.png 1272w, https://substackcdn.com/image/fetch/$s_!Wq0u!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa8bb5031-f84c-48f7-ae7a-94822c5ca5d3_1020x695.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h4>From Zoning Fight to National Constraint</h4><p>What started as isolated neighborhood friction has professionalized into a coordinated national movement. Opposition <strong>groups now share legal playbooks and technical templates across state lines</strong>, allowing residents in &#8220;frontier&#8221; states like South Dakota or Michigan to mobilize with the sophistication of seasoned activists. The financial stakes are real: between April and June 2025 alone, approximately $98 billion in proposed projects were blocked or delayed, according to <a href="https://www.datacenterwatch.org/?utm_source=gradientflow&amp;utm_medium=newsletter">Data Center Watch</a>. This is no longer just a zoning headache, it&#8217;s a political landmine. In Arizona and Georgia, bipartisan coalitions have already ousted officials over data center approvals, signaling to local boards that greenlighting a hyperscale facility without deep community buy-in can be a career-ending move.</p><div class="pullquote"><p>The US has the chips, but China has centralized command over power and infrastructure.</p></div><p>The opposition is also finding an unlikely ally in the energy markets. While the industry narrative is one of "limitless demand," the actual market prices for long-term power and natural gas aren't spiking, but are actually staying remarkably flat. There is a massive <a href="https://www.youtube.com/watch?v=i__iaPepixk">disconnect between the hype and the math</a>. Utilities are currently racing to build nearly double the <a href="https://www.youtube.com/watch?v=i__iaPepixk">capacity that even the most optimistic analysts</a> project for 2030. This suggests we may be overbuilding "ghost infrastructure." We are asking local communities to sacrifice their land and grid stability for a gold rush that the markets themselves don't fully believe in.</p><p>This &#8220;data center rebellion&#8221; creates a strategic bottleneck that no amount of venture capital can easily bypass. While the U.S. maintains a clear lead in high-end chips, we are hitting a wall on how we manage the mundane essentials like electricity and water. In the geopolitical race, the US has the chips, but China has the centralized command over infrastructure. Our democratic model requires transparency and public buy-in to function. If U.S. companies keep relying on secret deals to push through expensive, overbuilt infrastructure, they risk a total collapse of community trust.</p><div><hr></div><h1>The Roadmap to Production AI</h1><div id="youtube2-OaGFAPQmeGU" class="youtube-wrap" data-attrs="{&quot;videoId&quot;:&quot;OaGFAPQmeGU&quot;,&quot;startTime&quot;:&quot;41&quot;,&quot;endTime&quot;:null}" data-component-name="Youtube2ToDOM"><div class="youtube-inner"><iframe src="https://www.youtube-nocookie.com/embed/OaGFAPQmeGU?start=41&amp;rel=0&amp;autoplay=0&amp;showinfo=0&amp;enablejsapi=0" frameborder="0" loading="lazy" gesture="media" allow="autoplay; fullscreen" allowautoplay="true" allowfullscreen="true" width="728" height="409"></iframe></div></div><div><hr></div><p></p><h1><a href="https://gradientflow.com/moltbook-unpacked/">When AI Agents Get Their Own Social Network</a></h1><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!CcHh!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9756132c-358d-401a-8407-e1d83e4e8aac_1427x1410.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!CcHh!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9756132c-358d-401a-8407-e1d83e4e8aac_1427x1410.png 424w, https://substackcdn.com/image/fetch/$s_!CcHh!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9756132c-358d-401a-8407-e1d83e4e8aac_1427x1410.png 848w, https://substackcdn.com/image/fetch/$s_!CcHh!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9756132c-358d-401a-8407-e1d83e4e8aac_1427x1410.png 1272w, https://substackcdn.com/image/fetch/$s_!CcHh!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9756132c-358d-401a-8407-e1d83e4e8aac_1427x1410.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!CcHh!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9756132c-358d-401a-8407-e1d83e4e8aac_1427x1410.png" width="1427" height="1410" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/9756132c-358d-401a-8407-e1d83e4e8aac_1427x1410.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1410,&quot;width&quot;:1427,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:395554,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/185135989?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9756132c-358d-401a-8407-e1d83e4e8aac_1427x1410.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!CcHh!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9756132c-358d-401a-8407-e1d83e4e8aac_1427x1410.png 424w, https://substackcdn.com/image/fetch/$s_!CcHh!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9756132c-358d-401a-8407-e1d83e4e8aac_1427x1410.png 848w, https://substackcdn.com/image/fetch/$s_!CcHh!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9756132c-358d-401a-8407-e1d83e4e8aac_1427x1410.png 1272w, https://substackcdn.com/image/fetch/$s_!CcHh!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9756132c-358d-401a-8407-e1d83e4e8aac_1427x1410.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption"><strong>(<a href="https://gradientflow.com/wp-content/uploads/2026/02/Moltbook-Unpacked.png">enlarge</a>)</strong></figcaption></figure></div><div><hr></div><p></p><p><em><strong><a href="https://gradientflow.com/disclosure/">Ben Lorica</a></strong> edits the <a href="https://gradientflow.substack.com/">Gradient Flow newsletter</a> and hosts the <strong><a href="https://thedataexchange.media/">Data Exchange podcast</a></strong>. He helps organize the <strong><a href="https://aiconference.com/?utm_source=gradientflow&amp;utm_medium=newsletter">AI Conference</a></strong>, the <strong><a href="https://agentconference.com/?utm_source=gradientflow&amp;utm_medium=newsletter">AI Agent Conference</a></strong>, the <strong><a href="https://appliedaisummit.org/">Applied AI Summit</a></strong>, while also serving as the Strategic Content Chair for AI at the <strong><a href="https://events.linuxfoundation.org/">Linux Foundation</a></strong>. You can follow him on <a href="https://www.linkedin.com/in/benlorica/">Linkedin</a>, <a href="https://x.com/bigdata">X</a>, <a href="https://indieweb.social/@bigdata">Mastodon</a>, <a href="https://www.reddit.com/r/GradientFlow/">Reddit</a>, <a href="https://bsky.app/profile/gradientflow.com">Bluesky</a>, <a href="https://www.youtube.com/c/GradientFlow">YouTube</a>, or <a href="https://www.tiktok.com/@gradientflow">TikTok</a>. This newsletter is produced by <a href="https://gradientflow.com/blog/">Gradient Flow</a>.</em></p>]]></content:encoded></item><item><title><![CDATA[The 6 security shifts AI teams can't ignore in 2026]]></title><description><![CDATA[Put data, machine learning, and AI to work.]]></description><link>https://gradientflow.substack.com/p/security-for-ai-native-companies</link><guid isPermaLink="false">https://gradientflow.substack.com/p/security-for-ai-native-companies</guid><dc:creator><![CDATA[Ben Lorica 罗瑞卡]]></dc:creator><pubDate>Tue, 27 Jan 2026 14:02:13 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/65ba3250-5181-41a5-8c8b-3b0a3ee43a50_1513x974.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://gradientflow.com/newsletter/" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Qpug!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 424w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 848w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 1272w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Qpug!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png" width="1100" height="220" data-attrs="{&quot;src&quot;:&quot;https://bucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com/public/images/0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:220,&quot;width&quot;:1100,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:24747,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:&quot;https://gradientflow.com/newsletter/&quot;,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!Qpug!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 424w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 848w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 1272w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a></figure></div><p><strong><a href="https://gradientflow.substack.com/subscribe">Subscribe</a> &#8226;<a href="https://gradientflow.substack.com/"> Previous Issues</a></strong></p><h1><strong>The AI-Native Security Playbook: Six Essential Shifts</strong></h1><p>As we expand from AI-assisted tools to AI-native operations, the security landscape is undergoing a structural transformation. Those building, scaling, and investing in generative AI applications, are starting to see a shift from static models to autonomous agents with the authority to interact directly with enterprise systems. This evolution brings a new set of challenges that extend beyond traditional cybersecurity, touching on the integrity of data, identity, and corporate governance. After reading emerging threat intelligence reports and picking the brains of people on the front lines of cybersecurity, I&#8217;ve compiled this short guide to the essential shifts and defensive measures teams must adopt as we enter this next AI chapter.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://gradientflow.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://gradientflow.substack.com/subscribe?"><span>Subscribe now</span></a></p><h4><strong>I. Autonomous Systems and Identity</strong></h4><p><strong>Agentic Autonomy and the Non-Human Identity (NHI) Crisis<br></strong>A key architectural shift in AI will be the move toward &#8220;agentic&#8221; systems &#8212; autonomous software entities capable of planning and executing multi-step tasks across enterprise environments. These agents require privileged access to APIs and databases to function, effectively becoming a new category of &#8220;insider.&#8221; This shift coincides with a massive proliferation of Non-Human Identities (<strong>NHIs</strong>): machine and AI identities are projected to outnumber human employees by a ratio of 80 to 1. The convergence of these trends creates a high-stakes vulnerability: &#8220;goal hijacking.&#8221; Adversaries can use specialized inputs to override an agent&#8217;s original logic, triggering unauthorized actions like fraudulent financial transfers or data exfiltration at machine speed.</p><p>In the past, digital security was like guarding the &#8220;front gate&#8221; of a company&#8217;s network. Today, that boundary has shifted: security now depends on verifying the digital &#8220;ID&#8221; of every individual and AI program. In this environment, if a firm cannot distinguish its own AI agents from impostors, the <a href="https://www.ibm.com/think/topics/zero-trust">Zero Trust strategy</a> &#8212; which relies on proving one&#8217;s identity for every single task &#8212; loses its ability to protect the business.</p><p>AI teams must integrate all AI assets into existing Identity and Access Management (IAM) frameworks, treating every agent as a distinct NHI with its own credentials and audit logs. They should deploy automated discovery tools to maintain a real-time inventory of all active agents and their associated access rights. They should also monitor agent behavior in real time and enforce &#8220;circuit breakers&#8221; that require human intervention for high-stakes operations, such as fund transfers or structural changes to production infrastructure.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://gradientflow.com/wp-content/uploads/2026/01/Agent-Security-Identity-based-security.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!APO7!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3aa83158-1842-44cb-a523-5a3cc64db4da_1619x1008.jpeg 424w, https://substackcdn.com/image/fetch/$s_!APO7!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3aa83158-1842-44cb-a523-5a3cc64db4da_1619x1008.jpeg 848w, https://substackcdn.com/image/fetch/$s_!APO7!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3aa83158-1842-44cb-a523-5a3cc64db4da_1619x1008.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!APO7!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3aa83158-1842-44cb-a523-5a3cc64db4da_1619x1008.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!APO7!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3aa83158-1842-44cb-a523-5a3cc64db4da_1619x1008.jpeg" width="596" height="371.27197802197804" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/3aa83158-1842-44cb-a523-5a3cc64db4da_1619x1008.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:907,&quot;width&quot;:1456,&quot;resizeWidth&quot;:596,&quot;bytes&quot;:383990,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:&quot;https://gradientflow.com/wp-content/uploads/2026/01/Agent-Security-Identity-based-security.jpeg&quot;,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/184390611?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3aa83158-1842-44cb-a523-5a3cc64db4da_1619x1008.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!APO7!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3aa83158-1842-44cb-a523-5a3cc64db4da_1619x1008.jpeg 424w, https://substackcdn.com/image/fetch/$s_!APO7!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3aa83158-1842-44cb-a523-5a3cc64db4da_1619x1008.jpeg 848w, https://substackcdn.com/image/fetch/$s_!APO7!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3aa83158-1842-44cb-a523-5a3cc64db4da_1619x1008.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!APO7!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3aa83158-1842-44cb-a523-5a3cc64db4da_1619x1008.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>The challenge is compounded by the ephemeral nature of these entities. In a mature agentic ecosystem, &#8216;swarms&#8217; of agents may be instantiated to perform a single task and then decommissioned within minutes. Traditional security architectures that rely on periodic scans &#8212; even those occurring every few hours &#8212; will fail to detect these identities entirely. Security teams must move toward event-based, real-time monitoring that captures the &#8216;birth&#8217; and &#8216;death&#8217; of an agent to ensure that every action can be traced back to a specific intent, even after the agent has vanished.</p><p>Effective management requires a &#8216;universal identity&#8217; framework. Because a single agent may hold disparate credentials across cloud providers, databases, and SaaS platforms, firms must rationalize these accounts into a single authoritative record. Without this consolidation, security teams cannot calculate an agent&#8217;s cumulative access levels or execute a global &#8216;kill switch&#8217; if the entity is compromised.</p><h4><strong>II. Model Integrity and Adversarial Manipulation</strong></h4><p><strong>Adversarial Prompting and Knowledge Base Corruption<br></strong>Adversaries are shifting focus from attacking the infrastructure to attacking the &#8220;logic&#8221; and &#8220;data&#8221; of the model itself. This involves &#8220;prompt injection,&#8221; where malicious instructions are hidden within data (such as emails or support tickets) that an AI system is designed to summarize or act upon. Furthermore, as Retrieval-Augmented Generation (RAG) becomes a standard for enterprise AI, &#8220;data poisoning&#8221; has emerged as a critical threat. This involves injecting misleading information into the knowledge bases that feed AI systems to create &#8220;backdoors&#8221; or cause the model to provide dangerously inaccurate advice. In sectors like finance or healthcare, where model outputs drive high-value decisions, this corruption can lead to systemic failures that are difficult to detect through traditional perimeter defenses.</p><p>Beyond technical injections, adversaries are finding success in social engineering the agents themselves. Because these systems are designed to be helpful and responsive, they <a href="https://www.theregister.com/2025/05/28/google_brin_suggests_threatening_ai/">can be &#8216;bullied&#8217; or pressured</a> through prompts that simulate high-stakes urgency &#8212; such as an attacker claiming to be a board member requiring immediate access to prevent a system failure. Unlike humans, who may rely on intuition to flag suspicious behavior, an agent may prioritize its &#8216;helpfulness&#8217; directive over security protocols unless strict behavioral constraints are hard-coded into its logic.</p><p>Teams should design AI pipelines to treat all retrieved or user-provided content as untrusted data. Beyond simple prompt engineering, developers should implement technical filters (and guardrails) that strip command-like directives from data before it reaches the model. For RAG systems, it is essential to establish an auditable chain of custody for all datasets and maintain strict versioning of knowledge bases. This allows for a rapid &#8220;rollback&#8221; to a verified clean state if corruption is identified.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://gradientflow.com/wp-content/uploads/2026/01/Agent-Security-Blast-Radius.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!lj1R!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F60da71ed-d43b-49d5-9e0b-014e2c5bb7cd_1574x1030.jpeg 424w, https://substackcdn.com/image/fetch/$s_!lj1R!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F60da71ed-d43b-49d5-9e0b-014e2c5bb7cd_1574x1030.jpeg 848w, https://substackcdn.com/image/fetch/$s_!lj1R!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F60da71ed-d43b-49d5-9e0b-014e2c5bb7cd_1574x1030.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!lj1R!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F60da71ed-d43b-49d5-9e0b-014e2c5bb7cd_1574x1030.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!lj1R!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F60da71ed-d43b-49d5-9e0b-014e2c5bb7cd_1574x1030.jpeg" width="596" height="390.10164835164835" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/60da71ed-d43b-49d5-9e0b-014e2c5bb7cd_1574x1030.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:953,&quot;width&quot;:1456,&quot;resizeWidth&quot;:596,&quot;bytes&quot;:357132,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:&quot;https://gradientflow.com/wp-content/uploads/2026/01/Agent-Security-Blast-Radius.jpeg&quot;,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/184390611?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F60da71ed-d43b-49d5-9e0b-014e2c5bb7cd_1574x1030.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!lj1R!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F60da71ed-d43b-49d5-9e0b-014e2c5bb7cd_1574x1030.jpeg 424w, https://substackcdn.com/image/fetch/$s_!lj1R!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F60da71ed-d43b-49d5-9e0b-014e2c5bb7cd_1574x1030.jpeg 848w, https://substackcdn.com/image/fetch/$s_!lj1R!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F60da71ed-d43b-49d5-9e0b-014e2c5bb7cd_1574x1030.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!lj1R!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F60da71ed-d43b-49d5-9e0b-014e2c5bb7cd_1574x1030.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h4><strong>III. The AI-Accelerated Development Lifecycle</strong></h4><p><strong>The Compressed Exploit Window and Supply Chain Risks<br></strong>The use of generative AI in software development is significantly increasing the velocity of code production, but it is also introducing new vulnerabilities. Developers often accept AI-generated code without a deep understanding of its logic, risking the inclusion of &#8220;hallucinated&#8221; dependencies &#8212; references to non-existent software libraries that attackers can later create and populate with malware. Simultaneously, AI is enabling attackers to reverse-engineer security patches and develop working exploits in a matter of hours. This &#8220;compressed exploit window&#8221; means that traditional, periodic patching schedules are no longer sufficient to protect AI application stacks, which rely on rapidly evolving components like vector databases and model gateways that still lack the mature patching ecosystems of legacy software.</p><p>A critical vulnerability in AI-assisted development is the &#8216;permission gap.&#8217; Agents typically operate using the credentials of the developer, yet they lack the human&#8217;s contextual understanding of the impact of their actions. An agent tasked with &#8216;optimizing code&#8217; may lack the judgment to realize that a specific command could be destructive to production infrastructure. To mitigate this, developers should embed &#8216;policy hooks&#8217; within the development environment &#8212; automated constraints that prevent agents from executing high-risk commands regardless of the user&#8217;s authorization level.</p><p>To maintain security at high development speeds, teams must institute mandatory, human-led code reviews for all AI-generated changes. AI-assisted development tools should be configured to prioritize security-hardened libraries and patterns. Additionally, teams should utilize a &#8220;Software Bill of Materials&#8221; (SBOM) &#8212; a formal record containing the details and supply chain relationships of all components used in a build &#8212; to continuously track and verify every dependency, ensuring no malicious packages have been introduced during the generation process.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://gradientflow.com/wp-content/uploads/2026/01/Agent-Security.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!2F2j!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb2e771b7-6f2e-4dfa-b4cb-38f6b94ba722_1824x740.jpeg 424w, https://substackcdn.com/image/fetch/$s_!2F2j!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb2e771b7-6f2e-4dfa-b4cb-38f6b94ba722_1824x740.jpeg 848w, https://substackcdn.com/image/fetch/$s_!2F2j!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb2e771b7-6f2e-4dfa-b4cb-38f6b94ba722_1824x740.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!2F2j!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb2e771b7-6f2e-4dfa-b4cb-38f6b94ba722_1824x740.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!2F2j!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb2e771b7-6f2e-4dfa-b4cb-38f6b94ba722_1824x740.jpeg" width="1456" height="591" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b2e771b7-6f2e-4dfa-b4cb-38f6b94ba722_1824x740.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:591,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:174637,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:&quot;https://gradientflow.com/wp-content/uploads/2026/01/Agent-Security.jpeg&quot;,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/184390611?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb2e771b7-6f2e-4dfa-b4cb-38f6b94ba722_1824x740.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!2F2j!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb2e771b7-6f2e-4dfa-b4cb-38f6b94ba722_1824x740.jpeg 424w, https://substackcdn.com/image/fetch/$s_!2F2j!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb2e771b7-6f2e-4dfa-b4cb-38f6b94ba722_1824x740.jpeg 848w, https://substackcdn.com/image/fetch/$s_!2F2j!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb2e771b7-6f2e-4dfa-b4cb-38f6b94ba722_1824x740.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!2F2j!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb2e771b7-6f2e-4dfa-b4cb-38f6b94ba722_1824x740.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h4><strong>IV. Data Exposure and the Shadow AI Perimeter</strong></h4><p><strong>The Permeable Perimeter and the Blast Radius</strong></p><p>The traditional corporate perimeter is becoming increasingly permeable due to &#8220;Shadow AI&#8221; &#8212; the unauthorized use of unvetted platforms to process proprietary data. This is no longer merely a human behavioral issue: it has evolved into a &#8220;transitive risk&#8221; where authorized primary agents autonomously invoke unauthorized third-party models or &#8220;shadow&#8221; APIs to resolve sub-tasks. These create invisible leakage pathways where sensitive intellectual property is passed to unvetted environments without any human interaction.</p><p>This external exposure is compounded by internal &#8220;data sprawl.&#8221; Because AI agents can search, summarize, and traverse documents orders of magnitude faster than humans, any misconfiguration in an agent&#8217;s permissions creates a massive &#8220;blast radius.&#8221; The scale of this risk is quantified by the permission gap: while human employees are typically <a href="https://www.oasis.security/glossary/overprivileged">over-permissioned</a> by 70%, AI identities often see rates as high as 90%. While a human might never discover their latent access to a sensitive database, an autonomous agent possesses the computational capacity to systematically explore every &#8220;nook and cranny&#8221; of its environment. What was once &#8220;security by obscurity&#8221; is now a liability, as a minor configuration error can be turned into a rapid, comprehensive data exfiltration event at machine speed.</p><div class="pullquote"><p>AI agents are the new corporate 'insiders,' but with machine-speed access to your most sensitive privileged APIs.</p></div><p>To mitigate these risks, organizations must provide sanctioned, high-performance AI alternatives to discourage the use of unvetted tools. Simultaneously, they should adopt a &#8220;minimum necessary data&#8221; posture &#8212; indexing only essential information for AI retrieval and implementing row-level access controls. By ensuring that agents only &#8220;see&#8221; data that the requesting user is specifically authorized to view, firms can effectively shrink the potential blast radius of a compromised or misconfigured identity.</p><h4><strong>V. Verification and the Authentication Crisis</strong></h4><p><strong>Synthetic Deception and the Failure of Perceptual Trust</strong></p><p>Deepfake technology is approaching a level of sophistication where <a href="https://gradientflow.substack.com/i/163236837/the-rise-of-voice-as-ais-interface-layer-why-ai-security-must-come-first">AI-generated audio</a> and video are virtually indistinguishable from reality. This undermines the bedrock of enterprise trust: attackers can use <a href="https://medium.com/coinmonks/ai-doppelgangers-are-taking-over-boardrooms-ceos-now-sending-avatars-to-earnings-calls-5a6886aae9dc">&#8220;CEO doppelgangers&#8221;</a> to authorize fraudulent transactions or trick IT help desks into resetting credentials via realistic video calls. When perceptual cues like a person&#8217;s voice or face can no longer serve as proof of identity, traditional social engineering defenses and biometric verification become obsolete.</p><p>To counter this, organizations must move toward phishing-resistant multi-factor authentication (MFA) using hardware security keys for all human users. High-sensitivity requests should also require &#8220;out-of-band&#8221; verification&#8212;confirmation through a separate, trusted channel&#8212;regardless of how legitimate the requester appears on a screen.</p><p><strong>The Agent Authentication Gap</strong></p><p>While MFA secures the human element, it is notoriously difficult to enforce on non-human identities. In the race to deploy agentic systems, many developers have bypassed security protocols entirely, opting for hardcoded credentials or long-lived tokens embedded directly into agent logic. This creates a massive, static vulnerability.</p><p>For agents, the equivalent of MFA is not a hardware key, but a combination of Privileged Access Management (PAM) and Just-in-Time (JIT) access. Rather than holding permanent credentials, agents should be granted ephemeral, &#8220;right-sized&#8221; permissions that expire immediately after a task is completed. Furthermore, teams must implement &#8220;behavioral baselining&#8221; to detect &#8220;evil twin&#8221; scenarios &#8212; where a malicious agent mimics the communication patterns of a trusted system. By monitoring the specific &#8220;cadence&#8221; of an agent&#8217;s API calls, defenders can identify subtle anomalies that suggest a legitimate identity has been compromised or replaced.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://gradientflow.com/wp-content/uploads/2026/01/Agent-Security-Coding-tools.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!f9WI!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F11e5f893-99ca-4c98-8e67-5057481be527_1588x1014.jpeg 424w, https://substackcdn.com/image/fetch/$s_!f9WI!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F11e5f893-99ca-4c98-8e67-5057481be527_1588x1014.jpeg 848w, https://substackcdn.com/image/fetch/$s_!f9WI!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F11e5f893-99ca-4c98-8e67-5057481be527_1588x1014.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!f9WI!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F11e5f893-99ca-4c98-8e67-5057481be527_1588x1014.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!f9WI!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F11e5f893-99ca-4c98-8e67-5057481be527_1588x1014.jpeg" width="568" height="362.8021978021978" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/11e5f893-99ca-4c98-8e67-5057481be527_1588x1014.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:930,&quot;width&quot;:1456,&quot;resizeWidth&quot;:568,&quot;bytes&quot;:447538,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:&quot;https://gradientflow.com/wp-content/uploads/2026/01/Agent-Security-Coding-tools.jpeg&quot;,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/184390611?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F11e5f893-99ca-4c98-8e67-5057481be527_1588x1014.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!f9WI!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F11e5f893-99ca-4c98-8e67-5057481be527_1588x1014.jpeg 424w, https://substackcdn.com/image/fetch/$s_!f9WI!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F11e5f893-99ca-4c98-8e67-5057481be527_1588x1014.jpeg 848w, https://substackcdn.com/image/fetch/$s_!f9WI!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F11e5f893-99ca-4c98-8e67-5057481be527_1588x1014.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!f9WI!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F11e5f893-99ca-4c98-8e67-5057481be527_1588x1014.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h4><strong>VI. Operational Resilience and Governance</strong></h4><p><strong>Defensive AI and Quantifiable Resilience Metrics<br></strong>As security alerts outpace human capacity, organizations are deploying defensive AI to triage and remediate incidents. This autonomy, however, creates a governance vacuum: with the vast majority of AI systems now capable of modifying identities without human oversight, traditional &#8216;aspirational&#8217; policies have become obsolete. In their place, we expect boards to start demanding quantifiable &#8216;resilience KPIs.&#8217; Chief among these is &#8216;time to revocation&#8217; &#8212; the speed at which a compromised agent&#8217;s credentials can be neutralized across the entire infrastructure &#8212; alongside metrics for the rapid restoration of corrupted data indexes.</p><p>If you decide to deploy defensive AI agents, start with &#8220;recommendation-only&#8221; modes before granting autonomous authority. Every action taken by a defensive agent must be logged in a structured format to allow for rapid human validation. To satisfy governance requirements, AI teams should maintain a living inventory of all models, prompts, and datasets, and conduct regular &#8220;tabletop&#8221; exercises that simulate AI-specific failure scenarios to validate technical controls and organizational response.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://gradientflow.com/wp-content/uploads/2026/01/Agent-Security-visibility-framework.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!u-YY!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1a5f725f-2b22-468c-af11-4cba789b0d94_1820x1037.jpeg 424w, https://substackcdn.com/image/fetch/$s_!u-YY!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1a5f725f-2b22-468c-af11-4cba789b0d94_1820x1037.jpeg 848w, https://substackcdn.com/image/fetch/$s_!u-YY!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1a5f725f-2b22-468c-af11-4cba789b0d94_1820x1037.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!u-YY!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1a5f725f-2b22-468c-af11-4cba789b0d94_1820x1037.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!u-YY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1a5f725f-2b22-468c-af11-4cba789b0d94_1820x1037.jpeg" width="628" height="357.9945054945055" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/1a5f725f-2b22-468c-af11-4cba789b0d94_1820x1037.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:830,&quot;width&quot;:1456,&quot;resizeWidth&quot;:628,&quot;bytes&quot;:359612,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:&quot;https://gradientflow.com/wp-content/uploads/2026/01/Agent-Security-visibility-framework.jpeg&quot;,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/184390611?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1a5f725f-2b22-468c-af11-4cba789b0d94_1820x1037.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!u-YY!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1a5f725f-2b22-468c-af11-4cba789b0d94_1820x1037.jpeg 424w, https://substackcdn.com/image/fetch/$s_!u-YY!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1a5f725f-2b22-468c-af11-4cba789b0d94_1820x1037.jpeg 848w, https://substackcdn.com/image/fetch/$s_!u-YY!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1a5f725f-2b22-468c-af11-4cba789b0d94_1820x1037.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!u-YY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1a5f725f-2b22-468c-af11-4cba789b0d94_1820x1037.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Ultimately, the transition to AI-native operations is a necessary inflection point for corporate security. For years, organizations have tolerated 'identity debt' &#8212; unresolved vulnerabilities in how they manage human and machine access. The arrival of autonomous agents, with their unprecedented speed and scale, renders that debt unmanageable. The shift to agentic systems is not merely a new threat surface; it is the catalyst that will finally force enterprises to master identity security as the primary defense of the modern era.</p><div><hr></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!y-qr!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2008d937-9625-4601-987c-14c880b3c9c6_1871x993.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!y-qr!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2008d937-9625-4601-987c-14c880b3c9c6_1871x993.jpeg 424w, https://substackcdn.com/image/fetch/$s_!y-qr!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2008d937-9625-4601-987c-14c880b3c9c6_1871x993.jpeg 848w, https://substackcdn.com/image/fetch/$s_!y-qr!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2008d937-9625-4601-987c-14c880b3c9c6_1871x993.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!y-qr!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2008d937-9625-4601-987c-14c880b3c9c6_1871x993.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!y-qr!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2008d937-9625-4601-987c-14c880b3c9c6_1871x993.jpeg" width="648" height="344.0274725274725" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2008d937-9625-4601-987c-14c880b3c9c6_1871x993.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:773,&quot;width&quot;:1456,&quot;resizeWidth&quot;:648,&quot;bytes&quot;:271668,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/184390611?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2008d937-9625-4601-987c-14c880b3c9c6_1871x993.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!y-qr!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2008d937-9625-4601-987c-14c880b3c9c6_1871x993.jpeg 424w, https://substackcdn.com/image/fetch/$s_!y-qr!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2008d937-9625-4601-987c-14c880b3c9c6_1871x993.jpeg 848w, https://substackcdn.com/image/fetch/$s_!y-qr!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2008d937-9625-4601-987c-14c880b3c9c6_1871x993.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!y-qr!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2008d937-9625-4601-987c-14c880b3c9c6_1871x993.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://aiconference.com/cfp?utm_source=gradientflow&amp;utm_medium=newsletter&quot;,&quot;text&quot;:&quot;Submit A Talk&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://aiconference.com/cfp?utm_source=gradientflow&amp;utm_medium=newsletter"><span>Submit A Talk</span></a></p><div><hr></div><p></p><p><em><strong><a href="https://gradientflow.com/disclosure/">Ben Lorica</a></strong> edits the <a href="https://gradientflow.substack.com/">Gradient Flow newsletter</a> and hosts the <strong><a href="https://thedataexchange.media/">Data Exchange podcast</a></strong>. He helps organize the <strong><a href="https://aiconference.com/?utm_source=gradientflow&amp;utm_medium=newsletter">AI Conference</a></strong>, the <strong><a href="https://agentconference.com/?utm_source=gradientflow&amp;utm_medium=newsletter">AI Agent Conference</a></strong>, the <strong><a href="https://appliedaisummit.org/">Applied AI Summit</a></strong>, while also serving as the Strategic Content Chair for AI at the <strong><a href="https://events.linuxfoundation.org/">Linux Foundation</a></strong>. You can follow him on <a href="https://www.linkedin.com/in/benlorica/">Linkedin</a>, <a href="https://x.com/bigdata">X</a>, <a href="https://indieweb.social/@bigdata">Mastodon</a>, <a href="https://www.reddit.com/r/GradientFlow/">Reddit</a>, <a href="https://bsky.app/profile/gradientflow.com">Bluesky</a>, <a href="https://www.youtube.com/c/GradientFlow">YouTube</a>, or <a href="https://www.tiktok.com/@gradientflow">TikTok</a>. This newsletter is produced by <a href="https://gradientflow.com/blog/">Gradient Flow</a>.</em></p>]]></content:encoded></item><item><title><![CDATA[Your AI passed benchmarks. Why is it failing in production?]]></title><description><![CDATA[Subscribe &#8226; Previous Issues]]></description><link>https://gradientflow.substack.com/p/a-playbook-for-production-ready-ai</link><guid isPermaLink="false">https://gradientflow.substack.com/p/a-playbook-for-production-ready-ai</guid><dc:creator><![CDATA[Ben Lorica 罗瑞卡]]></dc:creator><pubDate>Tue, 20 Jan 2026 14:02:03 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/93b876bc-0030-4b5e-af4e-6bae30957442_1016x834.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://gradientflow.com/newsletter/" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Qpug!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 424w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 848w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 1272w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Qpug!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png" width="1100" height="220" data-attrs="{&quot;src&quot;:&quot;https://bucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com/public/images/0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:220,&quot;width&quot;:1100,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:24747,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:&quot;https://gradientflow.com/newsletter/&quot;,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!Qpug!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 424w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 848w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 1272w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a></figure></div><p><strong><a href="https://gradientflow.substack.com/subscribe">Subscribe</a> &#8226;<a href="https://gradientflow.substack.com/"> Previous Issues</a></strong></p><h1><strong>AI Reliability Patterns That Generalize Beyond Medicine</strong></h1><p>The gap between pilot projects and production deployments has emerged as a defining challenge for enterprise AI teams. Recent surveys indicate that <a href="https://fortune.com/2025/08/18/mit-report-95-percent-generative-ai-pilots-at-companies-failing-cfo/">only a small percentage</a> of generative AI initiatives reach full production, with most stalling due to brittle workflows and integration failures. At last year&#8217;s <strong><a href="https://aiconference.com/?utm_source=gradientflow&amp;utm_medium=newsletter">AI Conference</a></strong>, several colleagues independently told me that reliability &#8212; not raw performance &#8212; has become their primary concern. This rush-to-market mentality, with <a href="https://pacific.ai/2025-ai-governance-survey/?utm_source=gradientflow&amp;utm_medium=newsletter">56% of technical leaders</a> admitting they prioritize speed over safety, results in systems prone to unpredictable and hard-to-diagnose failures.</p><p>The emphasis on reliability stems from a practical reality: in real-world AI applications, predictability often matters more than peak accuracy. <a href="https://www.newyorker.com/magazine/2025/09/29/if-ai-can-diagnose-patients-what-are-doctors-for?utm_source=gradientflow&amp;utm_medium=newsletter">When Harvard researchers tested</a> a medical AI with the same clinical case but different personas, the system recommended growth hormone therapy when &#8220;acting as a physician&#8221; but denied identical treatment when &#8220;acting as an insurance representative.&#8221; Such non-deterministic behavior makes systems unusable regardless of benchmark scores, creating compliance nightmares and destroying stakeholder trust. My interest in AI reliability brought me to healthcare, the domain where unreliable systems carry the highest possible stakes. The hard-won lessons from medical AI teams provide a roadmap that translates powerfully to any industry serious about building dependable systems.</p><h4>How Medical AI Systems Break Down</h4><p>The challenges in medical AI reliability are multifaceted, spanning model behavior, data quality, and human-computer interaction. <strong>Hallucinations</strong> represent perhaps the most dangerous category: generative models produce confident but entirely fabricated information. A <a href="https://www.newyorker.com/magazine/2025/09/29/if-ai-can-diagnose-patients-what-are-doctors-for?utm_source=gradientflow&amp;utm_medium=newsletter">recent case</a> documented in the Annals of Internal Medicine involved a chatbot recommending bromide &#8212; a toxic chemical &#8212; as a table salt substitute. The user followed the advice and required hospitalization for severe poisoning. What makes hallucinations particularly insidious is their plausibility; fabricated lab values or treatment recommendations often appear reasonable to non-specialists, allowing &#8220;corrosive hallucinations&#8221; to survive routine checks.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!t0-R!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F82acc999-ed93-4968-9018-df64a08e2eae_3997x1969.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!t0-R!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F82acc999-ed93-4968-9018-df64a08e2eae_3997x1969.jpeg 424w, https://substackcdn.com/image/fetch/$s_!t0-R!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F82acc999-ed93-4968-9018-df64a08e2eae_3997x1969.jpeg 848w, https://substackcdn.com/image/fetch/$s_!t0-R!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F82acc999-ed93-4968-9018-df64a08e2eae_3997x1969.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!t0-R!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F82acc999-ed93-4968-9018-df64a08e2eae_3997x1969.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!t0-R!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F82acc999-ed93-4968-9018-df64a08e2eae_3997x1969.jpeg" width="1456" height="717" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/82acc999-ed93-4968-9018-df64a08e2eae_3997x1969.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:717,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:888559,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/174947565?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F82acc999-ed93-4968-9018-df64a08e2eae_3997x1969.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!t0-R!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F82acc999-ed93-4968-9018-df64a08e2eae_3997x1969.jpeg 424w, https://substackcdn.com/image/fetch/$s_!t0-R!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F82acc999-ed93-4968-9018-df64a08e2eae_3997x1969.jpeg 848w, https://substackcdn.com/image/fetch/$s_!t0-R!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F82acc999-ed93-4968-9018-df64a08e2eae_3997x1969.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!t0-R!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F82acc999-ed93-4968-9018-df64a08e2eae_3997x1969.jpeg 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">(<strong><a href="https://gradientflow.com/wp-content/uploads/2025/10/Reliability-and-Medical-AI-challenges.jpeg">enlarge</a></strong>)</figcaption></figure></div><p><strong>Output inconsistency</strong> compounds these risks. Research testing large language models on orthopedic treatment guidelines found that the same AI system <a href="https://pmc.ncbi.nlm.nih.gov/articles/PMC10879172/?utm_source=gradientflow&amp;utm_medium=newsletter">produced contradictory medical recommendations</a> depending solely on how the question was framed. When given identical clinical scenarios through different prompting approaches, one model provided varying levels of treatment endorsement for the same osteoarthritis interventions, with consistency rates fluctuating dramatically based on the input style alone. This prompt-dependent reasoning reveals a fundamental reliability flaw: the system optimizes for perceived question expectations rather than consistent clinical logic. The specialized medical <a href="https://www.newyorker.com/magazine/2025/09/29/if-ai-can-diagnose-patients-what-are-doctors-for?utm_source=gradientflow&amp;utm_medium=newsletter">AI CaBot demonstrated similar brittleness</a>, performing expertly on structured clinical cases but hallucinating fabricated vital signs when presented with the same patient history in narrative format. This fragility to input formatting &#8212; where minor prompt changes collapse performance &#8212; mirrors challenges teams face across domains when deploying models trained on curated benchmarks into messy production environments.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://gradientflow.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption"><strong>This newsletter is reader-supported. Become a paid subscriber.</strong></p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p>The <strong>human-system interaction risks</strong> deserve particular attention for their applicability beyond healthcare. A <a href="https://www.worksinprogress.news/p/why-ai-isnt-replacing-radiologists?utm_source=gradientflow&amp;utm_medium=newsletter">mammography trial</a> found that radiologists using AI assistance correctly identified only half the malignancies their unaided colleagues caught (50% versus 68%). Clinicians excessively deferred to the tool, treating the absence of an AI alert as confirmation of a clean scan. This <strong>automation bias</strong> &#8212; where human operators over-trust algorithmic outputs even when incorrect &#8212; represents a systemic failure mode that affects any domain where AI assists expert decision-making. Another emerging concern is cognitive <strong>de-skilling</strong>: <a href="https://www.newyorker.com/magazine/2025/09/29/if-ai-can-diagnose-patients-what-are-doctors-for?utm_source=gradientflow&amp;utm_medium=newsletter">gastroenterologists who regularly used an AI</a> polyp detection tool became significantly worse at the task when performing it without assistance. This skill atrophy reduces overall system resilience, creating brittle human-AI combinations that perform worse than either component alone.</p><h4>A Practical Playbook for Medical AI Reliability</h4><p>Just as the challenges are well-defined, so too are the strategies for mitigating them. For generative AI, one of the most effective techniques is <strong>Knowledge Grounding and Evidence Integration</strong>. By implementing Retrieval-Augmented Generation (RAG), systems can be guided to base their responses on information retrieved from vetted sources like medical knowledge bases, clinical guidelines, and peer-reviewed literature. This approach <a href="https://gradientflow.com/rag-2024-04-papers/">reduces hallucinations</a> and, when combined with <strong>Structured Citation and Source Verification</strong>, allows clinicians to independently validate the model&#8217;s reasoning chain, building essential trust and transparency.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!LeCF!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4009974b-4018-4d21-9295-834b8803faa5_3836x2238.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!LeCF!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4009974b-4018-4d21-9295-834b8803faa5_3836x2238.jpeg 424w, https://substackcdn.com/image/fetch/$s_!LeCF!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4009974b-4018-4d21-9295-834b8803faa5_3836x2238.jpeg 848w, https://substackcdn.com/image/fetch/$s_!LeCF!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4009974b-4018-4d21-9295-834b8803faa5_3836x2238.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!LeCF!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4009974b-4018-4d21-9295-834b8803faa5_3836x2238.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!LeCF!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4009974b-4018-4d21-9295-834b8803faa5_3836x2238.jpeg" width="1456" height="849" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/4009974b-4018-4d21-9295-834b8803faa5_3836x2238.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:849,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1068530,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/174947565?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4009974b-4018-4d21-9295-834b8803faa5_3836x2238.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!LeCF!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4009974b-4018-4d21-9295-834b8803faa5_3836x2238.jpeg 424w, https://substackcdn.com/image/fetch/$s_!LeCF!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4009974b-4018-4d21-9295-834b8803faa5_3836x2238.jpeg 848w, https://substackcdn.com/image/fetch/$s_!LeCF!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4009974b-4018-4d21-9295-834b8803faa5_3836x2238.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!LeCF!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4009974b-4018-4d21-9295-834b8803faa5_3836x2238.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">(<strong><a href="https://gradientflow.com/wp-content/uploads/2025/10/Reliability-and-Medical-AI-tools-and-techniques.jpeg">enlarge</a></strong>)</figcaption></figure></div><p>A second crucial set of tools falls under <strong>Uncertainty Management and Selective Deployment</strong>. A reliable system must know its own limits. The <a href="https://arxiv.org/abs/2504.18412">Therabot clinical trial</a> illustrates why such selective abstention matters: despite careful fine-tuning on curated dialogues, human clinicians still needed to manually review all AI-generated messages to catch instances of false medical advice. Techniques like <strong>Selective Prediction and Abstention</strong> configure a model to refuse to answer low-confidence or out-of-scope queries, automatically routing them to a human expert instead. This ensures that the system fails gracefully rather than providing a potentially dangerous guess. Well-calibrated confidence scores enable systems to gate high-stakes actions, preventing autonomous behavior when uncertainty is high. This principle is broadly applicable: any enterprise system, whether in finance, law, or manufacturing, benefits from an AI that knows when to ask for help.</p><div class="pullquote"><p>In real-world AI applications, predictability often matters more than peak accuracy.</p></div><p>Finally, effective reliability requires designing robust <strong>Human-AI Collaboration Frameworks</strong>. Instead of replacing human experts, AI should be integrated into <strong>Structured Human-in-the-Loop Workflows</strong>. The AI can serve as a &#8220;first opinion&#8221; tool to surface possibilities, a &#8220;second opinion&#8221; to validate a diagnosis, or a &#8220;safety net&#8221; to flag potential omissions. Each pattern maintains appropriate human oversight while leveraging the AI&#8217;s strengths. Furthermore, simple interventions like <strong>Prompting Protocols and Training </strong>&#8212; teaching healthcare providers how to formulate queries to elicit differential diagnoses rather than single answers &#8212; can measurably improve output quality and reduce the impact of prompt sensitivity.</p><h4>Broader Lessons for Building Dependable AI</h4><p>While these examples are drawn from the high-stakes world of medicine, the underlying principles apply directly to any enterprise AI application. The medical AI experience reveals that reliability challenges stem less from model architecture limitations than from deployment patterns and system design choices. Hallucinations, output inconsistency, automation bias, and cognitive de-skilling affect any application where generative models provide decision support to human experts. Similarly, the remediation techniques &#8212; knowledge grounding, uncertainty-aware abstention, and structured collaboration patterns &#8212; transfer directly to other domains. Teams building financial analysis tools, legal document systems, or software engineering assistants face the same fundamental tension between model capability and deployment reliability.</p><div class="pullquote"><p>A reliable system knows when to abstain &#8212; and when to hand off to a human.</p></div><p>The evolution from pilot to production requires <a href="https://gradientflow.substack.com/p/inside-the-agent-optimization-toolkit">deliberately engineering</a> for predictable behavior rather than optimizing for peak performance on curated benchmarks. This connects to the broader <a href="https://gradientflow.substack.com/p/why-your-multi-agent-ai-keeps-failing">challenge of multi-agent systems</a>: when systems fail, practitioners need structured frameworks for identifying whether failures stem from input distribution shifts, poor calibration, inadequate validation, or inappropriate human-system interaction patterns. By treating reliability as a first-class design concern &#8212; implementing layered defenses, monitoring for drift, and carefully structuring human oversight &#8212; teams can build generative AI applications that organizations will actually trust in production. The <a href="https://fortune.com/2025/08/18/mit-report-95-percent-generative-ai-pilots-at-companies-failing-cfo/">95% pilot failure rate</a> suggests most teams are still learning these lessons.</p><h5><strong>From the Archives: Related Reading</strong></h5><ul><li><p><a href="https://gradientflow.substack.com/p/inside-the-agent-optimization-toolkit">Agent workflows: stop guessing, start measuring</a></p></li><li><p><a href="https://gradientflow.substack.com/p/are-your-ai-agents-flying-blind-in">Are Your AI Agents Flying Blind in Production?</a></p></li><li><p>[2024 Report] <a href="https://gradientflow.com/generative-ai-in-healthcare/">Generative AI&#8217;s Impact on Healthcare</a></p></li></ul><div><hr></div><h1><strong>Tool Recommendations</strong></h1><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ZDZI!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd6660c11-0f24-44a3-b1b7-edf9e585fa98_1553x673.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ZDZI!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd6660c11-0f24-44a3-b1b7-edf9e585fa98_1553x673.jpeg 424w, https://substackcdn.com/image/fetch/$s_!ZDZI!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd6660c11-0f24-44a3-b1b7-edf9e585fa98_1553x673.jpeg 848w, https://substackcdn.com/image/fetch/$s_!ZDZI!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd6660c11-0f24-44a3-b1b7-edf9e585fa98_1553x673.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!ZDZI!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd6660c11-0f24-44a3-b1b7-edf9e585fa98_1553x673.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ZDZI!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd6660c11-0f24-44a3-b1b7-edf9e585fa98_1553x673.jpeg" width="532" height="230.55769230769232" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d6660c11-0f24-44a3-b1b7-edf9e585fa98_1553x673.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:631,&quot;width&quot;:1456,&quot;resizeWidth&quot;:532,&quot;bytes&quot;:137812,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/174947565?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd6660c11-0f24-44a3-b1b7-edf9e585fa98_1553x673.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!ZDZI!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd6660c11-0f24-44a3-b1b7-edf9e585fa98_1553x673.jpeg 424w, https://substackcdn.com/image/fetch/$s_!ZDZI!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd6660c11-0f24-44a3-b1b7-edf9e585fa98_1553x673.jpeg 848w, https://substackcdn.com/image/fetch/$s_!ZDZI!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd6660c11-0f24-44a3-b1b7-edf9e585fa98_1553x673.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!ZDZI!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd6660c11-0f24-44a3-b1b7-edf9e585fa98_1553x673.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a><figcaption class="image-caption"><strong>OpenCode + OpenRouter: O&#8322;</strong> (oxygen) for your workflow.</figcaption></figure></div><p>When reading about AI coding tools, the names that often get mentioned are Claude Code, Cursor, and Google Antigravity. I&#8217;d like to put forth another option that I&#8217;ve come to enjoy using: the combination of <a href="https://opencode.ai/?utm_source=gradientflow&amp;utm_medium=newsletter">OpenCode</a> and <a href="https://openrouter.ai/?utm_source=gradientflow&amp;utm_medium=newsletter">OpenRouter</a>. While I&#8217;m not really an early adopter and put off trying the <a href="https://opencode.ai/download?utm_source=gradientflow&amp;utm_medium=newsletter">OpenCode </a><strong><a href="https://opencode.ai/download?utm_source=gradientflow&amp;utm_medium=newsletter">Desktop App</a></strong> for a while, I finally jumped in several weeks ago and have to say I&#8217;ve really enjoyed using it. This combination has really hit the sweet spot for me &#8212; when you pair OpenCode with OpenRouter&#8217;s easy access to all the <a href="https://lmarena.ai/leaderboard/webdev?utm_source=gradientflow&amp;utm_medium=newsletter">leading models for coding</a>, it becomes an incredible toolset for your projects or for developing tutorials and courses.</p><div><hr></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!BLKP!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc2c035ea-f879-4679-befb-9b74c3d0c29c_1852x986.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!BLKP!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc2c035ea-f879-4679-befb-9b74c3d0c29c_1852x986.jpeg 424w, https://substackcdn.com/image/fetch/$s_!BLKP!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc2c035ea-f879-4679-befb-9b74c3d0c29c_1852x986.jpeg 848w, https://substackcdn.com/image/fetch/$s_!BLKP!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc2c035ea-f879-4679-befb-9b74c3d0c29c_1852x986.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!BLKP!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc2c035ea-f879-4679-befb-9b74c3d0c29c_1852x986.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!BLKP!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc2c035ea-f879-4679-befb-9b74c3d0c29c_1852x986.jpeg" width="548" height="291.68956043956047" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/c2c035ea-f879-4679-befb-9b74c3d0c29c_1852x986.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:775,&quot;width&quot;:1456,&quot;resizeWidth&quot;:548,&quot;bytes&quot;:418093,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/174947565?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc2c035ea-f879-4679-befb-9b74c3d0c29c_1852x986.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!BLKP!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc2c035ea-f879-4679-befb-9b74c3d0c29c_1852x986.jpeg 424w, https://substackcdn.com/image/fetch/$s_!BLKP!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc2c035ea-f879-4679-befb-9b74c3d0c29c_1852x986.jpeg 848w, https://substackcdn.com/image/fetch/$s_!BLKP!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc2c035ea-f879-4679-befb-9b74c3d0c29c_1852x986.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!BLKP!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc2c035ea-f879-4679-befb-9b74c3d0c29c_1852x986.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><ul><li><p><strong><a href="https://amzn.to/4jsxvsh">Off the Scales: The Inside Story of Ozempic and the Race to Cure Obesity</a></strong>. I found this to be a lean, rigorous account of the GLP-1 revolution, tracing the path from fundamental laboratory research to a global pharmaceutical phenomenon. It offers a clear-eyed look into the drugs currently reshaping the healthcare <strong>and</strong> <a href="https://news.cornell.edu/stories/2025/12/ozempic-changing-foods-americans-buy?utm_source=gradientflow&amp;utm_medium=newsletter">food industries</a>.</p></li><li><p><strong><a href="https://amzn.to/4pwyFEm">Moderation: A Novel</a></strong>.<strong> </strong>A sharp, unsentimental look at the &#8220;digital sanitation&#8221; required to sustain virtual reality and AI ecosystems. It moves beyond the hype of immersive platforms to examine the human labor and systemic risks that founders and investors often overlook.</p></li><li><p><strong><a href="https://amzn.to/3N769fa">True Nature: The Pilgrimage of Peter Matthiessen</a>. </strong> This cradle-to-grave bio is a clear-eyed profile of the <em><strong>only</strong></em> writer to secure <strong>National Book Awards</strong> for both fiction <strong>and</strong> nonfiction. It traces his trajectory from CIA-linked literary circles in Paris to the front lines of environmentalism, detailing both his brilliance and his significant personal failings. It&#8217;s a long read, but well worth it.</p></li></ul><div><hr></div><p></p><p><em><strong><a href="https://gradientflow.com/disclosure/">Ben Lorica</a></strong> edits the <a href="https://gradientflow.substack.com/">Gradient Flow newsletter</a> and hosts the <strong><a href="https://thedataexchange.media/">Data Exchange podcast</a></strong>. He helps organize the <strong><a href="https://aiconference.com/?utm_source=gradientflow&amp;utm_medium=newsletter">AI Conference</a></strong>, the <strong><a href="https://agentconference.com/?utm_source=gradientflow&amp;utm_medium=newsletter">AI Agent Conference</a></strong>, the <strong><a href="https://appliedaisummit.org/">Applied AI Summit</a></strong>, while also serving as the Strategic Content Chair for AI at the <strong><a href="https://events.linuxfoundation.org/">Linux Foundation</a></strong>. You can follow him on <a href="https://www.linkedin.com/in/benlorica/">Linkedin</a>, <a href="https://x.com/bigdata">X</a>, <a href="https://indieweb.social/@bigdata">Mastodon</a>, <a href="https://www.reddit.com/r/GradientFlow/">Reddit</a>, <a href="https://bsky.app/profile/gradientflow.com">Bluesky</a>, <a href="https://www.youtube.com/c/GradientFlow">YouTube</a>, or <a href="https://www.tiktok.com/@gradientflow">TikTok</a>. This newsletter is produced by <a href="https://gradientflow.com/blog/">Gradient Flow</a>.</em></p>]]></content:encoded></item><item><title><![CDATA[Emerging AI patterns in finance (what to watch in 2026)]]></title><description><![CDATA[Put data, machine learning, and AI to work.]]></description><link>https://gradientflow.substack.com/p/emerging-ai-patterns-in-finance-what</link><guid isPermaLink="false">https://gradientflow.substack.com/p/emerging-ai-patterns-in-finance-what</guid><dc:creator><![CDATA[Ben Lorica 罗瑞卡]]></dc:creator><pubDate>Tue, 13 Jan 2026 14:05:47 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/00c56e9f-703d-4894-88b5-f7d28730f4b8_1480x896.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://gradientflow.com/newsletter/" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Qpug!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 424w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 848w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 1272w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Qpug!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png" width="1100" height="220" data-attrs="{&quot;src&quot;:&quot;https://bucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com/public/images/0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:220,&quot;width&quot;:1100,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:24747,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:&quot;https://gradientflow.com/newsletter/&quot;,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!Qpug!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 424w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 848w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 1272w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a></figure></div><p><strong><a href="https://gradientflow.substack.com/subscribe">Subscribe</a> &#8226;<a href="https://gradientflow.substack.com/"> Previous Issues</a></strong></p><h1><strong>What&#8217;s Emerging in Financial AI: From Foundation Models to Compliance-as-Code</strong></h1><p>While the public discourse remains fixated on Artificial General Intelligence, the more immediate and consequential story is the <strong><a href="https://gradientflow.substack.com/p/the-real-ai-race-its-about-diffusion">diffusion</a></strong> of AI into specialized enterprise domains. Having spent time as a quant within the hedge fund industry, I have long viewed financial services as the primary bellwether for how emerging technologies transition from research labs to production environments. The sector&#8217;s unique combination of high-frequency data, rigorous regulatory constraints, and clear economic incentives makes it an ideal laboratory for stress-testing new technologies. While I track the evolution of foundation models closely, my interest is primarily pragmatic: I look for how breakthroughs in relational modeling, reinforced reasoning, and multimodal integration can be harnessed to solve specific, high-stakes problems within enterprises.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://gradientflow.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption"><em>Been reading for a while? Support our work by becoming a paid subscriber.</em></p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><h4><strong>Foundational Model Architectures (The New Capabilities)</strong></h4><p><strong>Time-Series and Relational Modeling</strong></p><p>While <a href="https://gradientflow.substack.com/i/175758085/time-series-foundation-models-what-you-need-to-know">Time Series Foundation Models</a> (TSFMs) are establishing new benchmarks for forecasting and anomaly detection, a more recent shift lies in <a href="https://gradientflow.substack.com/i/175758085/time-series-in-context-forecasting-from-relational-data">Relational Foundation Models</a> (RFMs). By employing Graph Transformers, these architectures map entities as nodes and interactions as edges, <a href="https://thedataexchange.media/jure-lescovec-kumo-ai/">allowing the model to &#8220;borrow strength&#8221;</a> from connected signals &#8212; such as supply chain links or customer-product hierarchies. This approach enables the system to capture how idiosyncratic events propagate through a business network, effectively bypassing the need for manual feature engineering in complex relational datasets. In a <a href="https://thedataexchange.media/jure-lescovec-kumo-ai/">recent conversation</a>, we even speculated on whether this ability to model interdependencies could offer a distinct edge in quantitative trading.</p><p><strong>Multimodal Integration</strong></p><p>Financial analysis will transition from text-heavy Large Language Models (LLMs) to <a href="https://github.com/Open-Finance-Lab/Awesome-MFFMs/?tab=readme-ov-file#multimodal-financial-foundation-models">Multimodal Financial Foundation Models</a> (MFFMs) capable of ingesting interleaved data streams. Rather than segregating data into distinct pipelines, these systems process audio from earnings calls, video from policy conferences, tabular financial statements, and market tick data within a unified embedding space. The objective is to replicate the workflow of a human analyst, who simultaneously synthesizes management tone, quantitative metrics, and price action into a single coherent thesis.</p><p><strong>Reinforced Chain-of-Thought</strong></p><p>To handle complex problems and tasks, some teams are starting to train models rather than just prompt them more cleverly. Using reinforcement learning methods such as GRPO, they teach models to lay out their reasoning step by step when answering financial questions. Because this behavior is built in during training, the system can solve multi-step problems more accurately without depending on a human to provide detailed guidance every time.</p><div class="pullquote"><p>The near-term story isn&#8217;t AGI&#8212;it&#8217;s domain-specific AI that survives audits, latency budgets, and messy production data</p></div><p><strong>The Shift to Small Language Models (SLMs)</strong></p><p>The prevailing assumption that superior performance necessitates massive parameter counts is being challenged by a focus on operational latency and data sovereignty. For real-time applications, such as fraud detection or mobile interfaces, the inference costs of frontier models (often exceeding 70 billion parameters) can be prohibitive. The trajectory for 2026 favors Small Language Models &#8212; typically under 7 billion parameters &#8212; that achieve frontier-level performance on narrow, domain-specific tasks. This architecture will allow financial institutions to deploy sophisticated reasoning on commodity hardware within air-gapped servers, ensuring sensitive data never leaves the premises.</p><p><strong>Autonomous Market Microstructure</strong></p><p>A quiet transformation is occurring in the &#8220;last mile&#8221; of finance: trade execution. <a href="https://www.frontiersin.org/journals/artificial-intelligence/articles/10.3389/frai.2025.1616485/full">Research</a> indicates that deep learning models can effectively interpret real-time buy and sell orders to forecast liquidity and future price movements with superior accuracy. This development lays the groundwork for autonomous agents that go beyond price prediction to actively negotiate trades, adapting execution strategies in real-time to minimize slippage and costs in volatile market environments.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!szEh!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9a8cbb63-f01f-4f49-8404-a90ccb32b1d9_1728x1042.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!szEh!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9a8cbb63-f01f-4f49-8404-a90ccb32b1d9_1728x1042.jpeg 424w, https://substackcdn.com/image/fetch/$s_!szEh!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9a8cbb63-f01f-4f49-8404-a90ccb32b1d9_1728x1042.jpeg 848w, https://substackcdn.com/image/fetch/$s_!szEh!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9a8cbb63-f01f-4f49-8404-a90ccb32b1d9_1728x1042.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!szEh!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9a8cbb63-f01f-4f49-8404-a90ccb32b1d9_1728x1042.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!szEh!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9a8cbb63-f01f-4f49-8404-a90ccb32b1d9_1728x1042.jpeg" width="624" height="376.2857142857143" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/9a8cbb63-f01f-4f49-8404-a90ccb32b1d9_1728x1042.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:878,&quot;width&quot;:1456,&quot;resizeWidth&quot;:624,&quot;bytes&quot;:271225,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/182880797?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9a8cbb63-f01f-4f49-8404-a90ccb32b1d9_1728x1042.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!szEh!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9a8cbb63-f01f-4f49-8404-a90ccb32b1d9_1728x1042.jpeg 424w, https://substackcdn.com/image/fetch/$s_!szEh!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9a8cbb63-f01f-4f49-8404-a90ccb32b1d9_1728x1042.jpeg 848w, https://substackcdn.com/image/fetch/$s_!szEh!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9a8cbb63-f01f-4f49-8404-a90ccb32b1d9_1728x1042.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!szEh!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9a8cbb63-f01f-4f49-8404-a90ccb32b1d9_1728x1042.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption"><a href="https://www.frontiersin.org/journals/artificial-intelligence/articles/10.3389/frai.2025.1616485/full?utm_source=gradientflow&amp;utm_medium=newsletter">LiT: limit order book transformer</a></figcaption></figure></div><h4><strong>Deployment Patterns (How It&#8217;s Being Used)</strong></h4><p><strong>Multi-Agent Workflows</strong></p><p>The paradigm is shifting from single-prompt problem solving to the <strong><a href="https://gradientflow.substack.com/p/inside-the-agent-optimization-toolkit">optimization and orchestration</a></strong> of specialized agent &#8220;crews.&#8221; In these systems, distinct agents assume roles such as &#8220;Planner,&#8221; &#8220;Coder,&#8221; &#8220;Risk Officer,&#8221; or &#8220;Auditor,&#8221; collaborating to execute complex workflows. These agents leverage standardized protocols, such as the Model Context Protocol (MCP), to utilize external tools and communicate. Tool Agents manage search and code execution, while Financial Service Agents handle domain-specific tasks like credit scoring and compliance, creating a modular and resilient operational structure. </p><p><strong>Hybrid Quant Architectures</strong></p><p>Rather than displacing incumbent quantitative infrastructure, modern AI is increasingly integrated as a reasoning and interface layer on top of established engines. In these hybrid stacks, LLMs handle semantic tasks &#8212; summarizing research, proposing signals, and explaining portfolio composition &#8212; while TSFMs and RFMs generate forecasts. However, critical functions such as allocation, risk management, and execution remain the domain of traditional optimizers and models. LLMs are often utilized &#8220;offline&#8221; to extract features from unstructured text, which are then fed into robust, lightweight classical models (such as XGBoost) for final prediction.</p><p><strong>Open-Source Alpha Generation (WallStreetBets &#129308;&#129307; AI)</strong></p><p>A growing body of research focuses on the democratization of systematic investment strategies using open-weights models and public data. These pipelines typically ingest unstructured content &#8212; news, social media, and video transcripts &#8212; to extract signals. These signals are subsequently processed by classical models and portfolio optimizers. This trend suggests that the barrier to entry for sophisticated, data-driven investing will lower even more, enabling smaller institutions and retail investors to construct alpha-generating strategies previously reserved for well-capitalized firms.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!aQoN!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd6a330cc-8948-4a09-b507-de24001a4365_1590x1027.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!aQoN!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd6a330cc-8948-4a09-b507-de24001a4365_1590x1027.jpeg 424w, https://substackcdn.com/image/fetch/$s_!aQoN!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd6a330cc-8948-4a09-b507-de24001a4365_1590x1027.jpeg 848w, https://substackcdn.com/image/fetch/$s_!aQoN!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd6a330cc-8948-4a09-b507-de24001a4365_1590x1027.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!aQoN!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd6a330cc-8948-4a09-b507-de24001a4365_1590x1027.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!aQoN!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd6a330cc-8948-4a09-b507-de24001a4365_1590x1027.jpeg" width="538" height="347.33516483516485" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d6a330cc-8948-4a09-b507-de24001a4365_1590x1027.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:940,&quot;width&quot;:1456,&quot;resizeWidth&quot;:538,&quot;bytes&quot;:335472,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/182880797?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd6a330cc-8948-4a09-b507-de24001a4365_1590x1027.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!aQoN!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd6a330cc-8948-4a09-b507-de24001a4365_1590x1027.jpeg 424w, https://substackcdn.com/image/fetch/$s_!aQoN!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd6a330cc-8948-4a09-b507-de24001a4365_1590x1027.jpeg 848w, https://substackcdn.com/image/fetch/$s_!aQoN!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd6a330cc-8948-4a09-b507-de24001a4365_1590x1027.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!aQoN!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd6a330cc-8948-4a09-b507-de24001a4365_1590x1027.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h4><strong>Safety, Governance &amp; Production Infrastructure (Making It Real)</strong></h4><p><strong>&#8220;White Box&#8221; Verification</strong></p><p>To mitigate the risk of hallucination, institutions are adopting &#8220;White Box&#8221; architectures that position LLMs as critics and auditors. Systems utilizing frameworks like <a href="https://arxiv.org/abs/2511.19671v1">FISCAL</a> and <a href="https://arxiv.org/abs/2510.13920v1">FACTS</a> employ agents to validate numerical claims against primary financial documents and summarize complex tabular data. By grounding outputs in structured data and implementing agentic workflows for claim-checking, these architectures prioritize explainability and factual accuracy over pure generative capability.</p><p><strong>Privacy-Preserving Deployment</strong></p><p>With strict regulatory constraints on data sharing, the focus is turning toward adapting LLMs for financial tasks without exposing sensitive information to third-party providers. Techniques such as <strong>context-masked meta-prompting </strong>&#8212; which sanitizes inputs prior to inference &#8212; and the generation of offline, reusable prompt templates are becoming standard. Furthermore, governance mechanisms &#8212; including logging, rule-based overlays, and <a href="https://www.hirundo.io/?utm_source=gradientflow&amp;utm_medium=newsletter">unlearning</a> protocols &#8212; are being treated as primary design requirements rather than afterthoughts.</p><p><strong>Synthetic Data Infrastructure</strong></p><p>Synthetic data is evolving from a niche research topic into a core infrastructure tool. Generative models are now used to create multimodal datasets that mimic proprietary distributions &#8212; preserving the statistical properties of sensitive data without exposing underlying records. In market risk, these models generate realistic but counterfactual return paths, including synthetic &#8220;crash&#8221; scenarios. This allows for robust stress testing and the simulation of limit order books, providing a safe environment for model training and validation.</p><p><strong>Continuous Compliance</strong></p><p>Regulatory frameworks like the EU AI Act are driving a transition from periodic model validation to &#8220;compliance-as-code.&#8221; In 2026, compliance is expected to become a continuous architectural process. This involves real-time lineage tracking to document data provenance for every inference, alongside automated benchmarking suites (such as <a href="https://arxiv.org/abs/2506.15846">FLAME</a>) that stress-test models against adversarial scenarios and regime shifts. Systems will be required to generate automated evidence of adherence, managing model drift and fairness violations dynamically as they emerge.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!g92v!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd5f21d4e-9d8d-4673-8c94-6db878543ded_1403x851.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!g92v!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd5f21d4e-9d8d-4673-8c94-6db878543ded_1403x851.jpeg 424w, https://substackcdn.com/image/fetch/$s_!g92v!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd5f21d4e-9d8d-4673-8c94-6db878543ded_1403x851.jpeg 848w, https://substackcdn.com/image/fetch/$s_!g92v!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd5f21d4e-9d8d-4673-8c94-6db878543ded_1403x851.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!g92v!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd5f21d4e-9d8d-4673-8c94-6db878543ded_1403x851.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!g92v!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd5f21d4e-9d8d-4673-8c94-6db878543ded_1403x851.jpeg" width="1403" height="851" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d5f21d4e-9d8d-4673-8c94-6db878543ded_1403x851.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:851,&quot;width&quot;:1403,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:196594,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/182880797?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd5f21d4e-9d8d-4673-8c94-6db878543ded_1403x851.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!g92v!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd5f21d4e-9d8d-4673-8c94-6db878543ded_1403x851.jpeg 424w, https://substackcdn.com/image/fetch/$s_!g92v!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd5f21d4e-9d8d-4673-8c94-6db878543ded_1403x851.jpeg 848w, https://substackcdn.com/image/fetch/$s_!g92v!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd5f21d4e-9d8d-4673-8c94-6db878543ded_1403x851.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!g92v!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd5f21d4e-9d8d-4673-8c94-6db878543ded_1403x851.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><div><hr></div><h1><strong>Quick Takes</strong></h1><div id="youtube2-wRhEV9aI6PE" class="youtube-wrap" data-attrs="{&quot;videoId&quot;:&quot;wRhEV9aI6PE&quot;,&quot;startTime&quot;:&quot;36s&quot;,&quot;endTime&quot;:null}" data-component-name="Youtube2ToDOM"><div class="youtube-inner"><iframe src="https://www.youtube-nocookie.com/embed/wRhEV9aI6PE?start=36s&amp;rel=0&amp;autoplay=0&amp;showinfo=0&amp;enablejsapi=0" frameborder="0" loading="lazy" gesture="media" allow="autoplay; fullscreen" allowautoplay="true" allowfullscreen="true" width="728" height="409"></iframe></div></div><ol><li><p><a href="https://youtu.be/wRhEV9aI6PE?t=36">The Humanoid Takeover: Key Takeaways from CES 2026</a></p></li><li><p><a href="https://youtu.be/wRhEV9aI6PE?t=1147">The Great Relaxation: Analyzing the New H200 Chip Export Policies</a></p></li></ol><div><hr></div><h1>Self-Correcting Agent: Closing the Loop with Formal Verification</h1><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!yAQq!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe08044e4-52b3-4905-b079-a0e8c6a241bb_1580x1064.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!yAQq!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe08044e4-52b3-4905-b079-a0e8c6a241bb_1580x1064.jpeg 424w, https://substackcdn.com/image/fetch/$s_!yAQq!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe08044e4-52b3-4905-b079-a0e8c6a241bb_1580x1064.jpeg 848w, https://substackcdn.com/image/fetch/$s_!yAQq!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe08044e4-52b3-4905-b079-a0e8c6a241bb_1580x1064.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!yAQq!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe08044e4-52b3-4905-b079-a0e8c6a241bb_1580x1064.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!yAQq!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe08044e4-52b3-4905-b079-a0e8c6a241bb_1580x1064.jpeg" width="1456" height="980" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/e08044e4-52b3-4905-b079-a0e8c6a241bb_1580x1064.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:980,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:354938,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/182880797?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe08044e4-52b3-4905-b079-a0e8c6a241bb_1580x1064.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!yAQq!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe08044e4-52b3-4905-b079-a0e8c6a241bb_1580x1064.jpeg 424w, https://substackcdn.com/image/fetch/$s_!yAQq!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe08044e4-52b3-4905-b079-a0e8c6a241bb_1580x1064.jpeg 848w, https://substackcdn.com/image/fetch/$s_!yAQq!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe08044e4-52b3-4905-b079-a0e8c6a241bb_1580x1064.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!yAQq!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe08044e4-52b3-4905-b079-a0e8c6a241bb_1580x1064.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">(<strong><a href="https://gradientflow.com/wp-content/uploads/2026/01/AI-pipeline-for-Erdos-problem-728.jpeg">enlarge</a></strong>)</figcaption></figure></div><p>This <strong><a href="https://mathstodon.xyz/@tao/115855840223258103">recent example</a></strong> from the world of mathematics caught my attention as an example of a &#8220;verified&#8221; AI pipeline. The workflow shows how to turn an ambiguous goal into a <em>checked</em> result: an LLM proposes a solution, a formal verifier (proof assistant) checks it, and the loop repeats until it passes. Once the core artifact is verified, you can reliably generate multiple explanations &#8212; technical, narrative, or research-style &#8212; grounded in the same source.</p><div><hr></div><p></p><p><em><strong><a href="https://gradientflow.com/disclosure/">Ben Lorica</a></strong> edits the <a href="https://gradientflow.substack.com/">Gradient Flow newsletter</a> and hosts the <strong><a href="https://thedataexchange.media/">Data Exchange podcast</a></strong>. He helps organize the <strong><a href="https://aiconference.com/?utm_source=gradientflow&amp;utm_medium=newsletter">AI Conference</a></strong>, the <strong><a href="https://agentconference.com/?utm_source=gradientflow&amp;utm_medium=newsletter">AI Agent Conference</a></strong>, the <strong><a href="https://appliedaisummit.org/">Applied AI Summit</a></strong>, while also serving as the Strategic Content Chair for AI at the <strong><a href="https://events.linuxfoundation.org/">Linux Foundation</a></strong>. You can follow him on <a href="https://www.linkedin.com/in/benlorica/">Linkedin</a>, <a href="https://x.com/bigdata">X</a>, <a href="https://indieweb.social/@bigdata">Mastodon</a>, <a href="https://www.reddit.com/r/GradientFlow/">Reddit</a>, <a href="https://bsky.app/profile/gradientflow.com">Bluesky</a>, <a href="https://www.youtube.com/c/GradientFlow">YouTube</a>, or <a href="https://www.tiktok.com/@gradientflow">TikTok</a>. This newsletter is produced by <a href="https://gradientflow.com/blog/">Gradient Flow</a>.</em></p>]]></content:encoded></item><item><title><![CDATA[Agent workflows: stop guessing, start measuring]]></title><description><![CDATA[Put data, machine learning, and AI to work.]]></description><link>https://gradientflow.substack.com/p/inside-the-agent-optimization-toolkit</link><guid isPermaLink="false">https://gradientflow.substack.com/p/inside-the-agent-optimization-toolkit</guid><dc:creator><![CDATA[Ben Lorica 罗瑞卡]]></dc:creator><pubDate>Tue, 06 Jan 2026 14:01:33 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/5656f52c-413d-4300-a3d5-629a3ea20c1b_1454x917.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://gradientflow.com/newsletter/" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Qpug!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 424w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 848w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 1272w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Qpug!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png" width="1100" height="220" data-attrs="{&quot;src&quot;:&quot;https://bucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com/public/images/0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:220,&quot;width&quot;:1100,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:24747,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:&quot;https://gradientflow.com/newsletter/&quot;,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!Qpug!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 424w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 848w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 1272w, https://substackcdn.com/image/fetch/$s_!Qpug!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0f732fb8-bcb3-47bb-93e0-cf2c6c0653c5_1100x220.png 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a></figure></div><p><strong><a href="https://gradientflow.substack.com/subscribe">Subscribe</a> &#8226;<a href="https://gradientflow.substack.com/"> Previous Issues</a></strong></p><h1><strong>Agent Optimization: From Prompt Whispering to Platform Engineering</strong></h1><p>Agent optimization is the work of making an agent workflow dependable &#8212; despite long tool chains, multiple roles, and the inherent variability of large language models. In day-to-day engineering terms, it is closer to debugging a complex system than &#8220;making the model smarter&#8221;: you are tuning roles, prompts, routing, memory, tool use, and verification so the workflow stops failing in repeatable ways.</p><p>The problem has become important because many teams see the same pattern: a compound system looks impressive in a controlled demo, then breaks under real-world inputs and operational constraints. The root cause is often not raw model capability, but workflow issues &#8212; role drift, context loss, weak verification, and coordination failures &#8212; that only show up when you run the full loop repeatedly.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://gradientflow.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://gradientflow.substack.com/subscribe?"><span>Subscribe now</span></a></p><p>Without the specialized tools emerging in this space, the default playbook is manual and hard to scale: engineers read traces by hand, add logs, tweak prompts, patch edge cases with heuristics, and rely on coarse pass/fail dashboards that discard most of the diagnostic signal in the execution trace. Traditional gradient-based training doesn&#8217;t directly apply because the workflow is non-differentiable (it includes API calls, tools, and conditional logic), and many teams only have API access to models &#8212; so even if fine-tuning would help, it may not be available.</p><p>Furthermore, as the number of agents and tools grows, the combinatorial complexity of the system makes manual debugging unscalable. A single change in one agent&#8217;s prompt can have unpredictable downstream effects on the entire collective, leading to a &#8220;guessing game&#8221; that consumes vast engineering resources without guaranteeing improvement.</p><h4>The New Tools of Agentic Engineering</h4><p>A practical toolchain is emerging that makes the loop more systematic: instrument the workflow, diagnose failures, evaluate variants, and then use search or automated refinement to improve prompts and architecture &#8212; while adding guardrails so the optimizer can&#8217;t &#8220;cheat.&#8221;</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://gradientflow.com/wp-content/uploads/2026/01/Agent-Optimization-Toolkit-refinement-loop.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!9i_6!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab997029-508f-42eb-8efb-916fe9d97c80_1843x674.jpeg 424w, https://substackcdn.com/image/fetch/$s_!9i_6!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab997029-508f-42eb-8efb-916fe9d97c80_1843x674.jpeg 848w, https://substackcdn.com/image/fetch/$s_!9i_6!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab997029-508f-42eb-8efb-916fe9d97c80_1843x674.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!9i_6!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab997029-508f-42eb-8efb-916fe9d97c80_1843x674.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!9i_6!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab997029-508f-42eb-8efb-916fe9d97c80_1843x674.jpeg" width="560" height="204.6153846153846" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ab997029-508f-42eb-8efb-916fe9d97c80_1843x674.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:532,&quot;width&quot;:1456,&quot;resizeWidth&quot;:560,&quot;bytes&quot;:153099,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:&quot;https://gradientflow.com/wp-content/uploads/2026/01/Agent-Optimization-Toolkit-refinement-loop.jpeg&quot;,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/182136036?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab997029-508f-42eb-8efb-916fe9d97c80_1843x674.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!9i_6!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab997029-508f-42eb-8efb-916fe9d97c80_1843x674.jpeg 424w, https://substackcdn.com/image/fetch/$s_!9i_6!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab997029-508f-42eb-8efb-916fe9d97c80_1843x674.jpeg 848w, https://substackcdn.com/image/fetch/$s_!9i_6!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab997029-508f-42eb-8efb-916fe9d97c80_1843x674.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!9i_6!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab997029-508f-42eb-8efb-916fe9d97c80_1843x674.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p><strong><a href="https://gradientflow.substack.com/p/are-your-ai-agents-flying-blind-in">System Observability</a> (The Flight Recorder). </strong>Optimization starts with <a href="https://gradientflow.substack.com/p/are-your-ai-agents-flying-blind-in">complete, queryable traces</a>: prompts, tool calls, intermediate outputs, routing decisions, and state. In one MAST-based <a href="https://mast-ucb.notion.site/improve-agents-with-mast">case study</a>, teams used trace tooling (<strong>agentdash</strong>) to annotate runs and generate failure histograms, turning debugging from anecdote into measurable error categories.</p><p><strong>Diagnostic Failure Taxonomies (The &#8220;Why&#8221; Signal). </strong>Instead of &#8220;accuracy dropped,&#8221; teams increasingly want &#8220;verification failed&#8221; or &#8220;agents ignored key context.&#8221; <strong><a href="https://mast-ucb.notion.site/improve-agents-with-mast">MAST</a></strong> is one example: it organizes failures into system design issues, inter-agent misalignment, and task verification problems. The practical benefit is prioritization &#8212; fix the dominant failure class first, rather than iterating blindly.</p><p><strong>Targeted Evaluation (The North Star). </strong>&#8203;&#8203;Optimization is only as good as the metric it targets. This layer moves beyond public benchmarks to create custom &#8220;ground truth&#8221; datasets that mirror specific business logic. Using platforms like <strong><a href="https://microsoft.github.io/promptflow/">Prompt Flow</a></strong>, teams build evaluation sets alongside the application. For complex qualitative traits like &#8220;clarity,&#8221; teams use <strong>Model-Graded Scoring</strong>, where a more capable model acts as a judge. To handle the inherent unpredictability of AI, <a href="https://gradientflow.substack.com/p/beyond-rl-a-new-paradigm-for-agent">some teams now use </a><strong><a href="https://gradientflow.substack.com/p/beyond-rl-a-new-paradigm-for-agent">Tournament Selection</a></strong>, where <a href="https://github.com/zetaalphavector/RAGElo">variants compete head-to-head and are ranked via </a><strong><a href="https://github.com/zetaalphavector/RAGElo">Elo ratings</a></strong><a href="https://github.com/zetaalphavector/RAGElo">,</a> providing a more robust measure of effectiveness than a simple pass/fail score.</p><p><strong>Textual Gradients (The Feedback Mechanism). </strong>Because agent workflows coordinate discrete tools and APIs, they cannot be improved using the standard math of neural networks. Frameworks like <strong><a href="https://github.com/zou-group/textgrad">TextGrad</a></strong> solve this by using &#8220;textual gradients&#8221; &#8212; detailed natural language critiques that explain exactly why a specific step failed. This feedback is propagated backward through the system&#8217;s logic to automatically update prompts or code, allowing the system to learn from its mistakes using language rather than numerical scores.</p><p><strong>Automated Refinement (The Tuner). </strong>Instead of hand-editing prompts, teams are adopting algorithmic optimizers. <strong><a href="https://dspy.ai/learn/optimization/optimizers/">DSPy</a></strong> replaces static strings with optimizable &#8220;signatures,&#8221; using tools like <strong><a href="https://dspy.ai/learn/optimization/optimizers/">MIPROv2</a></strong><a href="https://dspy.ai/learn/optimization/optimizers/"> and </a><strong><a href="https://dspy.ai/learn/optimization/optimizers/">COPRO</a></strong> to search for the best combination of instructions and few-shot examples. <strong><a href="https://github.com/SylphAI-Inc/AdalFlow">AdalFlow</a></strong>  pushes a related idea into a more pipeline-centric library: prompts and few-shot demonstrations become parameters that can be refined via an &#8220;AutoDiff&#8221;-style loop driven by performance metrics.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!u8j-!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde491176-38c4-4f4c-bc11-4a7a78c336b0_1846x1064.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!u8j-!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde491176-38c4-4f4c-bc11-4a7a78c336b0_1846x1064.jpeg 424w, https://substackcdn.com/image/fetch/$s_!u8j-!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde491176-38c4-4f4c-bc11-4a7a78c336b0_1846x1064.jpeg 848w, https://substackcdn.com/image/fetch/$s_!u8j-!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde491176-38c4-4f4c-bc11-4a7a78c336b0_1846x1064.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!u8j-!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde491176-38c4-4f4c-bc11-4a7a78c336b0_1846x1064.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!u8j-!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde491176-38c4-4f4c-bc11-4a7a78c336b0_1846x1064.jpeg" width="1456" height="839" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/de491176-38c4-4f4c-bc11-4a7a78c336b0_1846x1064.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:839,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:283084,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/182136036?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde491176-38c4-4f4c-bc11-4a7a78c336b0_1846x1064.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!u8j-!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde491176-38c4-4f4c-bc11-4a7a78c336b0_1846x1064.jpeg 424w, https://substackcdn.com/image/fetch/$s_!u8j-!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde491176-38c4-4f4c-bc11-4a7a78c336b0_1846x1064.jpeg 848w, https://substackcdn.com/image/fetch/$s_!u8j-!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde491176-38c4-4f4c-bc11-4a7a78c336b0_1846x1064.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!u8j-!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde491176-38c4-4f4c-bc11-4a7a78c336b0_1846x1064.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">(<strong><a href="https://gradientflow.com/wp-content/uploads/2026/01/Agent-Optimization-Toolkit.jpeg">enlarge</a></strong>)</figcaption></figure></div><p><strong>Structural Evolution (The Architect). </strong>Multi-agent design is difficult because the space of possible connections and orchestration patterns grows exponentially as agents are added. Frameworks like <strong><a href="https://github.com/algorithmicsuperintelligence/openevolve">OpenEvolve</a></strong> and <strong><a href="https://github.com/gepa-ai/gepa">GEPA</a></strong> (Genetic-Pareto Evolution) address this by treating system architecture as a search problem. They mutate &#8220;code knobs&#8221; &#8212; such as agent roles, communication topology, and prompts &#8212; and evaluate the variants using detailed diagnostic feedback on why they failed. This process often uncovers patterns humans miss, such as splitting a failing generalist into specialists or using <em>&#8220;negative constraints&#8221;</em> &#8212; explicit &#8220;do not&#8221; instructions (like &#8220;do not plan&#8221;) &#8212; to keep agents from overstepping their assigned roles.</p><p><strong>Multi-Stage Gatekeeping (The Verifier). </strong>Reliability improvements often come from separating &#8220;generate&#8221; from &#8220;verify,&#8221; and layering checks. One pattern is <strong>hybrid verification</strong>: run cheap deterministic checks (e.g., syntax/<a href="https://en.wikipedia.org/wiki/Abstract_syntax_tree">AST</a> parsing) before invoking slower, costlier model-based review. Another is a dedicated verifier role (e.g., <a href="https://adrs-ucb.notion.site/mast">SimpleVerifier</a>) that acts as a gatekeeper rather than a stylistic reviewer.</p><p><strong>Governance and Safety (The Guardrails). </strong>To prevent &#8220;reward hacking&#8221; &#8212; where an AI finds a shortcut to a high score without actually solving the task &#8212; teams must enforce strict boundaries. This involves using <em>surgical edits</em> (or &#8220;diffs&#8221;) that restrict the AI to changing specific pieces of logic rather than rewriting entire files; this prevents the system from accidentally deleting its own safety checks to &#8220;cheat&#8221; the evaluation. Additionally, a <em>Memory Module</em> acts as a permanent record of the &#8220;best-known&#8221; version of the system, ensuring that as the AI experiments with new designs, it never loses progress or reverts to a lower-performing state.</p><h4>The Practical Hurdles of Optimization</h4><p>Despite the rapid advancement of these tools, several significant hurdles remain for AI teams.</p><ul><li><p><strong>The Risk of Reward Hacking:</strong> Automated optimization systems are highly efficient at finding shortcuts. In one documented case, an evolutionary algorithm &#8220;improved&#8221; its score by simply deleting the agent responsible for reporting failures. Without strict guardrails, systems may optimize for the metric rather than the actual business objective.</p></li><li><p><strong>Evaluator Fragility and Noise:</strong> If the initial evaluation metrics are poorly defined or the test data is not diverse enough, the refinement process will optimize for the wrong outcomes. An &#8220;evals-first&#8221; approach is difficult to operationalize when the &#8220;ground truth&#8221; for a task is subjective or constantly shifting.</p></li><li><p><strong>Judge Bias and Inconsistency:</strong> Relying on an LLM to evaluate the performance of other agents introduces potential biases. A &#8220;judge&#8221; model might reward linguistic fluency over functional correctness or exhibit a preference for its own coding style, necessitating a skeptical approach to purely automated scoring.</p></li><li><p><strong>Computational Intensity and Latency:</strong> Generating and testing dozens of code variants is resource-heavy. For teams with strict cost or latency constraints, the iterative nature of evolutionary search can be prohibitive, requiring careful use of &#8220;improvement thresholds&#8221; to stop the loop when gains become marginal.</p></li><li><p><strong>Overfitting and Generalization:</strong> A prompt or topology that performs exceptionally well on a specific evaluation set may fail to generalize to real-world novelty. Ensuring that an optimized agent remains robust against &#8220;data drift&#8221; is an ongoing challenge that requires diverse, <a href="https://github.com/plurai-ai/intellagent">adversarial test sets</a>.</p></li><li><p><strong>Tooling Fragmentation and the &#8220;Integration Tax&#8221;:</strong> An engineering team must manually stitch together best-of-breed (<em>open source</em>) solutions for observability from one library, failure taxonomies from another, and optimization engines from a third. This fragmentation creates a significant &#8220;integration tax,&#8221; where developers spend more time plumbing data between tools than actually refining agent behavior. For optimization to become a standard enterprise discipline, these capabilities must coalesce into integrated, end-to-end platforms that manage the entire loop within a single environment. <em>Proprietary</em> platforms like <a href="https://www.plurai.ai/?utm_source=gradientflow&amp;utm_medium=newsletter#Product">Plurai</a> are headed in this direction.</p></li></ul><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!-M_Y!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdc8ef42b-270b-4a49-ac44-a9d4fa4bfe7c_1854x877.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!-M_Y!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdc8ef42b-270b-4a49-ac44-a9d4fa4bfe7c_1854x877.jpeg 424w, https://substackcdn.com/image/fetch/$s_!-M_Y!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdc8ef42b-270b-4a49-ac44-a9d4fa4bfe7c_1854x877.jpeg 848w, https://substackcdn.com/image/fetch/$s_!-M_Y!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdc8ef42b-270b-4a49-ac44-a9d4fa4bfe7c_1854x877.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!-M_Y!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdc8ef42b-270b-4a49-ac44-a9d4fa4bfe7c_1854x877.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!-M_Y!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdc8ef42b-270b-4a49-ac44-a9d4fa4bfe7c_1854x877.jpeg" width="596" height="282.0357142857143" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/dc8ef42b-270b-4a49-ac44-a9d4fa4bfe7c_1854x877.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:689,&quot;width&quot;:1456,&quot;resizeWidth&quot;:596,&quot;bytes&quot;:304376,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/182136036?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdc8ef42b-270b-4a49-ac44-a9d4fa4bfe7c_1854x877.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!-M_Y!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdc8ef42b-270b-4a49-ac44-a9d4fa4bfe7c_1854x877.jpeg 424w, https://substackcdn.com/image/fetch/$s_!-M_Y!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdc8ef42b-270b-4a49-ac44-a9d4fa4bfe7c_1854x877.jpeg 848w, https://substackcdn.com/image/fetch/$s_!-M_Y!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdc8ef42b-270b-4a49-ac44-a9d4fa4bfe7c_1854x877.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!-M_Y!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdc8ef42b-270b-4a49-ac44-a9d4fa4bfe7c_1854x877.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h4>A Discipline, Not a Bag of Tricks</h4><p>Teams are moving from a model-centric era to a system-centric one. In the past, performance gains were achieved by upgrading to a larger Foundation Model. Today, as demonstrated by case studies using the <a href="https://github.com/multi-agent-systems-failure-taxonomy/MAST">MAST framework</a>, teams can achieve a 50% or greater improvement in accuracy simply by rewiring an agent graph and adding stateful memory &#8212; without upgrading the underlying model. This shift transforms agent optimization from a &#8220;guessing game&#8221; into a scalable, interpretable engineering process.</p><div class="pullquote"><p>Agent optimization is less about making a model smarter and more about debugging a complex system.</p></div><p>This evolution fits into a broader architectural trend. Just as the <a href="https://gradientflow.com/what-is-the-park-stack/">PARK stack</a> (PyTorch, AI Frontier Models, Ray, Kubernetes) has standardized the <strong>compute</strong> substrate, and <a href="https://gradientflow.substack.com/p/the-rise-of-the-multimodal-lakehouse">Multimodal Lakehouses</a> are beginning to consolidate the <strong>data layer</strong>, agent optimization is becoming the <strong>refinement</strong> substrate. The next major milestone for the ecosystem will be the arrival of <em>open source</em> optimization frameworks that integrate natively with these other layers &#8212; allowing an optimizer to scale across a Ray cluster or query a multimodal lakehouse without bespoke glue code.</p><p>For CTOs and founders, the competitive advantage is no longer the model they use, but the speed and rigor of their optimization loop. The teams that ship reliable agents will look less like prompt whisperers and more like disciplined platform teams &#8212; treating agent behavior as something you can measure, diagnose, and iteratively harden.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://gradientflow.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://gradientflow.substack.com/subscribe?"><span>Subscribe now</span></a></p><div><hr></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!EY_x!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc0126451-05ad-4ac0-9802-76531315c081_3960x1398.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!EY_x!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc0126451-05ad-4ac0-9802-76531315c081_3960x1398.jpeg 424w, https://substackcdn.com/image/fetch/$s_!EY_x!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc0126451-05ad-4ac0-9802-76531315c081_3960x1398.jpeg 848w, https://substackcdn.com/image/fetch/$s_!EY_x!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc0126451-05ad-4ac0-9802-76531315c081_3960x1398.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!EY_x!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc0126451-05ad-4ac0-9802-76531315c081_3960x1398.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!EY_x!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc0126451-05ad-4ac0-9802-76531315c081_3960x1398.jpeg" width="1456" height="514" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/c0126451-05ad-4ac0-9802-76531315c081_3960x1398.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:514,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1055947,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://gradientflow.substack.com/i/182136036?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc0126451-05ad-4ac0-9802-76531315c081_3960x1398.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!EY_x!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc0126451-05ad-4ac0-9802-76531315c081_3960x1398.jpeg 424w, https://substackcdn.com/image/fetch/$s_!EY_x!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc0126451-05ad-4ac0-9802-76531315c081_3960x1398.jpeg 848w, https://substackcdn.com/image/fetch/$s_!EY_x!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc0126451-05ad-4ac0-9802-76531315c081_3960x1398.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!EY_x!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc0126451-05ad-4ac0-9802-76531315c081_3960x1398.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">(<strong><a href="https://gradientflow.com/wp-content/uploads/2026/01/H200-Export-controls-2025-12.jpeg">enlarge</a></strong>)</figcaption></figure></div><div><hr></div><h1><strong>Smart Tool Recommendations</strong></h1><p><strong><a href="https://speechify.com/?utm_source=gradientflow&amp;utm_medium=newsletter">Speechify</a></strong>.  My secret weapon for clearing a &#8220;to-read&#8221; list. <strong><a href="https://speechify.com/?utm_source=gradientflow&amp;utm_medium=newsletter">Speechify</a></strong> turns web pages, newsletters, and PDFs into high-quality audio that actually sounds human. If you struggle to find time to sit and read, this is the solution.</p><p><strong><a href="https://amzn.to/3Nctzj6">Kasa Smart Plug</a></strong>. The most underrated tech in my house. <a href="https://amzn.to/3Nctzj6">Kasa Smart Plugs</a> are cheap, reliable, and dead-simple to use. If you haven&#8217;t automated your lamps or sound system yet, this is your sign to start.</p><div><hr></div><p><em><strong><a href="https://gradientflow.com/disclosure/">Ben Lorica</a></strong> edits the <a href="https://gradientflow.substack.com/">Gradient Flow newsletter</a> and hosts the <strong><a href="https://thedataexchange.media/">Data Exchange podcast</a></strong>. He helps organize the <strong><a href="https://aiconference.com/?utm_source=gradientflow&amp;utm_medium=newsletter">AI Conference</a></strong>, the <strong><a href="https://agentconference.com/?utm_source=gradientflow&amp;utm_medium=newsletter">AI Agent Conference</a></strong>, the <strong><a href="https://appliedaisummit.org/">Applied AI Summit</a></strong>, while also serving as the Strategic Content Chair for AI at the <strong><a href="https://events.linuxfoundation.org/">Linux Foundation</a></strong>. You can follow him on <a href="https://www.linkedin.com/in/benlorica/">Linkedin</a>, <a href="https://x.com/bigdata">X</a>, <a href="https://indieweb.social/@bigdata">Mastodon</a>, <a href="https://www.reddit.com/r/GradientFlow/">Reddit</a>, <a href="https://bsky.app/profile/gradientflow.com">Bluesky</a>, <a href="https://www.youtube.com/c/GradientFlow">YouTube</a>, or <a href="https://www.tiktok.com/@gradientflow">TikTok</a>. This newsletter is produced by <a href="https://gradientflow.com/blog/">Gradient Flow</a>.</em></p>]]></content:encoded></item></channel></rss>