<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0" xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd" xmlns:googleplay="http://www.google.com/schemas/play-podcasts/1.0"><channel><title><![CDATA[Designing Agentic AI]]></title><description><![CDATA[Deep Insights into Agentic AI.]]></description><link>https://melvintercan.com</link><image><url>https://substackcdn.com/image/fetch/$s_!wKll!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1f7abcdb-a30f-4c6c-9a23-166891225445_1000x1000.png</url><title>Designing Agentic AI</title><link>https://melvintercan.com</link></image><generator>Substack</generator><lastBuildDate>Fri, 17 Apr 2026 01:40:19 GMT</lastBuildDate><atom:link href="https://melvintercan.com/feed" rel="self" type="application/rss+xml"/><copyright><![CDATA[Melvin Tercan]]></copyright><language><![CDATA[en]]></language><webMaster><![CDATA[melvintercan@substack.com]]></webMaster><itunes:owner><itunes:email><![CDATA[melvintercan@substack.com]]></itunes:email><itunes:name><![CDATA[Melvin Tercan]]></itunes:name></itunes:owner><itunes:author><![CDATA[Melvin Tercan]]></itunes:author><googleplay:owner><![CDATA[melvintercan@substack.com]]></googleplay:owner><googleplay:email><![CDATA[melvintercan@substack.com]]></googleplay:email><googleplay:author><![CDATA[Melvin Tercan]]></googleplay:author><itunes:block><![CDATA[Yes]]></itunes:block><item><title><![CDATA[5 Lessons for a Productive AI Engineering Team]]></title><description><![CDATA[How We Ship High-Quality Code at High Speed with Linear and Cursor]]></description><link>https://melvintercan.com/p/the-ai-native-engineering-team</link><guid isPermaLink="false">https://melvintercan.com/p/the-ai-native-engineering-team</guid><dc:creator><![CDATA[Melvin Tercan]]></dc:creator><pubDate>Wed, 17 Dec 2025 15:07:31 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!qwHy!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3c2c34e1-0b49-4cc7-9843-c1986681ede4_2048x2048.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!qwHy!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3c2c34e1-0b49-4cc7-9843-c1986681ede4_2048x2048.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!qwHy!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3c2c34e1-0b49-4cc7-9843-c1986681ede4_2048x2048.png 424w, https://substackcdn.com/image/fetch/$s_!qwHy!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3c2c34e1-0b49-4cc7-9843-c1986681ede4_2048x2048.png 848w, https://substackcdn.com/image/fetch/$s_!qwHy!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3c2c34e1-0b49-4cc7-9843-c1986681ede4_2048x2048.png 1272w, https://substackcdn.com/image/fetch/$s_!qwHy!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3c2c34e1-0b49-4cc7-9843-c1986681ede4_2048x2048.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!qwHy!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3c2c34e1-0b49-4cc7-9843-c1986681ede4_2048x2048.png" width="400" height="400" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/3c2c34e1-0b49-4cc7-9843-c1986681ede4_2048x2048.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1456,&quot;width&quot;:1456,&quot;resizeWidth&quot;:400,&quot;bytes&quot;:6630090,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://melvintercan.com/i/181853599?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3c2c34e1-0b49-4cc7-9843-c1986681ede4_2048x2048.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!qwHy!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3c2c34e1-0b49-4cc7-9843-c1986681ede4_2048x2048.png 424w, https://substackcdn.com/image/fetch/$s_!qwHy!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3c2c34e1-0b49-4cc7-9843-c1986681ede4_2048x2048.png 848w, https://substackcdn.com/image/fetch/$s_!qwHy!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3c2c34e1-0b49-4cc7-9843-c1986681ede4_2048x2048.png 1272w, https://substackcdn.com/image/fetch/$s_!qwHy!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3c2c34e1-0b49-4cc7-9843-c1986681ede4_2048x2048.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>At this point, 99% of code in our engineering team is written by AI. The common fear is that this leads to unmaintainable AI slop. Our experience is the exact opposite. AI actually allows us to deliver high-quality code at high speeds. As resource constrained engineers we used to always have to compromise between velocity and quality - now we feel we can have both.</p><p>The models have certainly become smarter, but the tooling has also become significantly better. This combination has transformed our entire development workflow. In fact, traditional Agile sprints aren&#8217;t a thing for us anymore. We have moved beyond the old rituals into a continuous stream of execution. </p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://melvintercan.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading Designing Agentic AI! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p>Here is what currently works for us, after much experimenting:</p><h3>1. We prioritize planning with AI</h3><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!r2oY!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe50cbcf5-6381-4709-aaa1-34822e10af3b_2754x2332.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!r2oY!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe50cbcf5-6381-4709-aaa1-34822e10af3b_2754x2332.png 424w, https://substackcdn.com/image/fetch/$s_!r2oY!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe50cbcf5-6381-4709-aaa1-34822e10af3b_2754x2332.png 848w, https://substackcdn.com/image/fetch/$s_!r2oY!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe50cbcf5-6381-4709-aaa1-34822e10af3b_2754x2332.png 1272w, https://substackcdn.com/image/fetch/$s_!r2oY!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe50cbcf5-6381-4709-aaa1-34822e10af3b_2754x2332.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!r2oY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe50cbcf5-6381-4709-aaa1-34822e10af3b_2754x2332.png" width="1456" height="1233" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/e50cbcf5-6381-4709-aaa1-34822e10af3b_2754x2332.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1233,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:905433,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://melvintercan.com/i/181853599?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe50cbcf5-6381-4709-aaa1-34822e10af3b_2754x2332.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!r2oY!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe50cbcf5-6381-4709-aaa1-34822e10af3b_2754x2332.png 424w, https://substackcdn.com/image/fetch/$s_!r2oY!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe50cbcf5-6381-4709-aaa1-34822e10af3b_2754x2332.png 848w, https://substackcdn.com/image/fetch/$s_!r2oY!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe50cbcf5-6381-4709-aaa1-34822e10af3b_2754x2332.png 1272w, https://substackcdn.com/image/fetch/$s_!r2oY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe50cbcf5-6381-4709-aaa1-34822e10af3b_2754x2332.png 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Even though vibe coding with AI felt like we moved fast initially, we quickly realized that relying solely on iterative coding often meant babysitting the agent all day. Instead, we now shift the heavy lifting to the planning phase. Planning now takes precedence over coding.</p><p>Sometimes, we spend an entire day just on planning. We iterate the plans with multiple models and AI providers. We refine the architecture until we have the perfect specification. We write multiple detailed tickets with clear, atomic requirements in <a href="https://linear.app">Linear</a>.</p><p>Once the plan is solidified, the building step becomes trivial. We assign Linear tickets in bulk to <a href="https://linear.app/integrations/cursor">Cursor agents</a>. We sometimes dispatch tickets to 20 agents at a time to run in parallel. The AI handles the implementation. It writes the boilerplate, connects the APIs, and builds the UI components. This elevates our engineers from &#8220;coders&#8221; to &#8220;system architects&#8221; who manage queues and review logic rather than syntax.</p><h3>2. We let AI find its own work</h3><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!T1Rn!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F06792de4-04f3-48a4-8ddd-41247dcc92e3_2336x2062.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!T1Rn!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F06792de4-04f3-48a4-8ddd-41247dcc92e3_2336x2062.png 424w, https://substackcdn.com/image/fetch/$s_!T1Rn!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F06792de4-04f3-48a4-8ddd-41247dcc92e3_2336x2062.png 848w, https://substackcdn.com/image/fetch/$s_!T1Rn!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F06792de4-04f3-48a4-8ddd-41247dcc92e3_2336x2062.png 1272w, https://substackcdn.com/image/fetch/$s_!T1Rn!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F06792de4-04f3-48a4-8ddd-41247dcc92e3_2336x2062.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!T1Rn!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F06792de4-04f3-48a4-8ddd-41247dcc92e3_2336x2062.png" width="1456" height="1285" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/06792de4-04f3-48a4-8ddd-41247dcc92e3_2336x2062.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1285,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:607682,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://melvintercan.com/i/181853599?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F06792de4-04f3-48a4-8ddd-41247dcc92e3_2336x2062.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!T1Rn!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F06792de4-04f3-48a4-8ddd-41247dcc92e3_2336x2062.png 424w, https://substackcdn.com/image/fetch/$s_!T1Rn!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F06792de4-04f3-48a4-8ddd-41247dcc92e3_2336x2062.png 848w, https://substackcdn.com/image/fetch/$s_!T1Rn!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F06792de4-04f3-48a4-8ddd-41247dcc92e3_2336x2062.png 1272w, https://substackcdn.com/image/fetch/$s_!T1Rn!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F06792de4-04f3-48a4-8ddd-41247dcc92e3_2336x2062.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>A proactive codebase is better than a reactive one. Waiting for bugs to be reported is inefficient. We let the AI play offense. We often use Cursor (also works with other coding tools such as <a href="https://code.claude.com/docs/en/overview">Claude Code</a> / <a href="https://chatgpt.com/features/codex">Codex</a> etc) to proactively scan our codebase for opportunities. It looks for tech debt, refactors, potential performance improvements, and even whole new product features that are implied but missing.</p><p>Once identified, we use <a href="https://linear.app/docs/mcp">Linear MCP</a> to automatically generate backlog tickets based on these findings. Our most important task as an engineering team is then to prioritize these tickets to ensure the highest impactful work is done first.</p><p><strong>Bonus:</strong> We also generate the agent prompt directly in the ticket description. This creates a self-sustaining loop where the ticket is ready to be assigned back to a Cursor agent immediately. This reduces the friction of starting new work to near zero. </p><h3>3. We trust AI but always verify</h3><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!0wn5!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fed817cea-9bd0-4f6d-bdba-8690ab85f396_2573x2387.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!0wn5!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fed817cea-9bd0-4f6d-bdba-8690ab85f396_2573x2387.png 424w, https://substackcdn.com/image/fetch/$s_!0wn5!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fed817cea-9bd0-4f6d-bdba-8690ab85f396_2573x2387.png 848w, https://substackcdn.com/image/fetch/$s_!0wn5!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fed817cea-9bd0-4f6d-bdba-8690ab85f396_2573x2387.png 1272w, https://substackcdn.com/image/fetch/$s_!0wn5!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fed817cea-9bd0-4f6d-bdba-8690ab85f396_2573x2387.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!0wn5!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fed817cea-9bd0-4f6d-bdba-8690ab85f396_2573x2387.png" width="1456" height="1351" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ed817cea-9bd0-4f6d-bdba-8690ab85f396_2573x2387.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1351,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:820973,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://melvintercan.com/i/181853599?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fed817cea-9bd0-4f6d-bdba-8690ab85f396_2573x2387.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!0wn5!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fed817cea-9bd0-4f6d-bdba-8690ab85f396_2573x2387.png 424w, https://substackcdn.com/image/fetch/$s_!0wn5!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fed817cea-9bd0-4f6d-bdba-8690ab85f396_2573x2387.png 848w, https://substackcdn.com/image/fetch/$s_!0wn5!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fed817cea-9bd0-4f6d-bdba-8690ab85f396_2573x2387.png 1272w, https://substackcdn.com/image/fetch/$s_!0wn5!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fed817cea-9bd0-4f6d-bdba-8690ab85f396_2573x2387.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Writing code with AI is incredibly easy and fast but speed without stability is basically racing towards AI slop and technical debt. Agents are trusted, but everything is verified. We use strict linters and rigorous type checking. A strict mandate of 80-100% minimum test coverage has been established as non-negotiable (a standard that has always been hard to prioritize and enforce in traditional engineering teams).</p><p>We rely heavily on pre-commit hooks and GitHub Actions. The workflow is binary: the generated code simply does not merge until it passes every single check. There are no exceptions for small fixes. If the coverage drops or the linter complains, the agent must fix it before a human ever reviews it. This preserves our team&#8217;s mental energy for high-value problem solving.</p><p><strong>Bonus:</strong> Reliance solely on human review is risky, as we can suffer from fatigue (especially from the volumes of AI-generated code we need to review). Therefore every PR gets reviewed in depth by an AI code reviewer (we use <a href="https://www.coderabbit.ai/">CodeRabbit</a> but <a href="https://docs.github.com/en/copilot/how-tos/use-copilot-agents/request-a-code-review/use-code-review">Copilot</a> and <a href="https://docs.cursor.com/en/cli/cookbook/code-review">Cursor</a> also support this) before a human looks at it. It analyzes the logic, checks for security vulnerabilities, and ensures consistency. We in fact also have a branch rule that all of CodeRabbit&#8217;s comments need to be resolved before a PR can get merged.</p><h3>4. We codify hard earned lessons for AI</h3><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!BtzF!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6bcf9dc4-ebf7-420a-8710-e62540f60600_2044x1985.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!BtzF!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6bcf9dc4-ebf7-420a-8710-e62540f60600_2044x1985.png 424w, https://substackcdn.com/image/fetch/$s_!BtzF!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6bcf9dc4-ebf7-420a-8710-e62540f60600_2044x1985.png 848w, https://substackcdn.com/image/fetch/$s_!BtzF!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6bcf9dc4-ebf7-420a-8710-e62540f60600_2044x1985.png 1272w, https://substackcdn.com/image/fetch/$s_!BtzF!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6bcf9dc4-ebf7-420a-8710-e62540f60600_2044x1985.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!BtzF!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6bcf9dc4-ebf7-420a-8710-e62540f60600_2044x1985.png" width="1456" height="1414" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/6bcf9dc4-ebf7-420a-8710-e62540f60600_2044x1985.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1414,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:569467,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://melvintercan.com/i/181853599?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6bcf9dc4-ebf7-420a-8710-e62540f60600_2044x1985.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!BtzF!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6bcf9dc4-ebf7-420a-8710-e62540f60600_2044x1985.png 424w, https://substackcdn.com/image/fetch/$s_!BtzF!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6bcf9dc4-ebf7-420a-8710-e62540f60600_2044x1985.png 848w, https://substackcdn.com/image/fetch/$s_!BtzF!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6bcf9dc4-ebf7-420a-8710-e62540f60600_2044x1985.png 1272w, https://substackcdn.com/image/fetch/$s_!BtzF!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6bcf9dc4-ebf7-420a-8710-e62540f60600_2044x1985.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>We don&#8217;t just hope the AI understands our style; we write it down. We guide the agents with strict, project-specific rules files (we like <em><a href="https://cursor.com/docs/context/rules">.cursorrules</a> </em>but there&#8217;s also <a href="https://www.anthropic.com/engineering/claude-code-best-practices">AGENTS.md</a>). For example, one of the key agent rules we have for each project is to commit and push after every meaningful change. This ensures we have a granular history for easy rollbacks when things go wrong (which they often do). <br><br>We treat these rules as living documents of our institutional knowledge. Each time we encounter a new issue or a repeating pattern of failure, we add a specific constraint to the rules to prevent it from happening again. Over time this produces a body of work that allows us to move even faster because we&#8217;re not bogged down by repeated mistakes of the past.</p><h3>5. We ensure AI has a great Developer Experience</h3><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!3pTq!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F631fc668-56fa-4ad8-969e-57aca44d186d_2030x1808.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!3pTq!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F631fc668-56fa-4ad8-969e-57aca44d186d_2030x1808.png 424w, https://substackcdn.com/image/fetch/$s_!3pTq!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F631fc668-56fa-4ad8-969e-57aca44d186d_2030x1808.png 848w, https://substackcdn.com/image/fetch/$s_!3pTq!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F631fc668-56fa-4ad8-969e-57aca44d186d_2030x1808.png 1272w, https://substackcdn.com/image/fetch/$s_!3pTq!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F631fc668-56fa-4ad8-969e-57aca44d186d_2030x1808.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!3pTq!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F631fc668-56fa-4ad8-969e-57aca44d186d_2030x1808.png" width="1456" height="1297" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/631fc668-56fa-4ad8-969e-57aca44d186d_2030x1808.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1297,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:554222,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://melvintercan.com/i/181853599?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F631fc668-56fa-4ad8-969e-57aca44d186d_2030x1808.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!3pTq!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F631fc668-56fa-4ad8-969e-57aca44d186d_2030x1808.png 424w, https://substackcdn.com/image/fetch/$s_!3pTq!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F631fc668-56fa-4ad8-969e-57aca44d186d_2030x1808.png 848w, https://substackcdn.com/image/fetch/$s_!3pTq!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F631fc668-56fa-4ad8-969e-57aca44d186d_2030x1808.png 1272w, https://substackcdn.com/image/fetch/$s_!3pTq!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F631fc668-56fa-4ad8-969e-57aca44d186d_2030x1808.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Investing in Developer Experience (DX) has always been key to productivity. With AI, it is the difference between a toy and a tool. A flaky environment confuses a human, but it breaks an agent.</p><p>We invest heavily in robust testing harnesses and puppeteering tools that allow the AI for example to control the browser directly or to easily spin up a docker environment. This enables the agent to do more than just write code; it can run the app, click through flows, verify the UI, and iterate until the solution actually works. When the DX is optimized for machines, the AI becomes exponentially more powerful because it can close its own feedback loop.</p><div><hr></div><p>Of course, we are still learning, breaking things, and rebuilding them better. So while the above might still evolve, it&#8217;s just what <em>currently</em> works for us.</p><p>I&#8217;m curious to know what part of the AI-native transition is clicking for your engineering team, and where you think we might be getting it wrong. Let&#8217;s compare notes!</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://melvintercan.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading Designing Agentic AI! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[European AI: Less Red Tape, More Agentic Innovation]]></title><description><![CDATA[A Call to Transform EU Regulation into a Competitive Edge]]></description><link>https://melvintercan.com/p/rebooting-europes-ai-cut-the-red</link><guid isPermaLink="false">https://melvintercan.com/p/rebooting-europes-ai-cut-the-red</guid><dc:creator><![CDATA[Melvin Tercan]]></dc:creator><pubDate>Wed, 12 Feb 2025 18:02:08 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!8TTE!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F17cb615f-ee5a-4a75-a5b5-c61032bc47e1_1021x837.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!8TTE!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F17cb615f-ee5a-4a75-a5b5-c61032bc47e1_1021x837.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!8TTE!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F17cb615f-ee5a-4a75-a5b5-c61032bc47e1_1021x837.jpeg 424w, https://substackcdn.com/image/fetch/$s_!8TTE!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F17cb615f-ee5a-4a75-a5b5-c61032bc47e1_1021x837.jpeg 848w, https://substackcdn.com/image/fetch/$s_!8TTE!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F17cb615f-ee5a-4a75-a5b5-c61032bc47e1_1021x837.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!8TTE!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F17cb615f-ee5a-4a75-a5b5-c61032bc47e1_1021x837.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!8TTE!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F17cb615f-ee5a-4a75-a5b5-c61032bc47e1_1021x837.jpeg" width="1021" height="837" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/17cb615f-ee5a-4a75-a5b5-c61032bc47e1_1021x837.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:837,&quot;width&quot;:1021,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:78032,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!8TTE!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F17cb615f-ee5a-4a75-a5b5-c61032bc47e1_1021x837.jpeg 424w, https://substackcdn.com/image/fetch/$s_!8TTE!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F17cb615f-ee5a-4a75-a5b5-c61032bc47e1_1021x837.jpeg 848w, https://substackcdn.com/image/fetch/$s_!8TTE!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F17cb615f-ee5a-4a75-a5b5-c61032bc47e1_1021x837.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!8TTE!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F17cb615f-ee5a-4a75-a5b5-c61032bc47e1_1021x837.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Adriaan Mol, one of the most successful Dutch entrepreneurs, frequently criticizes Europe&#8217;s eagerness for regulation over innovation.</figcaption></figure></div><p>Europe often prides itself on being the &#8220;good boy&#8221; of global tech regulation, championing ethics, privacy, and accountability. But as the AI revolution accelerates, this virtue threatens to become a liability.</p><p>The landmark <a href="https://digital-strategy.ec.europa.eu/en/policies/regulatory-framework-ai">AI Act</a>, crafted in good faith to protect citizens, now threatens to suffocate the very innovation it sought to guide. Its stringent requirements around copyright compliance, transparency, and risk mitigation are already creating an innovation chokehold. Startups, already scrambling to compete with well-funded U.S. and Chinese giants, now face a labyrinth of legal hurdles. The result? A growing exodus of talent, capital, and ambition, leaving Europe at risk of becoming a bystander in the defining technological shift of our era.</p><p>The stakes couldn&#8217;t be higher. As Apple, OpenAI, Meta and Anthropic increasingly withhold cutting-edge AI features from European users over regulatory fears, the U.S. solidifies its dominance in foundation models, while China races ahead with pragmatism (and minimal copyright scruples). Europe&#8217;s choice is stark: adapt or accept irrelevance.</p><h3><strong>The Problem: How the AI Act is Holding Back AI Innovation</strong></h3><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!KJoG!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3049099a-9851-4803-88fa-e86477209ff3_1024x768.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!KJoG!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3049099a-9851-4803-88fa-e86477209ff3_1024x768.jpeg 424w, https://substackcdn.com/image/fetch/$s_!KJoG!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3049099a-9851-4803-88fa-e86477209ff3_1024x768.jpeg 848w, https://substackcdn.com/image/fetch/$s_!KJoG!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3049099a-9851-4803-88fa-e86477209ff3_1024x768.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!KJoG!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3049099a-9851-4803-88fa-e86477209ff3_1024x768.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!KJoG!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3049099a-9851-4803-88fa-e86477209ff3_1024x768.jpeg" width="1024" height="768" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/3049099a-9851-4803-88fa-e86477209ff3_1024x768.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:768,&quot;width&quot;:1024,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:162783,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!KJoG!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3049099a-9851-4803-88fa-e86477209ff3_1024x768.jpeg 424w, https://substackcdn.com/image/fetch/$s_!KJoG!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3049099a-9851-4803-88fa-e86477209ff3_1024x768.jpeg 848w, https://substackcdn.com/image/fetch/$s_!KJoG!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3049099a-9851-4803-88fa-e86477209ff3_1024x768.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!KJoG!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3049099a-9851-4803-88fa-e86477209ff3_1024x768.jpeg 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">The EU eagerly rushed to regulate AI, proudly declaring itself the &#8220;first&#8221; to do so, as if regulation itself were a badge of honor.</figcaption></figure></div><p>The AI Act&#8217;s most controversial mandate, <strong>requiring developers to prove their models were trained on copyright-free or publicly licensed data</strong>, sounds reasonable in theory. But for startups, it&#8217;s a logistical and financial nightmare. Auditing massive datasets for copyright compliance is a herculean task, one that only deep-pocketed corporations can afford.</p><p>But that&#8217;s just the beginning. The Act&#8217;s <strong>broad definition of AI</strong> casts a wide regulatory net, potentially pulling even low-risk applications into its compliance regime. This overreach creates uncertainty for developers, making it difficult to determine what rules apply and discouraging innovation in areas that pose minimal risk.</p><p>The <strong>high compliance costs</strong> further stack the odds against startups. Conformity assessments, extensive documentation, and continuous monitoring (all mandatory under the Act) place an enormous financial burden on smaller companies. While industry giants can absorb these costs, startups with limited resources face a crushing disadvantage.</p><p>Then there&#8217;s the issue of <strong>delayed time-to-market</strong>. The EU&#8217;s extensive regulatory hurdles slow down AI development cycles, making it nearly impossible for agile startups to compete in a global landscape where speed is everything. In a field where first-mover advantage is critical, these bureaucratic delays could spell doom for European-born AI ventures.</p><p>Even <strong>Big Tech is wary, not just small startups</strong>. If the world&#8217;s largest and wealthiest tech companies find the AI Act too restrictive, what chance do startups have? Meta recently disabled its AI assistant, Meta AI, for European users, citing &#8220;regulatory uncertainty.&#8221; OpenAI quietly restricts access to voice cloning tools in the EU, and Google&#8217;s Gemini avoids entire markets over compliance fears. These aren&#8217;t small players struggling to keep up, these are trillion-dollar companies deciding that Europe&#8217;s rules are simply too difficult, too risky, or too costly to comply with.</p><p>Meanwhile, talent flees. European AI researchers increasingly decamp to Silicon Valley or Shenzhen, where funding flows freely and experimentation isn&#8217;t shackled by <strong>preemptive red tape and legal ambiguity</strong>. The Act&#8217;s vague classifications of AI risk levels leave developers second-guessing their own innovations, making compliance a constant guessing game rather than a clear roadmap.</p><p>In its attempt to lead on regulation, Europe may have written itself out of the AI race altogether.</p><h3><strong>The Geopolitical Stakes: US &amp; China Are Pulling Ahead</strong></h3><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!43QL!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3b104a53-20e4-46b7-acf5-2299fbba98e8_1200x827.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!43QL!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3b104a53-20e4-46b7-acf5-2299fbba98e8_1200x827.jpeg 424w, https://substackcdn.com/image/fetch/$s_!43QL!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3b104a53-20e4-46b7-acf5-2299fbba98e8_1200x827.jpeg 848w, https://substackcdn.com/image/fetch/$s_!43QL!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3b104a53-20e4-46b7-acf5-2299fbba98e8_1200x827.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!43QL!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3b104a53-20e4-46b7-acf5-2299fbba98e8_1200x827.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!43QL!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3b104a53-20e4-46b7-acf5-2299fbba98e8_1200x827.jpeg" width="1200" height="827" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/3b104a53-20e4-46b7-acf5-2299fbba98e8_1200x827.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:827,&quot;width&quot;:1200,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:108679,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!43QL!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3b104a53-20e4-46b7-acf5-2299fbba98e8_1200x827.jpeg 424w, https://substackcdn.com/image/fetch/$s_!43QL!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3b104a53-20e4-46b7-acf5-2299fbba98e8_1200x827.jpeg 848w, https://substackcdn.com/image/fetch/$s_!43QL!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3b104a53-20e4-46b7-acf5-2299fbba98e8_1200x827.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!43QL!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3b104a53-20e4-46b7-acf5-2299fbba98e8_1200x827.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>The global race for artificial intelligence (AI) supremacy is intensifying, with the United States and China adopting distinct strategies that are propelling them ahead of Europe.</p><ul><li><p><strong>United States: Private Sector Dominance and Deregulation: </strong>In the US, the AI landscape is driven by private sector innovation. Companies like OpenAI, Anthropic, and Google lead with advanced models such as GPT-4, Claude 3, and Gemini, respectively. The U.S. government's approach has been to minimize regulatory barriers, fostering an environment where AI research and development can thrive. In a significant move to bolster AI infrastructure, President Donald Trump recently announced the <a href="https://www.forbes.com/sites/craigsmith/2025/01/23/stargate-americas-500-billion-bid-to-corner-global-ai-capital/">Stargate initiative</a>, a joint venture involving OpenAI, SoftBank, and Oracle. This project plans to invest up to $500 billion in AI infrastructure, including the construction of data centers and energy generation facilities across the United States. The initiative underscores the administration's commitment to accelerating AI development and maintaining a leading position in the global AI race.</p></li><li><p><strong>China: State-Backed Ambitions and Rapid Implementation: </strong>China's approach to AI is characterized by substantial government support and a focus on rapid deployment. The Chinese government has committed significant funding to AI initiatives, including a <a href="https://www.cnn.com/2024/05/27/tech/china-semiconductor-investment-fund-intl-hnk/index.html">$50 billion semiconductor fund</a> to bolster its technological infrastructure. Companies like DeepSeek have emerged, developing state-of-the-art models efficiently and cost-effectively. This progress is facilitated by China's more lenient data regulations, allowing for swift integration of AI technologies across various sectors.</p></li><li><p><strong>Europe: Risk of Becoming a Consumer Rather Than a Creator: </strong>In contrast, Europe faces challenges that could relegate it to the role of an AI consumer rather than a creator. Despite having a large consumer market and abundant talent, the EU lags behind global leaders like China and the US. Recurrent issues include excessive focus on AI risks and stringent regulations that stifle innovation. Notably, groundbreaking AI tools, such as OpenAI's <a href="https://www.euronews.com/next/2025/01/24/openai-launches-first-ai-agent-operator-but-it-wont-be-coming-to-europe-yet#:~:text=OpenAI%20launches%20first%20AI%20agent,coming%20to%20Europe%20yet%20%7C%20Euronews">Operator</a>, are inaccessible in the EU due to regulatory delays, hampering competitiveness. Without cultivating homegrown AI champions, Europe risks dependence on foreign technologies for critical applications, from healthcare algorithms to defense systems, thereby ceding economic power and strategic autonomy.</p></li></ul><h3><strong>The Silver Lining: Europe&#8217;s Counteroffensive</strong></h3><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!aJxm!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F22fdb913-2ee6-4e80-9db4-92b60dec1501_2000x1333.webp" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!aJxm!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F22fdb913-2ee6-4e80-9db4-92b60dec1501_2000x1333.webp 424w, https://substackcdn.com/image/fetch/$s_!aJxm!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F22fdb913-2ee6-4e80-9db4-92b60dec1501_2000x1333.webp 848w, https://substackcdn.com/image/fetch/$s_!aJxm!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F22fdb913-2ee6-4e80-9db4-92b60dec1501_2000x1333.webp 1272w, https://substackcdn.com/image/fetch/$s_!aJxm!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F22fdb913-2ee6-4e80-9db4-92b60dec1501_2000x1333.webp 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!aJxm!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F22fdb913-2ee6-4e80-9db4-92b60dec1501_2000x1333.webp" width="1456" height="970" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/22fdb913-2ee6-4e80-9db4-92b60dec1501_2000x1333.webp&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:970,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:186144,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/webp&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!aJxm!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F22fdb913-2ee6-4e80-9db4-92b60dec1501_2000x1333.webp 424w, https://substackcdn.com/image/fetch/$s_!aJxm!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F22fdb913-2ee6-4e80-9db4-92b60dec1501_2000x1333.webp 848w, https://substackcdn.com/image/fetch/$s_!aJxm!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F22fdb913-2ee6-4e80-9db4-92b60dec1501_2000x1333.webp 1272w, https://substackcdn.com/image/fetch/$s_!aJxm!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F22fdb913-2ee6-4e80-9db4-92b60dec1501_2000x1333.webp 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Not all hope is lost. A new wave of investment and ambition is brewing, led by France.</p><ul><li><p><strong>Macron&#8217;s &#8364;109 Billion Initiative: </strong>France is making AI a national priority under President Emmanuel Macron's leadership. He has announced a <a href="https://www.lemonde.fr/en/economy/article/2025/02/10/ai-with-the-announcement-of-a-109-billion-investment-macron-intends-to-take-on-the-us_6737985_19.html">&#8364;109 billion investment plan</a> to bolster the country's AI capabilities. This initiative includes substantial funding from international investors, such as &#8364;50 billion from the United Arab Emirates and &#8364;20 billion from Canadian firm Brookfield. A significant portion of these funds is allocated for developing advanced data centers, leveraging France's nuclear energy infrastructure to provide efficient and sustainable power solutions. This strategy not only enhances computational capacity but also positions France as a leader in eco-friendly AI development.</p></li><li><p><strong>The EU&#8217;s &#8364;200 Billion InvestAI Program: </strong>Complementing national efforts, the European Union has launched the <a href="https://digital-strategy.ec.europa.eu/en/news/eu-launches-investai-initiative-mobilise-eu200-billion-investment-artificial-intelligence">InvestAI</a> initiative, aiming to mobilize &#8364;200 billion for AI advancement. This program focuses on establishing AI gigafactories, large-scale facilities equipped with cutting-edge technology to train sophisticated AI models. The initiative seeks to foster collaboration among member states, reduce reliance on external tech giants, and ensure that Europe remains competitive in the rapidly evolving AI landscape.</p></li><li><p><strong>The EU-Inc Initiative: Streamlining Business Across Borders: </strong>In a bid to further enhance competitiveness, an online community led by Philipp Herkelmann has proposed the creation of "<a href="https://www.eu-startups.com/2024/12/eu-inc-calls-on-new-commission-turn-the-idea-of-a-single-pan-european-startup-entity-into-reality/">EU-Inc</a>," a unified, digital-first private limited liability company structure for the European Union. This initiative aims to address the challenges of fragmented national regulations by establishing a standardized corporate framework governed by EU regulations. Key features include no minimum capital requirement, flexible share structures, and streamlined cross-border operations. The proposal seeks to make it easier for businesses to operate across EU member states, thereby fostering innovation and economic growth.</p></li><li><p><strong>Learning from China&#8217;s DeepSeek: </strong>Europe is also drawing inspiration from international success stories. For instance, China's DeepSeek has demonstrated that significant AI advancements can be achieved with leaner, more <a href="https://www.cfodive.com/news/deepseeks-lower-cost-ai-could-supercharge-adoption-use-cases-goldman/739775/">cost-effective</a> models. This approach underscores the potential for European startups to innovate efficiently, provided they navigate the regulatory environment effectively.</p></li></ul><p>Through these concerted efforts, Europe aims to not only catch up with but also lead in the global AI race, balancing innovation with ethical considerations and strategic autonomy.</p><h3><strong>A Call for Regulatory Balance: Learning from Mistakes</strong></h3><p>The AI Act&#8217;s rigid early framework needs urgent recalibration. France is already pushing for carveouts to protect startups, arguing that &#8220;innovation cannot thrive under fear.&#8221; Key fixes could include:</p><ul><li><p><strong>Regulatory Sandboxes: </strong>The AI Act proposes the establishment of coordinated AI '<a href="https://artificialintelligenceact.eu/article/57/">regulatory sandboxes</a>' across EU member states. These sandboxes are controlled environments where businesses can develop, test, and validate AI systems under the supervision of national authorities before market deployment. This approach allows innovators to experiment with new technologies while ensuring compliance with regulatory standards. It also provides regulators with insights into emerging technologies, facilitating a balanced oversight mechanism.</p></li><li><p><strong>Public Data Commons: </strong>To alleviate the burden of sourcing high-quality, copyright-free data, the EU could establish publicly funded repositories. These <a href="https://openfuture.eu/publication/public-data-commons/">public data commons</a> would offer startups access to extensive datasets necessary for training AI models, thereby reducing costs and promoting innovation. Such initiatives would ensure that smaller enterprises have the resources to develop competitive AI solutions without the prohibitive expenses associated with data acquisition.</p></li><li><p><strong>Talent Fast Lanes: </strong>Attracting and retaining top AI talent is crucial for Europe's competitiveness. Implementing streamlined visa processes and offering tax incentives for AI professionals and researchers can make the EU a more appealing destination for global talent. These measures would help build a robust AI ecosystem, fostering collaboration and accelerating advancements within the region.</p></li></ul><p>By adopting these strategies, the EU can strike a balance between maintaining ethical standards and promoting a dynamic, innovation-friendly environment for AI development.</p><h3><strong>The Opportunity: Europe&#8217;s Agentic AI Frontier</strong></h3><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!CZ98!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcadf4f9c-0fcc-4cad-820f-83131aee03b6_1024x1024.webp" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!CZ98!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcadf4f9c-0fcc-4cad-820f-83131aee03b6_1024x1024.webp 424w, https://substackcdn.com/image/fetch/$s_!CZ98!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcadf4f9c-0fcc-4cad-820f-83131aee03b6_1024x1024.webp 848w, https://substackcdn.com/image/fetch/$s_!CZ98!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcadf4f9c-0fcc-4cad-820f-83131aee03b6_1024x1024.webp 1272w, https://substackcdn.com/image/fetch/$s_!CZ98!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcadf4f9c-0fcc-4cad-820f-83131aee03b6_1024x1024.webp 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!CZ98!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcadf4f9c-0fcc-4cad-820f-83131aee03b6_1024x1024.webp" width="1024" height="1024" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/cadf4f9c-0fcc-4cad-820f-83131aee03b6_1024x1024.webp&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1024,&quot;width&quot;:1024,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:493628,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/webp&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!CZ98!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcadf4f9c-0fcc-4cad-820f-83131aee03b6_1024x1024.webp 424w, https://substackcdn.com/image/fetch/$s_!CZ98!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcadf4f9c-0fcc-4cad-820f-83131aee03b6_1024x1024.webp 848w, https://substackcdn.com/image/fetch/$s_!CZ98!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcadf4f9c-0fcc-4cad-820f-83131aee03b6_1024x1024.webp 1272w, https://substackcdn.com/image/fetch/$s_!CZ98!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcadf4f9c-0fcc-4cad-820f-83131aee03b6_1024x1024.webp 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>While the EU has fallen behind in foundational AI model development, the next wave of AI innovation is still up for grabs: <em>Agentic AI</em>, autonomous systems that act, reason, and collaborate. Unlike traditional AI models that require constant human input, these AI "agents" can execute tasks end-to-end, from managing supply chains to running customer service operations.</p><p>This shift toward Agentic AI is why I launched this newsletter. As an engineer and entrepreneur working at the intersection of AI and automation, I&#8217;ve seen firsthand how agentic systems can transform industries. While much of today&#8217;s AI conversation is dominated by large language models, the real opportunity lies in designing AI that doesn&#8217;t just generate text but acts autonomously to solve real-world problems. Europe, with its strengths in robotics, industrial automation, and ethical AI, is uniquely positioned to lead this transformation.</p><h4>Why Agentic AI?</h4><ol><li><p><strong>Lower Computational Demands: </strong>Unlike massive foundation models that require enormous GPU clusters, agentic AI is designed to be modular and efficient. These systems focus on task-specific reasoning rather than brute-force computation, making them cheaper to deploy and more accessible to startups.</p></li><li><p><strong>Alignment with European Expertise: </strong>Europe has long been a leader in robotics, IoT, and industrial automation, all fields where agentic AI can thrive. Unlike the U.S., where AI is largely consumer-focused, Europe can build AI systems that enhance factories, optimize logistics, and power next-gen smart infrastructure.</p></li><li><p><strong>Ethical Leadership</strong><br>The EU&#8217;s emphasis on trust, safety, and transparency is a competitive advantage. In a world increasingly wary of AI&#8217;s black-box decision-making, Europe can become the go-to supplier of responsible, explainable AI that companies and governments can trust.</p></li></ol><p>Agentic AI is still at the frontier of innovation. While the U.S. dominates foundation models and China races ahead in scale, Europe can carve out its own leadership niche by pioneering intelligent, autonomous systems that power the real economy.</p><p>This is the mission behind <em>Designing Agentic Systems</em>, to explore how AI agents can reshape industries, unlock new entrepreneurial opportunities, and drive the next wave of innovation. Europe still has a shot at winning this race. But it needs to act&#8212;<strong>now. </strong></p><div><hr></div><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://melvintercan.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption"><em>In upcoming issues, we&#8217;ll dive deeper into how to design autonomous agents. Subscribe to stay ahead of the curve and learn how to shape the future of work.</em></p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p></p>]]></content:encoded></item><item><title><![CDATA[A Day in the Life of an Agentic Marketing Manager]]></title><description><![CDATA[How Agentic AI can solve complex marketing challenges in a fraction of the time]]></description><link>https://melvintercan.com/p/a-day-in-the-life-of-an-agentic-marketing</link><guid isPermaLink="false">https://melvintercan.com/p/a-day-in-the-life-of-an-agentic-marketing</guid><dc:creator><![CDATA[Melvin Tercan]]></dc:creator><pubDate>Wed, 15 Jan 2025 17:23:56 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!fgYF!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4f7e1731-71a0-41b4-be93-e93b23bf0674_1024x1024.webp" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!fgYF!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4f7e1731-71a0-41b4-be93-e93b23bf0674_1024x1024.webp" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!fgYF!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4f7e1731-71a0-41b4-be93-e93b23bf0674_1024x1024.webp 424w, https://substackcdn.com/image/fetch/$s_!fgYF!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4f7e1731-71a0-41b4-be93-e93b23bf0674_1024x1024.webp 848w, https://substackcdn.com/image/fetch/$s_!fgYF!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4f7e1731-71a0-41b4-be93-e93b23bf0674_1024x1024.webp 1272w, https://substackcdn.com/image/fetch/$s_!fgYF!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4f7e1731-71a0-41b4-be93-e93b23bf0674_1024x1024.webp 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!fgYF!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4f7e1731-71a0-41b4-be93-e93b23bf0674_1024x1024.webp" width="1024" height="1024" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/4f7e1731-71a0-41b4-be93-e93b23bf0674_1024x1024.webp&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1024,&quot;width&quot;:1024,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:470986,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/webp&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!fgYF!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4f7e1731-71a0-41b4-be93-e93b23bf0674_1024x1024.webp 424w, https://substackcdn.com/image/fetch/$s_!fgYF!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4f7e1731-71a0-41b4-be93-e93b23bf0674_1024x1024.webp 848w, https://substackcdn.com/image/fetch/$s_!fgYF!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4f7e1731-71a0-41b4-be93-e93b23bf0674_1024x1024.webp 1272w, https://substackcdn.com/image/fetch/$s_!fgYF!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4f7e1731-71a0-41b4-be93-e93b23bf0674_1024x1024.webp 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Imagine you&#8217;re a marketing manager in charge of a direct mail campaign. Your goal is to target high-income households in Austin, Texas&#8212;a diverse city of nearly 1 million residents. You have access to a database of over a million households, each with detailed demographic data: income, age, housing type, and more.</p><p>Your challenge? Create a segment that includes at least 1,000 households but no more than 10,000,  which also needs to align closely with your Ideal Customer Profile (ICP) to maximize relevance and impact.</p><p>This isn&#8217;t a simple task. With dozens of demographic filters and numerous possible configurations, the combinations quickly explode into billions. To make matters worse, these filters aren&#8217;t independent&#8212;certain income groups correlate with specific neighborhoods, and younger families are unlikely to live in retirement areas.</p><p>Even if you spent days manually researching and testing filters, you&#8217;d constantly question whether you&#8217;d found the best audience. What if a better combination exists? This uncertainty is what makes audience segmentation such a time-consuming and challenging problem.</p><h3>Why Manual Segmentation Is Challenging</h3><p>Segmentation is one of the hardest tasks in marketing because it requires balancing two competing forces: scale and precision. On one hand, you need a segment large enough to justify your campaign, but on the other, it must be narrow enough to target the right audience effectively.</p><p>If you were to tackle this task manually, here&#8217;s what the process might look like:</p><ol><li><p><strong>Research the Region:</strong> You&#8217;d start by googling demographic data about Austin. After hours of work, you might find that the median household income is $71,500, the city skews younger, and there&#8217;s a mix of renters in urban areas and homeowners in suburbs.</p></li><li><p><strong>Propose Initial Filters:</strong> Based on your research, you&#8217;d guess that your target audience might be:</p><ul><li><p><em>Age:</em> 25&#8211;40</p></li><li><p><em>Income:</em> $75,000&#8211;$150,000</p></li><li><p><em>Housing Type:</em> Single-family homes or luxury apartments</p></li></ul><p>But these are just educated guesses. There&#8217;s no guarantee this segment will work, and the process of refining it could take hours&#8212;or even days.</p></li><li><p><strong>Test and Iterate:</strong> You&#8217;d query your database repeatedly, tweaking filters to see how many households match. If one set of filters results in 50,000 households, it&#8217;s too large. If another results in 500, it&#8217;s too small. This trial-and-error process is slow and frustrating, and even after days of effort, you&#8217;d still wonder: Did I miss a better option? And even after finding the right number, you need to ensure it&#8217;s relevant to your Ideal Customer Profile (ICP).</p></li></ol><h3>How Agentic AI Can Tackle This Problem</h3><p>Modern large language models come equipped with world knowledge&#8212;a built-in understanding of relationships between concepts, based on the vast data they&#8217;ve been trained on. This allows them to make intelligent guesses about demographic patterns, similar to how an experienced marketing manager would, but with far greater speed and scale.</p><p>For instance, an LLM might &#8220;know&#8221; that high-income households are concentrated in suburban areas or that younger professionals tend to rent apartments downtown. Instead of starting from scratch, it uses this knowledge to propose meaningful filter combinations right away.</p><p>But AI isn&#8217;t perfect&#8212;it can sometimes &#8220;hallucinate,&#8221; offering insights that sound plausible but aren&#8217;t grounded in reality. However, this challenge can be addressed by allowing the AI agent to autonomously conduct online research by querying sources like Google to validate and enhance its recommendations with real-world data.</p><p>By combining grounded insights with its world model, the AI agent avoids hallucinations and ensures its recommendations are tied to real-world data.</p><h3>Iterative Refinement with Bayesian Inference</h3><p>Once the AI agent proposes a set of filters, the next step is to test them against your database to see how many households match. This is where Bayesian inference comes into play&#8212;a statistical method that helps the AI agent refine its understanding based on new evidence.</p><p>Think of Bayesian inference as a feedback loop:</p><ol><li><p>The AI agent starts with an initial &#8220;belief&#8221; about what filters will work (e.g., income $75,000&#8211;$150,000, age 25&#8211;40).</p></li><li><p>It tests these filters and gets a result (e.g., 5,000 households match).</p></li><li><p>It updates its belief based on this result, refining the filters to get closer to your goal.</p></li></ol><p>This iterative process ensures the AI agent doesn&#8217;t waste time on irrelevant combinations. Instead, it continuously learns and adjusts, converging on the right segment far faster than any human could.</p><h3>The Science Behind the Process</h3><p>Agentic AI&#8217;s ability to tackle complex segmentation problems is grounded in well-established scientific principles:</p><ul><li><p><strong>Reducing Uncertainty (Shannon Entropy)</strong>: At the start, the AI agent faces high uncertainty with billions of possible filter combinations. Its goal is to systematically reduce this uncertainty by exploring the most promising options first.</p></li><li><p><strong>Systematic Exploration (Ergodic Theory)</strong>: The AI agent ensures that, given enough iterations, it will explore all relevant possibilities. This doesn&#8217;t mean brute-forcing every combination&#8212;it means intelligently narrowing the search space to focus on areas most likely to succeed.</p></li><li><p><strong>Iterative Learning (Bayesian Inference)</strong>: Each test provides new data, allowing the agent to update its strategy and refine its filters in real time. This dynamic learning process mirrors how humans learn from feedback but operates at a much faster scale.</p></li></ul><p>Together, these principles explain why agentic AI can solve segmentation problems in minutes, not days.</p><h3>The Result: Smarter Segmentation in Minutes</h3><p>Let&#8217;s go back to our example of targeting Austin, Texas. Using agentic AI, the process might look like this:</p><ol><li><p>The AI agent conducts online research to gather demographic insights about the city.</p></li><li><p>It uses its world model to propose an initial set of filters based on these insights.</p></li><li><p>It tests the filters against your database, refining them through iterative feedback loops until it finds a segment that meets your constraints.</p></li></ol><p>In minutes, the AI agent delivers a segment like this:</p><ul><li><p><strong>Age</strong>: 25&#8211;40</p></li><li><p><strong>Income</strong>: $75,000&#8211;$150,000</p></li><li><p><strong>Housing Type</strong>: Single-family homes or upscale apartments</p></li><li><p><strong>Segment Size</strong>: 3,200 households</p></li></ul><p>What would have taken days of manual work&#8212;and still left you questioning the results&#8212;is now done in a fraction of the time, with greater accuracy and confidence.</p><h3>Taking Personalization and Insights to the Next Level</h3><p>Agentic AI doesn&#8217;t just stop at segmentation. It opens the door to unprecedented levels of precision and creativity, allowing marketers to achieve what once seemed unattainable. Here are a few possibilities that can unlock things that were previously impossible or unfeasible:</p><ul><li><p><strong>Hyper-Personalization: </strong>Conversion rates for direct mailing campaigns are typically very low. Agentic AI can massively improve your conversation rates by utilizing hyper-personalization. Imagine crafting individual postcards tailored to <em>each </em>individual recipient&#8217;s profile&#8212;not just using their first name but designing the text, image, and messaging to resonate deeply with their lifestyle and your brand&#8217;s messaging. For instance:</p><ul><li><p><strong>Family Households with Children</strong>: The postcard might highlight benefits or products that appeal to parents, using imagery of happy families and messaging that emphasizes safety and comfort.</p></li><li><p><strong>Young Single Professionals</strong>: These recipients could receive sleek, modern designs featuring messaging that highlights efficiency, independence, or career growth. </p></li></ul></li><li><p><strong>Simulation of Consumer Surveys: </strong>Additionally, Agentic AI can simulate hundreds or thousands of consumer surveys for advanced insights. Normally, crafting 50 micro-segments and surveying representative consumers would be expensive and time-consuming. With AI, you can simulate these interactions, gaining a statistical approximation of customer preferences and messaging effectiveness. While no model is perfect, these simulations provide valuable insights without the time and cost overhead of traditional surveys.</p></li></ul><p>Agentic AI helps marketers achieve what once seemed impossible: hyper-personalized messaging and scalable, data-driven insights, all within the constraints of modern campaigns.</p><h3>Why Agentic AI Matters for Marketers</h3><p>By combining the intelligence of LLMs with real-world grounding and iterative refinement, it tackles segmentation challenges that previously felt impossible.</p><p>For marketers, this means:</p><ul><li><p><strong>Fewer Hours Spent on Repetitive Tasks</strong>: The AI agent handles the heavy lifting, freeing you to focus on strategy.</p></li><li><p><strong>More Confidence in Your Results</strong>: Grounded insights and systematic refinement ensure accuracy.</p></li><li><p><strong>Scalability Across Campaigns</strong>: With AI agent, you can apply this process to multiple regions or audiences simultaneously.</p></li></ul><p>If segmentation has ever felt like an impossible puzzle, agentic AI is the tool that solves it&#8212;quickly, intelligently, and at scale.</p><div><hr></div><p><em>Marketing is just one of many professions poised to be transformed by agentic AI. Stay tuned to learn how this powerful technology can revolutionize your work. Subscribe to this newsletter for more insights and practical guides on implementing agentic AI across industries.</em></p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://melvintercan.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://melvintercan.com/subscribe?"><span>Subscribe now</span></a></p>]]></content:encoded></item><item><title><![CDATA[The Shortcomings of Modern Automation]]></title><description><![CDATA[Modern automation platforms like SaaS, RPA & Low-Code/No-Code promised transformation but failed to deliver, causing new problems instead.]]></description><link>https://melvintercan.com/p/shortcomings-of-modern-automation</link><guid isPermaLink="false">https://melvintercan.com/p/shortcomings-of-modern-automation</guid><dc:creator><![CDATA[Melvin Tercan]]></dc:creator><pubDate>Thu, 09 Jan 2025 19:00:51 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!npjU!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F94535676-caf7-4189-837d-66d8cae24dd9_1024x1024.webp" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!npjU!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F94535676-caf7-4189-837d-66d8cae24dd9_1024x1024.webp" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!npjU!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F94535676-caf7-4189-837d-66d8cae24dd9_1024x1024.webp 424w, https://substackcdn.com/image/fetch/$s_!npjU!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F94535676-caf7-4189-837d-66d8cae24dd9_1024x1024.webp 848w, https://substackcdn.com/image/fetch/$s_!npjU!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F94535676-caf7-4189-837d-66d8cae24dd9_1024x1024.webp 1272w, https://substackcdn.com/image/fetch/$s_!npjU!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F94535676-caf7-4189-837d-66d8cae24dd9_1024x1024.webp 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!npjU!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F94535676-caf7-4189-837d-66d8cae24dd9_1024x1024.webp" width="1024" height="1024" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/94535676-caf7-4189-837d-66d8cae24dd9_1024x1024.webp&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1024,&quot;width&quot;:1024,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:558084,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/webp&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!npjU!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F94535676-caf7-4189-837d-66d8cae24dd9_1024x1024.webp 424w, https://substackcdn.com/image/fetch/$s_!npjU!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F94535676-caf7-4189-837d-66d8cae24dd9_1024x1024.webp 848w, https://substackcdn.com/image/fetch/$s_!npjU!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F94535676-caf7-4189-837d-66d8cae24dd9_1024x1024.webp 1272w, https://substackcdn.com/image/fetch/$s_!npjU!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F94535676-caf7-4189-837d-66d8cae24dd9_1024x1024.webp 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>In one of my previous articles, <em><a href="https://melvintercan.com/p/ai-is-eating-the-jobs">AI is Eating the Jobs</a></em>, I explored how agentic AI workflows are rapidly replacing entire teams, ushering in a new era of automation that is as transformative as it is disruptive. This is not a new story&#8212;automation has been reshaping industries since the Industrial Revolution, from the assembly line to the rise of computers. More recently, trends like Software-as-a-Service (SaaS), Robotic Process Automation (RPA), and Low-Code/No-Code platforms promised to streamline workflows and make businesses more efficient.</p><p><strong>The goal of automation has always been to do more with less&#8212;but today&#8217;s tools aren&#8217;t solving problems; they&#8217;re creating new ones.</strong></p><p>While SaaS platforms became ubiquitous as the backbone of modern business operations, they still depend heavily on human operators to configure, manage, and extract value. RPA and Low-Code/No-Code (LCNC) systems followed, aiming to push automation even further, but they have struggled to live up to their promise. Instead of enabling businesses to scale seamlessly, these tools have left many companies grappling with inefficiency, high maintenance costs, and limited flexibility.</p><div><hr></div><h3>The Shortcomings of SaaS</h3><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://x.com/levon377/status/1869699199169970416" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!rOpw!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4a0c9c5e-67e9-4cd3-8e3f-3815d4a17cfd_2160x2160.png 424w, https://substackcdn.com/image/fetch/$s_!rOpw!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4a0c9c5e-67e9-4cd3-8e3f-3815d4a17cfd_2160x2160.png 848w, https://substackcdn.com/image/fetch/$s_!rOpw!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4a0c9c5e-67e9-4cd3-8e3f-3815d4a17cfd_2160x2160.png 1272w, https://substackcdn.com/image/fetch/$s_!rOpw!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4a0c9c5e-67e9-4cd3-8e3f-3815d4a17cfd_2160x2160.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!rOpw!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4a0c9c5e-67e9-4cd3-8e3f-3815d4a17cfd_2160x2160.png" width="1456" height="1456" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/4a0c9c5e-67e9-4cd3-8e3f-3815d4a17cfd_2160x2160.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1456,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1180087,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:&quot;https://x.com/levon377/status/1869699199169970416&quot;,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!rOpw!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4a0c9c5e-67e9-4cd3-8e3f-3815d4a17cfd_2160x2160.png 424w, https://substackcdn.com/image/fetch/$s_!rOpw!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4a0c9c5e-67e9-4cd3-8e3f-3815d4a17cfd_2160x2160.png 848w, https://substackcdn.com/image/fetch/$s_!rOpw!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4a0c9c5e-67e9-4cd3-8e3f-3815d4a17cfd_2160x2160.png 1272w, https://substackcdn.com/image/fetch/$s_!rOpw!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4a0c9c5e-67e9-4cd3-8e3f-3815d4a17cfd_2160x2160.png 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>SaaS platforms revolutionized how businesses use software, offering flexibility and scalability without the headaches of traditional installation or maintenance. Tools like <a href="https://www.salesforce.com/">Salesforce</a>, <a href="https://slack.com/">Slack</a>, and <a href="https://www.hubspot.com/">HubSpot</a> became the backbone of modern business operations, promising streamlined workflows and improved collaboration. </p><p>These tools often fall short when applied to internal automation. They rely on users to manually configure workflows, integrate systems, and monitor operations. For instance, CRMs require sales teams to spend hours inputting and maintaining data, while managers must constantly optimize workflows for reporting and lead tracking. This effort shifts the workload rather than removing it.</p><p><strong>Instead of automating work, SaaS often turns employees into operators.</strong></p><p>Another issue is fragmentation. Most businesses depend on multiple tools for different functions&#8212;one for project management, another for customer support, and yet another for HR. These systems rarely integrate seamlessly, forcing employees to toggle between them and manually move data.</p><p>Tool overload adds to the complexity. HR teams might use separate platforms for payroll, recruiting, and employee engagement, each with its own setup and maintenance needs. This patchwork of disconnected workflows creates inefficiencies and demands constant oversight, leaving businesses far from the streamlined operations they envisioned.</p><p><strong>The promise of SaaS was to simplify, but its reality often ties automation to human effort.</strong></p><div><hr></div><h2>The Broken Promises of RPA</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://x.com/kimberlywtan/status/1856746731989619072" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!LQny!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50f425ee-f3cd-4f9a-af55-b03925202b49_2160x2160.png 424w, https://substackcdn.com/image/fetch/$s_!LQny!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50f425ee-f3cd-4f9a-af55-b03925202b49_2160x2160.png 848w, https://substackcdn.com/image/fetch/$s_!LQny!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50f425ee-f3cd-4f9a-af55-b03925202b49_2160x2160.png 1272w, https://substackcdn.com/image/fetch/$s_!LQny!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50f425ee-f3cd-4f9a-af55-b03925202b49_2160x2160.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!LQny!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50f425ee-f3cd-4f9a-af55-b03925202b49_2160x2160.png" width="1456" height="1456" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/50f425ee-f3cd-4f9a-af55-b03925202b49_2160x2160.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1456,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:642744,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:&quot;https://x.com/kimberlywtan/status/1856746731989619072&quot;,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!LQny!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50f425ee-f3cd-4f9a-af55-b03925202b49_2160x2160.png 424w, https://substackcdn.com/image/fetch/$s_!LQny!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50f425ee-f3cd-4f9a-af55-b03925202b49_2160x2160.png 848w, https://substackcdn.com/image/fetch/$s_!LQny!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50f425ee-f3cd-4f9a-af55-b03925202b49_2160x2160.png 1272w, https://substackcdn.com/image/fetch/$s_!LQny!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50f425ee-f3cd-4f9a-af55-b03925202b49_2160x2160.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Robotic Process Automation (RPA) emerged in the 2000s as a solution for handling repetitive workflows. Companies like <a href="https://www.uipath.com/">UiPath</a>, <a href="https://www.automationanywhere.com/">Automation Anywhere</a>, and <a href="https://www.blueprism.com/">Blue Prism</a> built multi-billion-dollar businesses on the idea that bots could mimic human actions&#8212;clicks, keystrokes, and form fills&#8212;to complete tasks like data entry, invoice processing, and customer service triage.</p><p><strong>RPA promised to simplify workflows, but in reality, it often creates more work than it saves.</strong></p><p>The problem lies in how RPA operates. Bots follow rigid scripts, navigating software interfaces based on predefined rules. These systems work well when processes remain static, but the moment something changes&#8212;a form layout updates, a button is renamed, or a vendor introduces a new invoice format&#8212;the bot fails. Companies then spend valuable time and resources fixing broken automations.</p><p>Take customer service as an example. While bots can triage basic tickets, they often misclassify complex requests, leading to longer resolution times and frustrated users. Similarly, RPA systems used for data migrations frequently stumble on edge cases, requiring constant human intervention.</p><p><strong>These issues expose RPA&#8217;s fundamental flaw: it doesn&#8217;t understand the work&#8212;it merely mimics it.</strong> This brittleness has left many businesses questioning whether RPA is worth the investment, especially when ongoing maintenance costs often exceed initial implementation savings.</p><div><hr></div><h2>The Frustrations of Low-Code/No-Code</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://x.com/tarstarr/status/1854968056315052176" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!QNEF!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc3b36883-1902-42d4-b1af-50a7be0efe29_2400x1254.png 424w, https://substackcdn.com/image/fetch/$s_!QNEF!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc3b36883-1902-42d4-b1af-50a7be0efe29_2400x1254.png 848w, https://substackcdn.com/image/fetch/$s_!QNEF!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc3b36883-1902-42d4-b1af-50a7be0efe29_2400x1254.png 1272w, https://substackcdn.com/image/fetch/$s_!QNEF!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc3b36883-1902-42d4-b1af-50a7be0efe29_2400x1254.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!QNEF!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc3b36883-1902-42d4-b1af-50a7be0efe29_2400x1254.png" width="1456" height="761" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/c3b36883-1902-42d4-b1af-50a7be0efe29_2400x1254.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:761,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:207021,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:&quot;https://x.com/tarstarr/status/1854968056315052176&quot;,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!QNEF!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc3b36883-1902-42d4-b1af-50a7be0efe29_2400x1254.png 424w, https://substackcdn.com/image/fetch/$s_!QNEF!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc3b36883-1902-42d4-b1af-50a7be0efe29_2400x1254.png 848w, https://substackcdn.com/image/fetch/$s_!QNEF!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc3b36883-1902-42d4-b1af-50a7be0efe29_2400x1254.png 1272w, https://substackcdn.com/image/fetch/$s_!QNEF!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc3b36883-1902-42d4-b1af-50a7be0efe29_2400x1254.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Low-Code/No-Code (LCNC) tools&#8212;offered by companies like <a href="https://retool.com/">Retool</a>, <a href="https://www.airtable.com/">Airtable Apps</a>, and <a href="https://powerapps.microsoft.com/">Microsoft Power Apps</a>&#8212;were designed to democratize software development. Their promise was simple: empower non-technical users to build custom workflows without relying on developers.</p><p>For simple use cases, LCNC tools work well. Users can quickly create dashboards, build forms, or design approval workflows. But as workflows grow in complexity, these platforms struggle. Many &#8220;no-code&#8221; tools require users to write scripts for advanced functionality, undermining the very premise of no-code. Meanwhile, more developer-centric platforms, like Retool, are inaccessible to non-technical teams, leaving businesses stuck between two extremes.</p><p><strong>LCNC platforms claim to lower the barrier to entry, but they often introduce new bottlenecks instead&#8212;ironically requiring the very expertise they were meant to replace.</strong></p><p>Another major issue is the lack of opinionated defaults. Most LCNC platforms provide blank canvases, requiring users to define everything themselves&#8212;from data models to integrations. This leads to inconsistent implementations across teams, inefficiencies, and overreliance on technical support.</p><p>For example, an HR team might use an LCNC platform to build a performance review tracker, but without predefined templates or best practices, the result is often clunky, hard to maintain, and unable to scale. Similar problems arise when customer support teams attempt to create integrated dashboards without clear guidance.</p><div><hr></div><h2>What the Next Generation of Automation Should Look Like</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!bv19!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf716e8d-eb6b-4c67-a6ce-2aa6a08b0216_1024x1024.webp" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!bv19!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf716e8d-eb6b-4c67-a6ce-2aa6a08b0216_1024x1024.webp 424w, https://substackcdn.com/image/fetch/$s_!bv19!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf716e8d-eb6b-4c67-a6ce-2aa6a08b0216_1024x1024.webp 848w, https://substackcdn.com/image/fetch/$s_!bv19!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf716e8d-eb6b-4c67-a6ce-2aa6a08b0216_1024x1024.webp 1272w, https://substackcdn.com/image/fetch/$s_!bv19!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf716e8d-eb6b-4c67-a6ce-2aa6a08b0216_1024x1024.webp 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!bv19!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf716e8d-eb6b-4c67-a6ce-2aa6a08b0216_1024x1024.webp" width="1024" height="1024" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/df716e8d-eb6b-4c67-a6ce-2aa6a08b0216_1024x1024.webp&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1024,&quot;width&quot;:1024,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:291294,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/webp&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!bv19!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf716e8d-eb6b-4c67-a6ce-2aa6a08b0216_1024x1024.webp 424w, https://substackcdn.com/image/fetch/$s_!bv19!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf716e8d-eb6b-4c67-a6ce-2aa6a08b0216_1024x1024.webp 848w, https://substackcdn.com/image/fetch/$s_!bv19!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf716e8d-eb6b-4c67-a6ce-2aa6a08b0216_1024x1024.webp 1272w, https://substackcdn.com/image/fetch/$s_!bv19!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf716e8d-eb6b-4c67-a6ce-2aa6a08b0216_1024x1024.webp 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>To truly transform how work gets done, the next generation of automation tools must combine the adaptability of horizontal platforms with the specificity of vertical solutions. Generative AI makes this possible by enabling tools that are both flexible and deeply integrated.</p><p><strong>Next-generation automation must be AI-native, opinionated, and seamlessly integrated into enterprise systems.</strong></p><p>An ideal platform would:</p><ol><li><p><strong>Leverage AI for workflow creation</strong>: Users describe their needs in plain language, and the platform generates a working prototype.</p></li><li><p><strong>Provide opinionated frameworks</strong>: Prebuilt templates and best practices guide users, reducing complexity and ensuring consistency.</p></li><li><p><strong>Ensure seamless integration</strong>: Robust connections to systems of record like CRMs, ERPs, and HR tools ensure enterprise readiness.</p></li><li><p><strong>Offer enterprise-grade governance</strong>: Built-in role-based access controls (RBAC), audit logs, and monitoring tools provide visibility and control.</p></li></ol><div><hr></div><p>In upcoming issues, we&#8217;ll dive deeper into how to design agentic AI workflows that deliver on this vision.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://melvintercan.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://melvintercan.com/subscribe?"><span>Subscribe now</span></a></p>]]></content:encoded></item><item><title><![CDATA[Designing Multi-Agent Systems: Drawing Lessons from OpenAI’s o1 Reasoning Model]]></title><description><![CDATA[How breakthroughs in OpenAI's reasoning models can influence the development of dynamic, multi-agent systems that can learn and adapt together.]]></description><link>https://melvintercan.com/p/lessons-from-reasoning-designing</link><guid isPermaLink="false">https://melvintercan.com/p/lessons-from-reasoning-designing</guid><dc:creator><![CDATA[Melvin Tercan]]></dc:creator><pubDate>Thu, 02 Jan 2025 19:01:04 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2514bbe0-d458-44c2-8dc0-1dccc4e5a30d_2092x1359.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>In the previous article, we explored <a href="https://melvintercan.com/p/anatomy-of-an-autonomous-ai-agent">the anatomy of autonomous AI agents</a>&#8212;single entities designed to handle specific tasks autonomously. While these agents can revolutionize individual workflows, the true potential of agentic AI lies in creating systems where multiple autonomous AI agents collaborate, learning and adapting together. This article explores how to design such multi-agent systems and the challenges they address.</p><p>Businesses today face mounting complexity in their operations, from supply chain disruptions to rapidly evolving customer expectations. Traditional automation systems, such as Software-as-a-Service (SaaS), Robotic Process Automation (RPA), and Low-Code/No-Code (LCNC) platforms, have promised transformation but often fall short of delivering true efficiency. These tools frequently require heavy manual intervention, struggle with adaptability, and create inefficiencies rather than solving them. Multi-agent systems offer a way forward: a network of specialized agents that interact, refine their outputs, and create a continuously improving system.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://melvintercan.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">P.S. I will be writing more on the shortcomings of modern automation tools soon, please subscribe if you haven&#8217;t already.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p>OpenAI&#8217;s recent announcement of its o3 reasoning model highlighted a significant breakthrough, achieving an <a href="https://arcprize.org/blog/oai-o3-pub-breakthrough">unprecedented score</a> on the ARC-AGI benchmark. Many view this as a significant step toward the broader vision of Artificial General Intelligence (AGI), where systems exhibit the ability to generalize and adapt across diverse challenges.</p><p>Insights from the latest research<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a> into the o1 model&#8217;s inner workings&#8212;such as its modular reasoning, dynamic context adaptation, and efficient knowledge integration&#8212;offer valuable lessons for creating networks of agents that interact dynamically and solve complex problems. By applying these principles, we can design more robust and adaptive multi-agent systems capable of tackling real-world challenges.</p><div><hr></div><h3>Designing Agentic Systems for the Next Levels of AGI</h3><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!CrBk!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2514bbe0-d458-44c2-8dc0-1dccc4e5a30d_2092x1359.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!CrBk!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2514bbe0-d458-44c2-8dc0-1dccc4e5a30d_2092x1359.png 424w, https://substackcdn.com/image/fetch/$s_!CrBk!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2514bbe0-d458-44c2-8dc0-1dccc4e5a30d_2092x1359.png 848w, https://substackcdn.com/image/fetch/$s_!CrBk!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2514bbe0-d458-44c2-8dc0-1dccc4e5a30d_2092x1359.png 1272w, https://substackcdn.com/image/fetch/$s_!CrBk!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2514bbe0-d458-44c2-8dc0-1dccc4e5a30d_2092x1359.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!CrBk!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2514bbe0-d458-44c2-8dc0-1dccc4e5a30d_2092x1359.png" width="1456" height="946" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2514bbe0-d458-44c2-8dc0-1dccc4e5a30d_2092x1359.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:946,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:178016,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!CrBk!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2514bbe0-d458-44c2-8dc0-1dccc4e5a30d_2092x1359.png 424w, https://substackcdn.com/image/fetch/$s_!CrBk!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2514bbe0-d458-44c2-8dc0-1dccc4e5a30d_2092x1359.png 848w, https://substackcdn.com/image/fetch/$s_!CrBk!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2514bbe0-d458-44c2-8dc0-1dccc4e5a30d_2092x1359.png 1272w, https://substackcdn.com/image/fetch/$s_!CrBk!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2514bbe0-d458-44c2-8dc0-1dccc4e5a30d_2092x1359.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>To understand the trajectory of advanced AI systems, we can use the framework  that OpenAI uses <a href="https://www.bloomberg.com/news/articles/2024-07-11/openai-sets-levels-to-track-progress-toward-superintelligent-ai?srnd=technology-vp">internally</a> to guide their path toward AGI development. Each level builds upon the last, moving from basic interaction to systems capable of managing complex operations autonomously:</p><ul><li><p><strong>Level 1 (Chatbots):</strong> Conversational AI is like the entry-level customer service representative who can handle FAQs, route inquiries, and manage basic tasks. These systems excel in providing quick, straightforward answers but lack the capacity for deeper problem-solving or strategic insights.</p></li><li><p><strong>Level 2 (Reasoners) &#8592; we&#8217;re here:</strong> Reasoners function like a skilled analyst in your organization. With OpenAI's recent introduction of o3, AI reasoning is reaching a level of intelligence where systems can tackle novel challenges effectively, paving the way for more autonomous decision-making and complex problem-solving.</p></li><li><p><strong>Level 3 (Agents):</strong> Agents are comparable to a highly competent project manager. For example, an AI-driven supply chain agent could reorder inventory, adjust shipping schedules, and negotiate with vendors without human intervention.</p></li><li><p><strong>Level 4 (Innovators):</strong> Innovators take on the role of a creative product designer or strategist. These systems generate novel ideas and develop groundbreaking solutions, such as conceptualizing entirely new business models or product lines based on emerging market trends.</p></li><li><p><strong>Level 5 (Organizations):</strong> Organizational Equivalents represent an autonomous C-suite, capable of running the entire organization. These systems manage workflows, make strategic decisions, and adapt to market shifts&#8212;essentially functioning as an independent enterprise.</p></li></ul><p>Achieving Levels 3 through 5 will require robust multi-agentic designs that inherit adaptability from reasoning models while introducing capabilities for creativity and strategic management.</p><div><hr></div><h2>The Four Key Principles for Multi-Agentic Systems</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!SS1Z!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F63b815c4-2b8b-4d55-8083-bdb0ffad6d31_1751x1263.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!SS1Z!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F63b815c4-2b8b-4d55-8083-bdb0ffad6d31_1751x1263.png 424w, https://substackcdn.com/image/fetch/$s_!SS1Z!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F63b815c4-2b8b-4d55-8083-bdb0ffad6d31_1751x1263.png 848w, https://substackcdn.com/image/fetch/$s_!SS1Z!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F63b815c4-2b8b-4d55-8083-bdb0ffad6d31_1751x1263.png 1272w, https://substackcdn.com/image/fetch/$s_!SS1Z!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F63b815c4-2b8b-4d55-8083-bdb0ffad6d31_1751x1263.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!SS1Z!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F63b815c4-2b8b-4d55-8083-bdb0ffad6d31_1751x1263.png" width="1456" height="1050" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/63b815c4-2b8b-4d55-8083-bdb0ffad6d31_1751x1263.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1050,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:258202,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!SS1Z!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F63b815c4-2b8b-4d55-8083-bdb0ffad6d31_1751x1263.png 424w, https://substackcdn.com/image/fetch/$s_!SS1Z!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F63b815c4-2b8b-4d55-8083-bdb0ffad6d31_1751x1263.png 848w, https://substackcdn.com/image/fetch/$s_!SS1Z!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F63b815c4-2b8b-4d55-8083-bdb0ffad6d31_1751x1263.png 1272w, https://substackcdn.com/image/fetch/$s_!SS1Z!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F63b815c4-2b8b-4d55-8083-bdb0ffad6d31_1751x1263.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>To bridge the gap between reasoning capabilities and fully autonomous multi-agentic systems, four foundational principles lay the groundwork for collaboration, learning, and adaptability among agents:</p><ol><li><p><strong>Policy Initialization:</strong> Policy initialization ensures agents start with a clear understanding of their role, equipped with the tools and training to succeed. Just as an HR department provides new hires with company guidelines and access to necessary software, this principle sets up agents with domain-specific expertise, ethical guidelines, and seamless integrations to begin their tasks effectively.</p></li><li><p><strong>Reward Design:</strong> Reward design works similarly by aligning agents&#8217; actions with the organization&#8217;s objectives. Agents are encouraged to prioritize collaborative outcomes and iteratively improve through feedback, much like how a sales team adjusts its strategies based on performance metrics.</p></li><li><p><strong>Search Mechanisms:</strong> Search mechanisms empower agents to explore and refine strategies dynamically, ensuring that innovative and practical solutions emerge even under uncertainty. This is akin to iterating on a product concept until it fits market needs.</p></li><li><p><strong>Learning Frameworks:</strong> Learning frameworks enable agents to continuously improve individually and collectively. These frameworks ensure that agents learn from their experiences and adapt to evolving challenges, fostering a culture of continuous improvement.</p></li></ol><p>Let&#8217;s now explore these four key principles in greater detail.</p><div><hr></div><h3>1. Policy Initialization: Building Strong Foundations</h3><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!tkIK!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F274157ea-31bf-4699-b002-74713dba3663_1741x1986.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!tkIK!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F274157ea-31bf-4699-b002-74713dba3663_1741x1986.png 424w, https://substackcdn.com/image/fetch/$s_!tkIK!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F274157ea-31bf-4699-b002-74713dba3663_1741x1986.png 848w, https://substackcdn.com/image/fetch/$s_!tkIK!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F274157ea-31bf-4699-b002-74713dba3663_1741x1986.png 1272w, https://substackcdn.com/image/fetch/$s_!tkIK!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F274157ea-31bf-4699-b002-74713dba3663_1741x1986.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!tkIK!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F274157ea-31bf-4699-b002-74713dba3663_1741x1986.png" width="1456" height="1661" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/274157ea-31bf-4699-b002-74713dba3663_1741x1986.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1661,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:390450,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!tkIK!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F274157ea-31bf-4699-b002-74713dba3663_1741x1986.png 424w, https://substackcdn.com/image/fetch/$s_!tkIK!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F274157ea-31bf-4699-b002-74713dba3663_1741x1986.png 848w, https://substackcdn.com/image/fetch/$s_!tkIK!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F274157ea-31bf-4699-b002-74713dba3663_1741x1986.png 1272w, https://substackcdn.com/image/fetch/$s_!tkIK!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F274157ea-31bf-4699-b002-74713dba3663_1741x1986.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Policy initialization is the process of equipping agents with the foundational tools and frameworks necessary to operate effectively in complex environments. Drawing from advanced reasoning models like o1, this principle focuses on three core components:</p><ol><li><p><strong>Dynamic Policy Creation</strong>: Agents are initialized with policies that adapt through iterative evaluation and reasoning. Techniques like Monte Carlo Tree Search enable agents to refine strategies by balancing exploration and exploitation, which is critical for decision-making in novel scenarios.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a> This process is further enhanced by token-level action granularity, where agents operate at the finest level of decision-making, selecting individual tokens from a vast vocabulary to construct coherent responses.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-3" href="#footnote-3" target="_self">3</a> Much like a startup founder refining their business plan based on market feedback, agents dynamically adjust their policies to optimize performance. Additionally, step-by-step reasoning and alternative proposal mechanisms allow agents to explore multiple solutions and self-correct when errors are detected, significantly improving accuracy and adaptability.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-4" href="#footnote-4" target="_self">4</a></p></li><li><p><strong>Integrated Knowledge Models</strong>: Agents are initialized with modular knowledge bases that combine pre-trained datasets and domain-specific updates. This approach integrates cross-domain knowledge with real-time data for situational accuracy, enabling agents to remain highly relevant and effective in their operational domains.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-5" href="#footnote-5" target="_self">5</a> Consider a consultant who tailors their advice to each client&#8217;s unique needs. Pre-training establishes language understanding, world knowledge acquisition, and basic reasoning abilities, while instruction fine-tuning transforms these capabilities into task-oriented behaviors.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-6" href="#footnote-6" target="_self">6</a> For example, multilingual training data enables cross-lingual transfer, and exposure to diverse corpora fosters domain expertise in areas like mathematics, science, and programming.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-7" href="#footnote-7" target="_self">7</a></p></li><li><p><strong>Human-like Reasoning Behaviors</strong>: Agents are equipped with sophisticated reasoning behaviors that mimic human problem-solving. These include:</p><ul><li><p><strong>Problem Analysis</strong>: Reformulating and analyzing problems to reduce ambiguity and construct actionable specifications.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-8" href="#footnote-8" target="_self">8</a> It&#8217;s similar to a CEO breaking down a complex challenge into clear, actionable goals.</p></li><li><p><strong>Task Decomposition</strong>: Breaking complex problems into manageable subtasks, dynamically adjusting granularity based on context.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-9" href="#footnote-9" target="_self">9</a> This mirrors a project manager dividing a large project into smaller milestones.</p></li><li><p><strong>Self-Reflection</strong>: Evaluating and correcting outputs, enabling agents to recognize flaws and refine their reasoning processes.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-10" href="#footnote-10" target="_self">10</a> It&#8217;s like a company conducting a post-mortem after a product launch to identify lessons learned.</p></li></ul></li></ol><h4>Policy Initialization Lessons from the o1 Reasoning Model</h4><p>The o1 model highlights several key insights that can inform the policy initialization of agentic systems:</p><ul><li><p><strong>Long-Text Generation</strong>: Agents must generate lengthy, coherent outputs to handle complex reasoning tasks. Techniques like <em>AgentWrite</em> and <em>Self-Lengthen</em> enhance long-text generation capabilities by fine-tuning on specialized datasets.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-11" href="#footnote-11" target="_self">11</a></p></li><li><p><strong>Logical Orchestration of Reasoning Behaviors</strong>: Agents need to strategically sequence reasoning behaviors, such as deciding when to self-correct or explore alternative solutions. Exposure to programming code and structured logical data strengthens these capabilities.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-12" href="#footnote-12" target="_self">12</a></p></li><li><p><strong>Self-Reflection</strong>: Behaviors like self-evaluation and self-correction enable agents to recognize and address flaws in their reasoning. This self-reflection capability is critical for improving accuracy and reliability.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-13" href="#footnote-13" target="_self">13</a></p></li></ul><p>Policy initialization provides agents with the foundational policies, knowledge bases, and reasoning frameworks they need to perform effectively. This ensures that every agent operates seamlessly within the larger system, maintaining both efficiency and adaptability.</p><div><hr></div><h3>2. Reward Design: Aligning Actions with Goals</h3><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!BDsy!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb2236c6c-4d76-4592-a419-7038b79e7396_1741x1468.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!BDsy!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb2236c6c-4d76-4592-a419-7038b79e7396_1741x1468.png 424w, https://substackcdn.com/image/fetch/$s_!BDsy!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb2236c6c-4d76-4592-a419-7038b79e7396_1741x1468.png 848w, https://substackcdn.com/image/fetch/$s_!BDsy!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb2236c6c-4d76-4592-a419-7038b79e7396_1741x1468.png 1272w, https://substackcdn.com/image/fetch/$s_!BDsy!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb2236c6c-4d76-4592-a419-7038b79e7396_1741x1468.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!BDsy!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb2236c6c-4d76-4592-a419-7038b79e7396_1741x1468.png" width="1456" height="1228" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b2236c6c-4d76-4592-a419-7038b79e7396_1741x1468.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1228,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:234734,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!BDsy!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb2236c6c-4d76-4592-a419-7038b79e7396_1741x1468.png 424w, https://substackcdn.com/image/fetch/$s_!BDsy!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb2236c6c-4d76-4592-a419-7038b79e7396_1741x1468.png 848w, https://substackcdn.com/image/fetch/$s_!BDsy!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb2236c6c-4d76-4592-a419-7038b79e7396_1741x1468.png 1272w, https://substackcdn.com/image/fetch/$s_!BDsy!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb2236c6c-4d76-4592-a419-7038b79e7396_1741x1468.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Reward design is the cornerstone of aligning agent behavior with system-wide objectives, ensuring that every action contributes to the overall goals of the organization. Drawing insights from reinforcement learning and advanced reasoning models, this principle involves three core concepts:</p><ul><li><p><strong>Dynamic Reward Structures:</strong> Incentive mechanisms must evolve alongside the system's changing goals and environments. Techniques like policy-gradient methods and adaptive reward shaping allow systems to fine-tune incentives dynamically, ensuring agents remain aligned with overarching objectives<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-14" href="#footnote-14" target="_self">14</a>. For instance, a customer service agent might initially prioritize response speed but later focus on customer satisfaction as the system&#8217;s goals evolve. </p></li><li><p><strong>Collaborative Incentives:</strong> To foster teamwork, agents must be rewarded not only for individual success but also for contributions to collective outcomes. Methods for multi-agent credit assignment, as described in coordination algorithms, ensure agents work toward shared objectives like improving overall customer retention or maximizing team productivity. This is akin to how high-performing organizations balance individual bonuses with team-based rewards to encourage collaboration.</p></li><li><p><strong>Feedback Loops for Iterative Learning:</strong> By providing agents with continuous feedback on their performance, systems can refine agent behavior iteratively. Concepts like reward modeling, as highlighted in scalable alignment research, allow agents to adapt through both reinforcement signals and post-task evaluations<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-15" href="#footnote-15" target="_self">15</a>. For example, a sales agent might adjust its approach to prospecting based on reward adjustments for achieving long-term revenue goals. This iterative process is similar to how businesses use performance reviews and real-time analytics to optimize employee output.</p></li></ul><h4><strong>Reward Design Lessons from the o1 Reasoning Model</strong></h4><p>The o1 reasoning model might provide some valuable insights into designing reward systems for agentic systems, particularly for complex, multi-task environments. Here are key lessons and challenges:</p><ol><li><p><strong>Process Rewards for Complex Reasoning:</strong> For tasks like mathematics and code generation, where responses involve long chains of reasoning, o1 likely employs process rewards to supervise intermediate steps rather than relying solely on outcome rewards<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-16" href="#footnote-16" target="_self">16</a>. This ensures that agents are rewarded for correct reasoning processes, even if the final outcome is incorrect. Techniques like reward shaping can transform sparse outcome rewards into denser process rewards, enabling more effective learning<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-17" href="#footnote-17" target="_self">17</a>.</p></li><li><p><strong>Learning from Preference and Expert Data:</strong> When reward signals from the environment are unavailable, o1 might leverage preference data (e.g., ranked responses) or expert data to infer rewards<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-18" href="#footnote-18" target="_self">18</a>. This approach, inspired by methods like Inverse Reinforcement Learning (IRL), allows agents to learn from human-like decision-making patterns, even in the absence of explicit feedback<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-19" href="#footnote-19" target="_self">19</a>.</p></li><li><p><strong>Robust and Adaptable Reward Models:</strong> o1&#8217;s reward model is likely trained on a large and diverse dataset, enabling it to generalize across domains. It can be fine-tuned using few-shot examples, making it adaptable to new tasks with minimal data. This flexibility is crucial for agentic systems operating in dynamic environments.</p></li></ol><p>While traditional businesses use a mix of monetary incentives, career advancement, and recognition to drive performance, reward design creates a sophisticated framework of reinforcement signals that shape agent behavior. This multi-layered approach mirrors how the most effective organizations balance individual achievement with team success, creating a self-reinforcing cycle of continuous improvement and collective achievement. </p><div><hr></div><h3>3. Search Mechanisms: Exploring and Refining Strategies</h3><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!3-WV!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2e4af257-c642-432d-92b4-bb814aca58ff_2036x1528.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!3-WV!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2e4af257-c642-432d-92b4-bb814aca58ff_2036x1528.png 424w, https://substackcdn.com/image/fetch/$s_!3-WV!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2e4af257-c642-432d-92b4-bb814aca58ff_2036x1528.png 848w, https://substackcdn.com/image/fetch/$s_!3-WV!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2e4af257-c642-432d-92b4-bb814aca58ff_2036x1528.png 1272w, https://substackcdn.com/image/fetch/$s_!3-WV!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2e4af257-c642-432d-92b4-bb814aca58ff_2036x1528.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!3-WV!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2e4af257-c642-432d-92b4-bb814aca58ff_2036x1528.png" width="1456" height="1093" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2e4af257-c642-432d-92b4-bb814aca58ff_2036x1528.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1093,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:204728,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!3-WV!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2e4af257-c642-432d-92b4-bb814aca58ff_2036x1528.png 424w, https://substackcdn.com/image/fetch/$s_!3-WV!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2e4af257-c642-432d-92b4-bb814aca58ff_2036x1528.png 848w, https://substackcdn.com/image/fetch/$s_!3-WV!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2e4af257-c642-432d-92b4-bb814aca58ff_2036x1528.png 1272w, https://substackcdn.com/image/fetch/$s_!3-WV!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2e4af257-c642-432d-92b4-bb814aca58ff_2036x1528.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Search mechanisms empower agents to navigate uncertainty, test strategies, and refine their approaches dynamically. Based on insights from the research paper, these mechanisms can be broken into five core concepts:</p><ol><li><p><strong>Guiding Signals: Internal vs. External Guidance</strong><br>Search relies on guiding signals to direct the exploration process. Internal guidance leverages the model&#8217;s own capabilities, such as model uncertainty or self-evaluation, to assess the quality of solutions. For example, self-consistency techniques use majority voting to select the most reliable answer from multiple candidates. External guidance, on the other hand, uses task-specific feedback, such as rewards or heuristic rules, to steer the search. This is akin to a business using customer feedback (external) versus internal performance metrics (internal) to refine its strategies.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-20" href="#footnote-20" target="_self">20</a></p></li><li><p><strong>Search Strategies: Tree Search vs. Sequential Revisions</strong><br>Search strategies fall into two broad categories: tree search and sequential revisions. Tree search, like Monte Carlo Tree Search (MCTS) or Best-of-N (BoN), generates multiple candidate solutions simultaneously, exploring a wide range of possibilities. Sequential revisions, in contrast, iteratively refine a single solution based on feedback or self-reflection. Think of tree search as a brainstorming session where multiple ideas are generated at once, while sequential revisions are like polishing a single idea through multiple drafts.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-21" href="#footnote-21" target="_self">21</a></p></li><li><p><strong>Exploration vs. Exploitation</strong><br>Effective search strategies balance exploration (trying new possibilities) with exploitation (leveraging known successful strategies). Techniques like epsilon-greedy algorithms ensure agents remain innovative while capitalizing on proven outcomes. For instance, during training, agents might explore diverse solutions (exploration) while also leveraging high-reward solutions (exploitation) to iteratively improve their policy. This is similar to a company investing in R&amp;D (exploration) while maximizing profits from existing products (exploitation).<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-22" href="#footnote-22" target="_self">22</a></p></li><li><p><strong>Training-Time vs. Inference-Time Search</strong><br>Search plays distinct roles in training and inference. During training, search helps generate high-quality data by exploring diverse solutions, often guided by external feedback like task-specific rewards. In inference, search refines outputs through internal guidance, such as self-evaluation, to improve accuracy. For example, during training, a model might use MCTS to explore a wide range of solutions, while during inference, it might use sequential revisions to refine its final answer.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-23" href="#footnote-23" target="_self">23</a></p></li><li><p><strong>Efficiency and Scaling Challenges</strong><br>Scaling search mechanisms introduces challenges, such as inverse scaling (where increased search degrades performance) and over-thinking on simple tasks. For example, using a reward model trained on limited data can lead to poor generalization during large-scale search. To address this, researchers propose techniques like speculative rejection (discarding low-quality solutions early) or combining tree search with sequential revisions to balance efficiency and performance. This is akin to a business optimizing its processes to avoid wasted effort while maintaining high-quality outcomes.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-24" href="#footnote-24" target="_self">24</a></p></li></ol><h4>Search Mechanism Lessons from the o1 Reasoning Model</h4><p>The o1 reasoning model may offer some insights into optimizing search mechanisms, providing a blueprint for designing efficient and effective systems. Here are three critical takeaways:</p><ol><li><p><strong>Training-Time Search: Parallel Exploration with External Guidance</strong><br>During training, o1 likely employs tree search techniques like Best-of-N (BoN) and Monte Carlo Tree Search (MCTS) to explore multiple solutions simultaneously. This parallel exploration is guided by external feedback, such as task-specific rewards or environmental signals like code execution results, ensuring alignment with real-world performance. Systems can adopt similar strategies to efficiently explore and validate solutions during training, leveraging external feedback to refine their approaches.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-25" href="#footnote-25" target="_self">25</a></p></li><li><p><strong>Inference-Time Search: Iterative Refinement with Internal Guidance</strong><br>During inference, o1 might shift to sequential revisions, iteratively refining outputs using internal guidance like self-evaluation or model uncertainty. This minimizes computational overhead while improving solution quality. By adopting iterative refinement, systems can optimize decisions in real-time without relying on costly external feedback, ensuring efficiency and accuracy.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-26" href="#footnote-26" target="_self">26</a></p></li></ol><p>Just as military planners use simulations to evaluate tactical approaches or chess grandmasters analyze multiple move sequences, search mechanisms enable agents to explore decision trees and optimize their choices through systematic evaluation. Together, these mechanisms ensure that agents navigate complex decision spaces with confidence and precision, much like a well-oiled corporate strategy team.</p><div><hr></div><h3>4. Learning Frameworks: Continuous Improvement and Collaboration</h3><p>Learning frameworks in multi-agent systems are not just about continuous improvement; they encompass a broader set of principles that enable agents to adapt, innovate, and scale effectively. Here are the core concepts, enriched with insights from the research:</p><ol><li><p><strong>Reflective Analysis</strong>: Agents evaluate their own performance to identify strengths and weaknesses, refining their strategies iteratively. This is akin to a business conducting a post-mortem analysis after a project to understand what worked and what didn&#8217;t. For example, an agent might analyze its past decisions to improve future actions, much like a sales team refining its pitch based on customer feedback.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-27" href="#footnote-27" target="_self">27</a></p></li><li><p><strong>Collaborative Knowledge Sharing</strong>: Agents share their experiences and solutions, creating a collective intelligence that benefits the entire system. This mirrors how cross-functional teams in a company share insights to solve complex problems. When one agent discovers an efficient solution, it&#8217;s shared across the network, enhancing overall performance&#8212;similar to how a company&#8217;s R&amp;D department shares innovations with other teams to drive organizational growth.</p></li><li><p><strong>Adaptive Learning Algorithms</strong>: Agents continuously update their strategies based on real-time data, ensuring they remain effective in dynamic environments. This is akin to a business adapting its operations in response to market trends. For instance, an agent might adjust its decision-making process based on new information, much like a retail company dynamically changing its inventory based on consumer demand.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-28" href="#footnote-28" target="_self">28</a></p></li><li><p><strong>Search-Driven Exploration</strong>: Agents use search algorithms (e.g., beam search, Monte Carlo Tree Search) to explore high-value actions rather than relying on random sampling. This ensures that the training data is of higher quality, similar to how a company uses data analytics to identify the most promising opportunities. The iterative process of search and learning&#8212;where the system refines its strategies based on search results&#8212;mirrors how businesses refine their operations based on performance metrics.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-29" href="#footnote-29" target="_self">29</a></p></li><li><p><strong>Hybrid Learning Approaches</strong>: Systems often combine multiple learning methods, such as behavior cloning and policy optimization, to achieve the best results. Behavior cloning focuses on replicating successful actions, while policy optimization learns from both successes and failures. This hybrid approach is like a business starting with proven strategies before experimenting with innovative approaches to achieve breakthroughs.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-30" href="#footnote-30" target="_self">30</a></p></li><li><p><strong>Scaling and Efficiency</strong>: As systems grow in complexity, ensuring efficient learning becomes a critical challenge. The process of generating and analyzing data can be resource-intensive, much like a company&#8217;s R&amp;D efforts consuming significant time and budget. To address this, systems can reuse data from previous iterations or focus on the most promising areas of exploration, similar to how businesses optimize their operations to reduce costs and improve efficiency.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-31" href="#footnote-31" target="_self">31</a></p></li><li><p><strong>Dynamic Problem Generation</strong>: As agents improve, previously challenging problems become trivial, requiring the generation of more complex challenges. This is akin to a company continuously innovating to stay ahead of competitors. Systems must dynamically adjust their focus to ensure they&#8217;re always tackling meaningful problems, much like a business pivoting its strategy to address emerging market trends.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-32" href="#footnote-32" target="_self">32</a></p></li></ol><h4>Learning Framework Lessons from the o1 Reasoning Model</h4><p>The o1 reasoning model may offer some insights that can inform the design of learning frameworks for agentic systems, particularly in policy initialization and training efficiency:</p><ol><li><p><strong>Warm-Starting with Behavior Cloning</strong>:<br>The o1 model suggests that behavior cloning is an efficient way to initialize policies, as it leverages high-reward solutions to quickly establish a strong baseline. This is akin to onboarding new employees with proven best practices to ensure they start on the right track. However, behavior cloning has limitations&#8212;it ignores lower-reward solutions, which can provide valuable learning signals. Thus, while it&#8217;s effective for warm-starting, it should be complemented with other methods for further optimization.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-33" href="#footnote-33" target="_self">33</a></p></li><li><p><strong>Transitioning to Policy Optimization</strong>:<br>Once the initial policy is established, the o1 model suggests transitioning to policy optimization methods like PPO or DPO. These methods utilize all state-action pairs, including lower-reward solutions, to refine the policy. This is similar to a company moving from standardized training programs to more dynamic, data-driven strategies that incorporate lessons from both successes and failures. PPO, while memory-intensive, offers robust performance, whereas DPO is simpler and more memory-efficient but relies on preference data.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-34" href="#footnote-34" target="_self">34</a></p></li><li><p><strong>Handling Distribution Shifts</strong>:<br>A potential challenge in o1 is the distribution shift that occurs when search-generated data (from a better policy) is used to train the current policy. This issue can also arise during policy initialization if the initial data doesn&#8217;t align with the system&#8217;s operational environment. To mitigate this, initialization can incorporate behavior<strong> </strong>cloning to narrow the gap between the initial and target policies, similar to aligning new hires&#8217; skills with organizational needs through targeted training.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-35" href="#footnote-35" target="_self">35</a></p></li><li><p><strong>Improving Training Efficiency</strong>:<br>The o1 model highlights the possible computational cost of train-time search, which can slow down policy initialization. To address this, systems can reuse data from previous iterations or focus on high-value actions, much like a company optimizing its workflows to reduce costs and improve efficiency. This ensures that initialization is both effective and resource-efficient.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-36" href="#footnote-36" target="_self">36</a></p></li></ol><p>In essence, just as a successful corporation thrives on continuous improvement, collaboration, and adaptability, multi-agent systems achieve their goals through reflective analysis, knowledge sharing, adaptive learning, and strategic exploration.</p><div><hr></div><h2>Challenges Ahead</h2><p>As we move toward implementing sophisticated multi-agent systems, several significant challenges emerge that organizations must navigate carefully. These hurdles reflect the complexity of creating systems that can think, learn, and collaborate effectively at scale.</p><ol><li><p><strong>Lack of Published Research on o1</strong></p><p>A major challenge in reproducing or understanding the o1 model is the absence of published research from OpenAI. OpenAI has not released any detailed papers or technical reports about the o1 model, forcing researchers and developers to reverse engineer its capabilities based on limited information and public demonstrations. This lack of transparency makes it difficult to fully understand the underlying mechanisms, architectures, and training methodologies used in o1. As a result, the research community must rely on open-source projects and independent studies to piece together the roadmap to achieving similar capabilities. This underscores the importance of open-source models and the need for more transparency in AI research.</p></li><li><p><strong>Technical Scalability and Performance:</strong> Just as businesses face challenges when scaling their operations, multi-agent systems encounter unique obstacles as they grow. Counter-intuitively, adding more computational power doesn't always improve performance&#8212;in fact, it can sometimes degrade it. This is similar to how adding more team members to a project doesn't necessarily increase productivity. The way these systems process information, particularly when making complex decisions, creates bottlenecks that become more pronounced at scale. Consider how a video conference becomes unstable with too many participants&#8212;multi-agent systems face similar coordination challenges but at a much more sophisticated level.</p></li><li><p><strong>Balancing Learning and Action:</strong> Much like developing effective performance metrics for employees, creating the right incentive structures for AI agents presents unique challenges. These systems need clear signals about what constitutes success, but defining these metrics becomes increasingly complex as tasks become more sophisticated. For instance, how do you balance a customer service agent's need to resolve issues quickly with the quality of their interactions? The system must learn from experience while maintaining consistent performance, similar to how businesses must innovate while maintaining day-to-day operations.</p></li><li><p><strong>Orchestrating Collaboration:</strong> Perhaps the most significant challenge lies in managing how multiple agents work together effectively. This mirrors the complexity of coordinating different departments within a large organization&#8212;each unit needs to operate independently while contributing to overall goals. The system must manage resources efficiently, ensure smooth communication between agents, and maintain performance across the entire network. Just as a company needs robust systems to coordinate between sales, operations, and customer service, multi-agent systems require sophisticated frameworks to manage their interactions and collective learning.</p></li></ol><p>Ironically, as we work to develop AI systems to solve these organizational challenges, we could find ourselves grappling with the same fundamental problems that have plagued organizations for decennia&#8212;just in a different form. Whether managed by humans or artificial intelligence, it seems the core challenges of running complex, adaptive organizations will remain a difficult problem to solve.</p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>Zeng, Zhiyuan, et al. "Scaling of Search and Learning: A Roadmap to Reproduce o1 from Reinforcement Learning Perspective." 2024.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>Brown, Tom B., et al. "Language Models Are Few-Shot Learners." 2020; Kaplan, Jared, et al. "Scaling Laws for Neural Language Models." 2020.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-3" href="#footnote-anchor-3" class="footnote-number" contenteditable="false" target="_self">3</a><div class="footnote-content"><p>Radford, Alec, and Karthik Narasimhan. "Improving Language Understanding by Generative Pre-Training.", 2018; Brown, Tom B., et al. "Language Models Are Few-Shot Learners." 2020.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-4" href="#footnote-anchor-4" class="footnote-number" contenteditable="false" target="_self">4</a><div class="footnote-content"><p>Wei, Jason, et al. "Chain-of-Thought Prompting Elicits Reasoning in Large Language Models." 2022; Puerto, Haritz, et al. "Fine-Tuning with Divergent Chains of Thought Boosts Reasoning Through Self-Correction in Language Models." 2024.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-5" href="#footnote-anchor-5" class="footnote-number" contenteditable="false" target="_self">5</a><div class="footnote-content"><p>Radford, Alec, et al. "Language Models Are Unsupervised Multitask Learners.", 2019; Brown, Tom B., et al. "Language Models Are Few-Shot Learners." 2020.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-6" href="#footnote-anchor-6" class="footnote-number" contenteditable="false" target="_self">6</a><div class="footnote-content"><p>Wei, Jason, et al. "Chain-of-Thought Prompting Elicits Reasoning in Large Language Models." 2022; Chung, Hyung Won, et al. "Scaling Instruction-Finetuned Language Models." 2024.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-7" href="#footnote-anchor-7" class="footnote-number" contenteditable="false" target="_self">7</a><div class="footnote-content"><p>Scao, Teven Le, et al. "BLOOM: A 176B-Parameter Open-Access Multilingual Language Model." 2022; Yang, Shunyu, et al. "Tree of Thoughts: Deliberate Problem Solving with Large Language Models." 2023.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-8" href="#footnote-anchor-8" class="footnote-number" contenteditable="false" target="_self">8</a><div class="footnote-content"><p>Deng, Yang, et al. "Prompting and Evaluating Large Language Models for Proactive Dialogues: Clarification, Target-Guided, and Non-Collaboration." 2023; Lee, Dongryeol, et al. "Asking Clarification Questions to Handle Ambiguity in Open-Domain QA." 2023.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-9" href="#footnote-anchor-9" class="footnote-number" contenteditable="false" target="_self">9</a><div class="footnote-content"><p>Bursztyn, Victor S., et al. "Learning to Perform Complex Tasks Through Compositional Fine-Tuning of Language Models." 2022; Zhou, Denny, et al. "Least-to-Most Prompting Enables Complex Reasoning in Large Language Models." 2023.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-10" href="#footnote-anchor-10" class="footnote-number" contenteditable="false" target="_self">10</a><div class="footnote-content"><p>Madaan, Aman, et al. "Self-Refine: Iterative Refinement with Self-Feedback." 2023; Cheng, Qinyuan, et al. "Can AI Assistants Know What They Don&#8217;t Know?" 2024.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-11" href="#footnote-anchor-11" class="footnote-number" contenteditable="false" target="_self">11</a><div class="footnote-content"><p>Bai, Yuntao, et al. "Constitutional AI: Harmlessness from AI Feedback." 2024.<br>Quan, Shanghaoran, et al. "DMOERM: Recipes of Mixture-of-Experts for Effective Reward Modeling." 2024.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-12" href="#footnote-anchor-12" class="footnote-number" contenteditable="false" target="_self">12</a><div class="footnote-content"><p>Sun, Qiushi, et al. "A Survey of Neural Code Intelligence: Paradigms, Advances and Beyond." 2024; Aryabumi, Viraat, et al. "To Code, or Not to Code? Exploring Impact of Code in Pre-Training." 2024.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-13" href="#footnote-anchor-13" class="footnote-number" contenteditable="false" target="_self">13</a><div class="footnote-content"><p>Madaan, Aman, et al. "Self-Refine: Iterative Refinement with Self-Feedback." 2023; Cheng, Qinyuan, et al. "Can AI Assistants Know What They Don&#8217;t Know?" 2024.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-14" href="#footnote-anchor-14" class="footnote-number" contenteditable="false" target="_self">14</a><div class="footnote-content"><p>Ng, Andrew Y., and Stuart Russell. "Policy Invariance Under Reward Transformations: Theory and Application to Reward Shaping.", 1999.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-15" href="#footnote-anchor-15" class="footnote-number" contenteditable="false" target="_self">15</a><div class="footnote-content"><p>Christiano, Paul F., et al. "Deep Reinforcement Learning from Human Preferences.", 2017.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-16" href="#footnote-anchor-16" class="footnote-number" contenteditable="false" target="_self">16</a><div class="footnote-content"><p>Cobbe, Karl, et al. "Training Verifiers to Solve Math Word Problems." 2021.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-17" href="#footnote-anchor-17" class="footnote-number" contenteditable="false" target="_self">17</a><div class="footnote-content"><p>Ng, Andrew Y., Daishi Harada, and Stuart Russell. "Policy Invariance Under Reward Transformations: Theory and Application to Reward Shaping.", 1999.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-18" href="#footnote-anchor-18" class="footnote-number" contenteditable="false" target="_self">18</a><div class="footnote-content"><p>Christiano, Paul F., et al. "Deep Reinforcement Learning from Human Preferences.", 2017.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-19" href="#footnote-anchor-19" class="footnote-number" contenteditable="false" target="_self">19</a><div class="footnote-content"><p>Garg, Divyansh, et al. "IQ-Learn: Inverse Soft-Q Learning for Imitation." 2021.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-20" href="#footnote-anchor-20" class="footnote-number" contenteditable="false" target="_self">20</a><div class="footnote-content"><p>Wang, Xuezhi, et al. "Self-Consistency Improves Chain of Thought Reasoning in Language Models." 2023; Snell, Charlie, et al. "Scaling LLM Test-Time Compute Optimally Can Be More Effective Than Scaling Model Parameters." 2024.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-21" href="#footnote-anchor-21" class="footnote-number" contenteditable="false" target="_self">21</a><div class="footnote-content"><p>Browne, Cameron, et al. "A Survey of Monte Carlo Tree Search Methods.", 2012.<br>Cobbe, Karl, et al. "Training Verifiers to Solve Math Word Problems." 2021.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-22" href="#footnote-anchor-22" class="footnote-number" contenteditable="false" target="_self">22</a><div class="footnote-content"><p>Sutton, Richard S., and Andrew G. Barto. &#8220;Reinforcement Learning: An Introduction.&#8221;, 1998; Anthony, Thomas, et al. "Thinking Fast and Slow with Deep Learning and Tree Search.", 2017.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-23" href="#footnote-anchor-23" class="footnote-number" contenteditable="false" target="_self">23</a><div class="footnote-content"><p>OpenAI. "Learning to Reason with LLMs." 2024; Chen, Mark, et al. "Evaluating Large Language Models Trained on Code." 2021.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-24" href="#footnote-anchor-24" class="footnote-number" contenteditable="false" target="_self">24</a><div class="footnote-content"><p>Brown, Bradley C. A., et al. "Large Language Monkeys: Scaling Inference Compute with Repeated Sampling." 2024; Gao, Leo, et al. "Scaling Laws for Reward Model Overoptimization." 2023.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-25" href="#footnote-anchor-25" class="footnote-number" contenteditable="false" target="_self">25</a><div class="footnote-content"><p>Browne, Cameron, et al. "A Survey of Monte Carlo Tree Search Methods.", 2012.<br>Cobbe, Karl, et al. "Training Verifiers to Solve Math Word Problems." 2021.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-26" href="#footnote-anchor-26" class="footnote-number" contenteditable="false" target="_self">26</a><div class="footnote-content"><p>Snell, Charlie, et al. "Scaling LLM Test-Time Compute Optimally Can Be More Effective Than Scaling Model Parameters." 2024; Wang, Xuezhi, et al. "Self-Consistency Improves Chain of Thought Reasoning in Language Models." 2023.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-27" href="#footnote-anchor-27" class="footnote-number" contenteditable="false" target="_self">27</a><div class="footnote-content"><p>Silver, David, et al. "Mastering the Game of Go with Deep Neural Networks and Tree Search." 2016.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-28" href="#footnote-anchor-28" class="footnote-number" contenteditable="false" target="_self">28</a><div class="footnote-content"><p>Sutton, Richard S., and Andrew G. Barto. &#8220;Reinforcement Learning: An Introduction.&#8221; 1998.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-29" href="#footnote-anchor-29" class="footnote-number" contenteditable="false" target="_self">29</a><div class="footnote-content"><p>Silver, David, et al. "Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm." 2017.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-30" href="#footnote-anchor-30" class="footnote-number" contenteditable="false" target="_self">30</a><div class="footnote-content"><p>Schulman, John, et al. "Proximal Policy Optimization Algorithms." 2017.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-31" href="#footnote-anchor-31" class="footnote-number" contenteditable="false" target="_self">31</a><div class="footnote-content"><p>Rafailov, Rafael, et al. "Direct Preference Optimization: Your Language Model is Secretly a Reward Model." 2023.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-32" href="#footnote-anchor-32" class="footnote-number" contenteditable="false" target="_self">32</a><div class="footnote-content"><p>Kumar, M. Pawan, et al. "Curriculum Learning for Reinforcement Learning Domains." 2010.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-33" href="#footnote-anchor-33" class="footnote-number" contenteditable="false" target="_self">33</a><div class="footnote-content"><p>Touvron, Hugo, et al. "LLaMA: Open and Efficient Foundation Language Models." 2023.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-34" href="#footnote-anchor-34" class="footnote-number" contenteditable="false" target="_self">34</a><div class="footnote-content"><p>Abhimanyu Dubey, et al. &#8220;The llama 3 herd of models.&#8221; 2024</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-35" href="#footnote-anchor-35" class="footnote-number" contenteditable="false" target="_self">35</a><div class="footnote-content"><p>Xie, Yuxi, et al. "Monte Carlo Tree Search Boosts Reasoning via Iterative Preference Learning" 2024.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-36" href="#footnote-anchor-36" class="footnote-number" contenteditable="false" target="_self">36</a><div class="footnote-content"><p>Xu, Can, et al. "WizardLM: Empowering Large Language Models to Follow Complex Instructions." 2023.</p></div></div>]]></content:encoded></item><item><title><![CDATA[Anatomy of an Autonomous AI Agent]]></title><description><![CDATA[Exploring the Fundamental Building Blocks That Power Agentic AI Systems]]></description><link>https://melvintercan.com/p/anatomy-of-an-autonomous-ai-agent</link><guid isPermaLink="false">https://melvintercan.com/p/anatomy-of-an-autonomous-ai-agent</guid><dc:creator><![CDATA[Melvin Tercan]]></dc:creator><pubDate>Mon, 16 Dec 2024 15:02:44 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb7bda35e-7857-4152-992c-b860cd9d23fb_1872x1545.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>In my previous <a href="https://melvintercan.com/p/ai-is-eating-the-jobs">article</a>, I discussed the rise of agentic AI systems, tools that have the potential to transform business workflows by automating processes from start to finish. These systems aim to replace traditional, static tools with dynamic, adaptive agents capable of tackling complex challenges with unmatched efficiency. </p><div class="pullquote"><p><strong>Before we can fully understand how to design and implement entire agentic systems, we need to examine the individual components that make up an autonomous AI agent.</strong></p></div><p>Autonomous agents, like humans, rely on core elements such as identity, memory, planning, and action to function effectively. These components draw inspiration from human behavior, enabling agents to think, learn, and act in ways that feel natural and intuitive. At the same time, AI agents possess unique strengths, such as processing vast amounts of data, optimizing decisions in real time, and scaling operations far beyond human limitations. These insights are grounded in the latest research<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a> into autonomous agents, providing a clear framework for building systems that are both practical and groundbreaking.</p><h3><strong>The Four Pillars of Autonomy</strong></h3><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!g01l!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3cd3c534-35a8-421e-9975-4a05b88a97ac_1463x1793.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!g01l!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3cd3c534-35a8-421e-9975-4a05b88a97ac_1463x1793.png 424w, https://substackcdn.com/image/fetch/$s_!g01l!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3cd3c534-35a8-421e-9975-4a05b88a97ac_1463x1793.png 848w, https://substackcdn.com/image/fetch/$s_!g01l!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3cd3c534-35a8-421e-9975-4a05b88a97ac_1463x1793.png 1272w, https://substackcdn.com/image/fetch/$s_!g01l!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3cd3c534-35a8-421e-9975-4a05b88a97ac_1463x1793.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!g01l!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3cd3c534-35a8-421e-9975-4a05b88a97ac_1463x1793.png" width="1456" height="1784" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/3cd3c534-35a8-421e-9975-4a05b88a97ac_1463x1793.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1784,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:746902,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!g01l!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3cd3c534-35a8-421e-9975-4a05b88a97ac_1463x1793.png 424w, https://substackcdn.com/image/fetch/$s_!g01l!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3cd3c534-35a8-421e-9975-4a05b88a97ac_1463x1793.png 848w, https://substackcdn.com/image/fetch/$s_!g01l!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3cd3c534-35a8-421e-9975-4a05b88a97ac_1463x1793.png 1272w, https://substackcdn.com/image/fetch/$s_!g01l!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3cd3c534-35a8-421e-9975-4a05b88a97ac_1463x1793.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>At the core of every autonomous AI agent are four critical components: <strong>Profile</strong>, <strong>Memory</strong>, <strong>Planning</strong>, and <strong>Action</strong>. These systems function as an interconnected system, enabling agents to not only perform tasks but also adapt and improve continuously. Each component addresses a distinct aspect of intelligence, working together to create a cohesive and dynamic entity.</p><p>The <strong>Profile</strong> serves as the agent&#8217;s identity and personality, defining its roles, objectives, and the constraints within which it operates. This foundational element shapes how the agent interacts with the world, ensuring that its behavior aligns with its intended purpose and the expectations of users.</p><p>The <strong>Memory</strong> system acts as the agent&#8217;s experience bank, storing and retrieving information to inform future actions. By integrating short-term and long-term memory systems, agents can maintain context over ongoing tasks while building a repository of knowledge that enhances their capabilities over time.</p><p>The <strong>Planning</strong> system provides the agent with the ability to strategize and decompose complex objectives into actionable steps. Whether executing static plans or adapting dynamically to real-time feedback, this component ensures that the agent operates methodically and with purpose.</p><p>Finally, the <strong>Action</strong> component translates decisions into tangible outcomes. By leveraging internal reasoning capabilities and external tools, this component enables the agent to execute plans effectively while refining its approach based on feedback.</p><p>Together, these four pillars form the foundation of modern autonomous AI agents, empowering them to function as adaptive, intelligent entities in complex environments.</p><div><hr></div><h3><strong>Profile: Defining the Agent&#8217;s Identity</strong></h3><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!v0b0!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7253cb35-b479-4004-9cfa-9438959c4e35_1568x1613.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!v0b0!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7253cb35-b479-4004-9cfa-9438959c4e35_1568x1613.png 424w, https://substackcdn.com/image/fetch/$s_!v0b0!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7253cb35-b479-4004-9cfa-9438959c4e35_1568x1613.png 848w, https://substackcdn.com/image/fetch/$s_!v0b0!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7253cb35-b479-4004-9cfa-9438959c4e35_1568x1613.png 1272w, https://substackcdn.com/image/fetch/$s_!v0b0!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7253cb35-b479-4004-9cfa-9438959c4e35_1568x1613.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!v0b0!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7253cb35-b479-4004-9cfa-9438959c4e35_1568x1613.png" width="1456" height="1498" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/7253cb35-b479-4004-9cfa-9438959c4e35_1568x1613.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1498,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:441898,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!v0b0!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7253cb35-b479-4004-9cfa-9438959c4e35_1568x1613.png 424w, https://substackcdn.com/image/fetch/$s_!v0b0!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7253cb35-b479-4004-9cfa-9438959c4e35_1568x1613.png 848w, https://substackcdn.com/image/fetch/$s_!v0b0!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7253cb35-b479-4004-9cfa-9438959c4e35_1568x1613.png 1272w, https://substackcdn.com/image/fetch/$s_!v0b0!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7253cb35-b479-4004-9cfa-9438959c4e35_1568x1613.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>The profile component establishes the foundation for an agent&#8217;s behavior and decision-making, defining its purpose, personality, and role. By tailoring these attributes, the profile ensures agents interact effectively with their environment and align with user expectations, making them more relatable, functional, and human-like.</p><div class="pullquote"><p><strong>Without a clear profile, an agent is just a tool - not an expert.</strong></p></div><h4><strong>Basic Attributes</strong></h4><p>Basic attributes outline the core characteristics of an agent, such as its expertise, role, or professional domain. For instance, a financial advisor agent may be assigned attributes emphasizing analytical rigor and familiarity with economic trends. These attributes act as the agent&#8217;s &#8220;resume,&#8221; ensuring its actions are informed by relevant context and knowledge.</p><h4><strong>Psychological Traits</strong></h4><p>Psychological traits shape how agents behave, mimicking personalities like empathetic, assertive, or collaborative. For example, a customer service agent designed to handle sensitive issues might prioritize empathy and patience, creating a supportive and positive user experience. These traits allow agents to engage users in a manner that feels intuitive and human-centered.</p><h4><strong>Social Context</strong></h4><p>Social context defines the agent&#8217;s relationships and collaborative dynamics. This can include specifying its role within a team or its interaction model with other agents and humans. For instance, in a collaborative software project, one agent might function as a &#8220;project manager,&#8221; delegating tasks to others, while another acts as a &#8220;developer,&#8221; implementing the assigned components.</p><h4><strong>Creating Profiles</strong></h4><ul><li><p><strong>Manually Defined Profiles</strong>: Manually specifying profiles involves explicitly writing descriptions and constraints for the agent. For instance, &#8220;You are an analytical product manager focused on delivering data-driven insights to drive innovation.&#8221; While this approach ensures precision, it can be labor-intensive, particularly for large-scale implementations.</p></li><li><p><strong>Automated Profile Generation</strong>: Automated systems can generate diverse profiles by leveraging AI models. For example, by seeding profiles with a few initial traits, such as age or industry, the model can produce variations suited for different applications, saving time while ensuring scalability.</p></li><li><p><strong>Dataset-Informed Profiles</strong>: Real-world data can enhance profiles, making agents more relevant to specific scenarios. For instance, a sales agent&#8217;s profile could be informed by regional customer preferences, enabling it to tailor interactions effectively.</p></li></ul><p>The profile component acts as the backbone of an agent&#8217;s identity, shaping its interactions and decision-making processes. By carefully defining attributes, psychological traits, and social roles, profiles ensure agents align with their intended purpose and resonate with user expectations. Whether crafted manually, generated through AI, or informed by real-world data, robust profiles empower agents to perform seamlessly and adapt dynamically across various applications. This foundational layer ensures that every interaction, memory, and action reflects the agent&#8217;s unique purpose and value.</p><div><hr></div><h3><strong>Memory: The Agent&#8217;s Experience Bank</strong></h3><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!zoeI!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbc450c30-15d8-4d67-8a25-69d2eb1f1f74_1676x1596.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!zoeI!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbc450c30-15d8-4d67-8a25-69d2eb1f1f74_1676x1596.png 424w, https://substackcdn.com/image/fetch/$s_!zoeI!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbc450c30-15d8-4d67-8a25-69d2eb1f1f74_1676x1596.png 848w, https://substackcdn.com/image/fetch/$s_!zoeI!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbc450c30-15d8-4d67-8a25-69d2eb1f1f74_1676x1596.png 1272w, https://substackcdn.com/image/fetch/$s_!zoeI!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbc450c30-15d8-4d67-8a25-69d2eb1f1f74_1676x1596.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!zoeI!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbc450c30-15d8-4d67-8a25-69d2eb1f1f74_1676x1596.png" width="1456" height="1387" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/bc450c30-15d8-4d67-8a25-69d2eb1f1f74_1676x1596.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1387,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:471854,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!zoeI!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbc450c30-15d8-4d67-8a25-69d2eb1f1f74_1676x1596.png 424w, https://substackcdn.com/image/fetch/$s_!zoeI!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbc450c30-15d8-4d67-8a25-69d2eb1f1f74_1676x1596.png 848w, https://substackcdn.com/image/fetch/$s_!zoeI!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbc450c30-15d8-4d67-8a25-69d2eb1f1f74_1676x1596.png 1272w, https://substackcdn.com/image/fetch/$s_!zoeI!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbc450c30-15d8-4d67-8a25-69d2eb1f1f74_1676x1596.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>The <strong>memory component</strong> empowers agents to accumulate, recall, and reflect upon experiences, creating a feedback loop that enhances learning and adaptability. Inspired by human cognitive processes, the memory system integrates short-term and long-term components to navigate dynamic environments effectively.</p><div class="pullquote"><p><strong>An agent&#8217;s memory isn&#8217;t just storage; it&#8217;s the lens through which it learns and improves.</strong></p></div><h4><strong>Memory Structures</strong></h4><p>Much like human memory, an agent's memory is divided into short-term and long-term systems. Short-term memory acts as a transient workspace for managing immediate context. This is akin to remembering what someone said just moments ago during a conversation - it helps maintain coherence and enables meaningful responses. For example, an AI assistant might use short-term memory to hold recent customer queries, ensuring its answers remain relevant and conversational.</p><p>Long-term memory provides a more durable repository, consolidating key insights over time. Similar to how people remember important milestones or professional lessons learned, agents use long-term memory to store and retrieve knowledge that informs future decisions. A sales forecasting agent, for instance, could analyze years of sales data stored in long-term memory to detect seasonal trends and recommend optimal inventory levels.</p><h4><strong>Memory Formats</strong></h4><p>The way memories are stored also plays a vital role in agent performance.</p><ul><li><p><strong>Natural Language Memory</strong>: Memories can be recorded as plain text, capturing the nuances of user interactions or observations. This format is highly interpretable and enables rich, context-aware decision-making. An agent managing customer support might store previous interactions as natural language logs, allowing it to ensure continuity in follow-ups.</p></li><li><p><strong>Embedding-Based Memory</strong>: By encoding memories as vector embeddings, agents can retrieve relevant information efficiently based on similarity searches. To illustrate, when recommending a product, an agent could match current user preferences with past customer behavior stored in embeddings.</p></li><li><p><strong>Structured Data</strong>: Agents can store information in structured formats like databases or hierarchical lists. This allows for systematic organization and precise querying, such as retrieving specific financial records during audits or customer details for targeted marketing.</p></li></ul><h4><strong>Memory Operations</strong></h4><p>Memory operations govern how agents interact with stored information.</p><ul><li><p><strong>Writing Memory</strong>: Storing new information is akin to taking detailed meeting notes. An agent must decide which details to preserve - filtering for relevance and avoiding duplication. For instance, an agent tracking project updates might consolidate repetitive task notes into a concise summary.</p></li><li><p><strong>Reading Memory</strong>: Retrieving stored knowledge requires prioritization based on relevance, recency, and importance. An agent planning a delivery route could pull recent traffic data (recency), prioritize major road closures (relevance), and consider weather conditions (importance) to optimize decision-making.</p></li><li><p><strong>Reflection</strong>: Reflection allows agents to summarize past experiences and derive insights. Similar to how a team evaluates project outcomes to refine future strategies, an agent could analyze failed and successful customer service interactions to improve its performance.</p></li></ul><h4><strong>Hybrid Memory Systems</strong></h4><p>By integrating short-term and long-term memory, agents gain the ability to reason effectively in both immediate and strategic contexts. Imagine an inventory management agent that tracks real-time stock levels (short-term) while leveraging historical sales trends (long-term) to anticipate seasonal demand spikes. This dynamic capability enhances both day-to-day operations and long-term planning.</p><p>The memory component is like a knowledge management system in a business; short-term memory resembles an active project board for immediate tasks, while long-term memory acts as an archive of insights and strategies. </p><p>When designed effectively, it ensures continuity, allowing agents to draw from past experiences to inform present decisions. By integrating and balancing both short-term and long-term memory, agents can adapt to immediate demands while leveraging historical knowledge to make strategic, consistent, and impactful choices across dynamic and complex scenarios.</p><div><hr></div><h3><strong>Planning: Charting the Path Forward</strong></h3><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!LfMQ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3f74ce3d-a0be-4af5-985a-34160885c6a3_3812x3810.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!LfMQ!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3f74ce3d-a0be-4af5-985a-34160885c6a3_3812x3810.png 424w, https://substackcdn.com/image/fetch/$s_!LfMQ!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3f74ce3d-a0be-4af5-985a-34160885c6a3_3812x3810.png 848w, https://substackcdn.com/image/fetch/$s_!LfMQ!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3f74ce3d-a0be-4af5-985a-34160885c6a3_3812x3810.png 1272w, https://substackcdn.com/image/fetch/$s_!LfMQ!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3f74ce3d-a0be-4af5-985a-34160885c6a3_3812x3810.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!LfMQ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3f74ce3d-a0be-4af5-985a-34160885c6a3_3812x3810.png" width="1456" height="1455" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/3f74ce3d-a0be-4af5-985a-34160885c6a3_3812x3810.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1455,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1419484,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!LfMQ!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3f74ce3d-a0be-4af5-985a-34160885c6a3_3812x3810.png 424w, https://substackcdn.com/image/fetch/$s_!LfMQ!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3f74ce3d-a0be-4af5-985a-34160885c6a3_3812x3810.png 848w, https://substackcdn.com/image/fetch/$s_!LfMQ!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3f74ce3d-a0be-4af5-985a-34160885c6a3_3812x3810.png 1272w, https://substackcdn.com/image/fetch/$s_!LfMQ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3f74ce3d-a0be-4af5-985a-34160885c6a3_3812x3810.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>The <strong>planning component</strong> provides agents with the ability to deconstruct complex goals into actionable steps, enabling coherent and methodical execution. Much like humans, agents rely on structured planning to approach challenges in a manageable way.</p><div class="pullquote"><p><strong>Good planning isn&#8217;t about predicting the future - it&#8217;s about adapting to it.</strong></p></div><h4><strong>Planning Without Feedback</strong></h4><p>In some scenarios, agents generate and execute plans without receiving intermediate feedback to adjust their strategies. These plans are created using predefined frameworks or processes, suitable for straightforward tasks.</p><ul><li><p><strong>Single-Path Reasoning:</strong> This involves breaking down a task into sequential steps, with each step leading directly to the next. It mirrors a to-do list approach, where completing one step logically progresses to the next. For instance, an agent organizing an event might follow steps such as booking a venue, arranging catering, and finalizing invitations in a fixed sequence. While effective for predictable scenarios, this approach lacks adaptability when conditions change.</p></li><li><p><strong>Multi-Path Reasoning:</strong> This strategy involves exploring multiple possible solutions concurrently, akin to brainstorming different routes to solve a problem. By evaluating various pathways, the agent can select the most promising one. For example, a product design agent might explore different prototypes simultaneously and refine the most viable option based on user testing results.</p></li><li><p><strong>External Planning Tools:</strong> In complex or domain-specific scenarios, agents can integrate external tools to enhance planning. A healthcare scheduling agent could use specialized software to optimize appointments based on patient preferences, doctor availability, and equipment constraints. By leveraging such tools, agents can generate precise and efficient plans.</p></li></ul><p>Planning without feedback is like a project manager creating a detailed plan for a product launch and sticking to it rigidly, regardless of market changes or team setbacks. While this approach ensures speed and consistency for predictable tasks, it falls short in dynamic scenarios where flexibility is crucial. </p><h4><strong>Planning With Feedback</strong></h4><p>For dynamic and unpredictable environments, agents employ iterative planning processes that incorporate feedback to refine their strategies.</p><ul><li><p><strong>Environmental Feedback:</strong> This involves adjusting plans based on changes in the environment or observed outcomes. A logistics agent may revise a delivery route in real time to account for traffic congestion, ensuring timely deliveries despite evolving road conditions.</p></li><li><p><strong>Human Feedback:</strong> By soliciting guidance from users, agents can align their strategies with human preferences and expectations. A design assistant might request feedback on a prototype before finalizing its design, allowing iterative refinements based on client input.</p></li><li><p><strong>Model Feedback:</strong> Agents can self-assess their reasoning and actions using internal evaluative models. A financial analysis agent might critique its investment recommendations against historical data patterns, iterating on its conclusions to enhance accuracy.</p></li></ul><p>Planning with feedback, on the other hand, is akin to a sales team adjusting its pitch in real time based on customer reactions during a meeting. While more resource-intensive and complex, it enables agents to respond effectively to unforeseen challenges, making it indispensable for tasks involving long-term reasoning or high variability. </p><p>Each method has its place: the former is ideal for stable, routine operations, while the latter is better suited to complex, evolving environments where adaptability is key.</p><div><hr></div><h3><strong>Action: From Decision to Execution</strong></h3><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!DSL2!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb7bda35e-7857-4152-992c-b860cd9d23fb_1872x1545.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!DSL2!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb7bda35e-7857-4152-992c-b860cd9d23fb_1872x1545.png 424w, https://substackcdn.com/image/fetch/$s_!DSL2!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb7bda35e-7857-4152-992c-b860cd9d23fb_1872x1545.png 848w, https://substackcdn.com/image/fetch/$s_!DSL2!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb7bda35e-7857-4152-992c-b860cd9d23fb_1872x1545.png 1272w, https://substackcdn.com/image/fetch/$s_!DSL2!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb7bda35e-7857-4152-992c-b860cd9d23fb_1872x1545.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!DSL2!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb7bda35e-7857-4152-992c-b860cd9d23fb_1872x1545.png" width="1456" height="1202" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b7bda35e-7857-4152-992c-b860cd9d23fb_1872x1545.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1202,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:463981,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!DSL2!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb7bda35e-7857-4152-992c-b860cd9d23fb_1872x1545.png 424w, https://substackcdn.com/image/fetch/$s_!DSL2!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb7bda35e-7857-4152-992c-b860cd9d23fb_1872x1545.png 848w, https://substackcdn.com/image/fetch/$s_!DSL2!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb7bda35e-7857-4152-992c-b860cd9d23fb_1872x1545.png 1272w, https://substackcdn.com/image/fetch/$s_!DSL2!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb7bda35e-7857-4152-992c-b860cd9d23fb_1872x1545.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>The <strong>action component</strong> serves as the bridge between planning and real-world impact, transforming decisions into tangible outcomes. It is the final, yet pivotal, step where all preceding components converge to interact with the environment effectively.</p><div class="pullquote"><p><strong>The ultimate measure of intelligence is action.</strong></p></div><h4><strong>Action Goals</strong></h4><p>Actions taken by an agent are goal-oriented and shaped by the overarching objectives of its tasks. These goals often fall into three primary categories:</p><ul><li><p><strong>Task Completion:</strong> In many scenarios, the agent&#8217;s purpose is to complete specific, well-defined tasks. For example, an AI agent in e-commerce may automate order processing and inventory updates, ensuring operational efficiency. Similarly, in the context of software development, agents may execute unit tests or compile code based on predefined requirements.</p></li><li><p><strong>Communication:</strong> Effective communication is a crucial action goal, especially for agents working in collaborative or customer-facing roles. For instance, a customer support agent may communicate empathetically with users to resolve issues, while collaborative agents within a development team exchange structured information to coordinate efforts.</p></li><li><p><strong>Environment Exploration:</strong> Certain agents are designed to explore unfamiliar territories or environments to gather data or expand their capabilities. For example, a supply chain agent might analyze market trends by scanning competitor websites and extracting useful insights, refining its decision-making process iteratively.</p></li></ul><h4><strong>Action Production</strong></h4><p>Translating decisions into executable actions can occur through various strategies:</p><ul><li><p><strong>Memory-Based Actions:</strong> Actions often leverage stored knowledge. For instance, an agent retrieving prior successful strategies for handling specific customer complaints can adaptively address recurring issues. Memory-driven actions ensure continuity and relevance in execution.</p></li><li><p><strong>Plan-Based Actions:</strong> Some actions strictly adhere to pre-generated plans. A marketing automation agent, for example, may execute a pre-approved campaign schedule, systematically deploying ads, and analyzing engagement metrics without deviation unless prompted by dynamic inputs.</p></li></ul><h4><strong>Action Space</strong></h4><p>The breadth of possible actions an agent can perform is dictated by its capabilities and integration with external tools:</p><ul><li><p><strong>External Tools:</strong> Agents often utilize APIs, databases, or specialized models to extend their action space. For instance, a financial agent might access external economic data APIs to inform portfolio adjustments.</p></li><li><p><strong>Internal Knowledge:</strong> LLMs' inherent reasoning, conversational, and common-sense capabilities enable agents to make informed decisions. For example, an AI consultant might provide nuanced business recommendations purely based on its internal reasoning processes.</p></li></ul><h4><strong>Action Impact</strong></h4><p>Actions inevitably have consequences that ripple across various dimensions:</p><ul><li><p><strong>Environmental Changes:</strong> Agents can alter their surroundings, such as updating inventory levels in a warehouse management system or generating new content for social media campaigns.</p></li><li><p><strong>Internal Adaptations:</strong> Actions often result in updated internal states. An agent completing a sub-task may refine its long-term memory with insights, improving its future performance.</p></li><li><p><strong>Triggered Follow-Up Actions:</strong> In complex workflows, one action may cascade into others. For instance, sending an invoice could trigger payment tracking and follow-up reminders if deadlines aren&#8217;t met.</p></li></ul><p>The action component is akin to a customer service representative implementing solutions. Without a robust system, actions might be reactive or misaligned with company goals. With an optimized action system, like an agent using CRM data, every interaction aligns with customer needs and business objectives, whether solving a support ticket, upselling a product, or adjusting based on feedback. </p><p>By effectively leveraging internal knowledge, external tools, and real-time inputs, the action component ensures agents not only execute tasks efficiently but continuously improve their performance, driving meaningful business outcomes.</p><div><hr></div><h3><strong>Capability Acquisition: Building Agent Skills</strong></h3><p>While the architecture of an agent serves as its &#8220;hardware,&#8221; the true effectiveness of an autonomous AI agent lies in its ability to acquire the &#8220;software&#8221; it needs: task-specific skills, knowledge, and experiences. Capability acquisition is a critical process that enables agents to grow and adapt, transforming them from general-purpose tools into highly specialized entities capable of handling complex and diverse tasks.</p><div class="pullquote"><p><strong>Capability acquisition is the key to transforming autonomous agents from static tools into adaptive, versatile systems that thrive in complex and dynamic environments.</strong></p></div><h4><strong>Acquiring Capabilities Through Fine-Tuning</strong></h4><p>One of the most effective ways to enhance agent performance is by fine-tuning large language models (LLMs) using task-specific datasets. These datasets can be created through several methods, including human annotation, LLM-generated content, or real-world data collection. For example, fine-tuning with datasets such as those collected from e-commerce platforms or web interactions, provide valuable context to optimize agents for domain-specific challenges.</p><p>Fine-tuning allows agents to integrate substantial task-specific knowledge into their model parameters, making them highly effective at addressing particular use cases. However, this method is most suitable for open-source LLMs, as it requires direct access to the model&#8217;s architecture.</p><h4><strong>Enhancing Capabilities Without Fine-Tuning</strong></h4><p>When fine-tuning is not feasible, agents can acquire new capabilities through techniques such as prompt engineering and mechanism engineering. </p><ul><li><p><strong>Prompt engineering</strong> leverages the natural language understanding of LLMs to describe desired behaviors or provide few-shot examples. For instance, including intermediate reasoning steps in a prompt can significantly improve an agent&#8217;s problem-solving capabilities. Similarly, prompts that incorporate social or reflective contexts can enhance conversational adaptability and self-awareness.</p></li><li><p><strong>Mechanism engineering</strong>, on the other hand, involves creating new operational strategies for agents. This can include iterative trial-and-error processes, where agents learn from feedback to refine their actions, or the development of self-driven evolution systems, where agents set goals and explore environments autonomously. These strategies enable agents to continuously improve their performance without altering their underlying model parameters.</p></li></ul><h4><strong>Building a Learning Framework</strong></h4><p>Agents can also acquire new capabilities through frameworks that emphasize experience accumulation and collaborative problem-solving. By storing and refining successful actions in a memory system or a skill library, agents can draw on past experiences to solve similar tasks more efficiently in the future. Collaborative approaches, where agents exchange knowledge and adjust their roles dynamically, further enhance their ability to tackle complex problems that require collective intelligence.</p><div><hr></div><h3><strong>Evaluation: Measuring Agent Performance</strong></h3><p>Evaluating the performance and effectiveness of LLM-based autonomous agents is a challenging yet crucial task. While these agents hold the potential to revolutionize workflows across industries, assessing their capabilities requires a nuanced approach that balances qualitative insights with quantitative rigor. </p><div class="pullquote"><p><strong>Evaluation is not just about measuring performance - it&#8217;s about understanding how well agents align with human needs and expectations while excelling in task execution.</strong> </p></div><h4><strong>Subjective Evaluation</strong></h4><p>Subjective evaluation focuses on human judgment to assess the agent&#8217;s capabilities in tasks where no standardized datasets or metrics exist. This approach is particularly useful for evaluating aspects such as user-friendliness, creativity, or the agent&#8217;s ability to mimic human-like behavior.</p><ul><li><p><strong>Human Annotation: </strong>In this method, human evaluators score or rank the agent&#8217;s outputs. For instance, annotators might rate agents on qualities like engagement, helpfulness, and honesty, as seen in studies where agents&#8217; outputs are compared against human benchmarks. This approach captures nuanced feedback that reflects the real-world impact of an agent&#8217;s behavior.</p></li><li><p><strong>Turing Test: </strong>The Turing Test involves asking human evaluators to differentiate between outputs produced by agents and those created by humans. If the evaluators are unable to distinguish between the two, the agent demonstrates human-like performance. This strategy has been widely used to assess agents&#8217; ability to generate human-like responses, emotional intelligence, and decision-making skills.</p></li></ul><h4><strong>Objective Evaluation</strong></h4><p>Objective evaluation employs quantifiable metrics to measure agent performance, offering a more systematic and scalable approach. This method focuses on three key aspects: metrics, protocols, and benchmarks.</p><p><strong>Metrics</strong><br>Evaluation metrics are designed to capture specific dimensions of agent performance.</p><ul><li><p><strong>Task Success</strong>: Metrics such as success rate, accuracy, and goal completion are used to assess how effectively the agent achieves its objectives.</p></li><li><p><strong>Human Similarity</strong>: Metrics like dialogue similarity, trajectory accuracy, and mimicry of human responses gauge how closely the agent&#8217;s behavior aligns with human norms.</p></li><li><p><strong>Efficiency</strong>: This includes measures like planning speed, cost of execution, and inference time, which evaluate the agent&#8217;s operational efficiency.</p></li></ul><p><strong>Protocols</strong><br>Protocols define how metrics are applied in different contexts:</p><ul><li><p><strong>Real-World Simulation</strong>: Agents are tested in immersive environments such as games or simulators, where task success and human similarity metrics can be observed in action.</p></li><li><p><strong>Social Evaluation</strong>: Agents are assessed based on their interactions in collaborative or competitive settings, analyzing qualities like teamwork, empathy, and communication.</p></li><li><p><strong>Multi-Task Evaluation</strong>: A diverse set of tasks from different domains is used to measure the agent&#8217;s generalization capability.</p></li><li><p><strong>Software Testing</strong>: Metrics like bug detection rate and test coverage are used to evaluate agents in coding and debugging scenarios.</p></li></ul><p><strong>Benchmarks</strong><br>Benchmarks provide standardized environments and datasets for consistent evaluation. Examples include AgentBench<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a> for general-purpose assessments, WebShop<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-3" href="#footnote-3" target="_self">3</a> for e-commerce capabilities, and EmotionBench<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-4" href="#footnote-4" target="_self">4</a> for evaluating emotional intelligence. These benchmarks ensure that agents are tested against a wide range of real-world challenges, providing valuable insights into their adaptability and robustness.</p><div><hr></div><h3><strong>Challenges: Key Barriers Ahead</strong></h3><p>While LLM-based autonomous agents have achieved significant milestones, the field is still in its early stages. There are numerous challenges that researchers and developers must overcome to unlock the full potential of these systems. Below, we outline some of the most critical challenges shaping the trajectory of development in this domain.</p><h4><strong>Role-Playing Capability</strong></h4><p>Autonomous agents are often required to assume specific roles, such as a researcher, programmer, or teacher, to complete tasks effectively. While LLMs can simulate some roles convincingly, they struggle with less common or emerging roles, as well as aspects of human cognition like self-awareness in conversations. This limitation stems from the datasets on which these models are trained, which may lack diverse role-specific data. Fine-tuning with curated datasets or designing optimized prompts and architectures may improve role-playing, but balancing these enhancements with general-purpose performance remains a significant challenge.</p><h4><strong>Generalized Human Alignment</strong></h4><p>For agents to serve humans effectively, they must align with human values. However, in applications like real-world simulations, agents may need to replicate both positive and negative human behaviors to provide accurate models of societal dynamics. This dual alignment is particularly challenging as most LLMs are optimized for positive, unified human values. The development of prompting strategies or controlled re-alignment techniques is needed to tailor agents to diverse scenarios without compromising ethical standards.</p><h4><strong>Prompt Robustness</strong></h4><p>The integration of complex modules like memory and planning requires structured, reliable prompts to ensure consistent agent behavior. However, even slight modifications to prompts can cause significant deviations in agent outputs. This challenge is exacerbated by the interconnected nature of agent modules, where changes in one prompt may affect others. Developing unified and resilient prompt frameworks that are robust across diverse tasks and LLMs remains an open problem.</p><h4><strong>Hallucination</strong></h4><p>Hallucination, where agents confidently produce incorrect information, poses a major challenge in high-stakes applications. For example, in coding tasks, hallucinations can generate erroneous outputs with potentially severe consequences. Addressing hallucinations requires iterative feedback mechanisms, improved training datasets, and fail-safe systems to validate outputs before execution.</p><h4><strong>Knowledge Boundary</strong></h4><p>LLMs possess extensive knowledge from training on vast datasets, which can sometimes hinder their ability to simulate realistic human behaviors. For instance, when tasked with replicating user behaviors with limited prior knowledge, agents may leverage their extensive training corpus inappropriately. Constraining agents to appropriate knowledge levels requires innovative methods for controlling context and access to information.</p><h4><strong>Efficiency</strong></h4><p>Agents often need to query LLMs multiple times for tasks such as memory retrieval, planning, and decision-making. The inherent slowness of LLMs due to their autoregressive architecture creates significant efficiency bottlenecks. Addressing this requires optimizing agent workflows, leveraging caching or batching strategies, and developing faster inference techniques.</p><div><hr></div><p><strong>In upcoming issues, we&#8217;ll dive even deeper into how to design autonomous agents. Subscribe to stay ahead of the curve and learn how to shape the future of work.</strong></p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://melvintercan.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://melvintercan.com/subscribe?"><span>Subscribe now</span></a></p><p></p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>Wang, L., Ma, C., Feng, X. <em>et al.</em> A survey on large language model based autonomous agents. <em>Front. Comput. Sci.</em> <strong>18</strong>, 186345 (2024).</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>Liu X, Yu H, Zhang H, Xu Y, Lei X, Lai H, Gu Y, Ding H, Men K, Yang K, Zhang S, Deng X, Zeng A, Du Z, Zhang C, Shen S, Zhang T, Su Y, Sun H, Huang M, Dong Y, Tang J. AgentBench: evaluating LLMs as agents. 2023, arXiv preprint arXiv: 2308.03688</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-3" href="#footnote-anchor-3" class="footnote-number" contenteditable="false" target="_self">3</a><div class="footnote-content"><p>Yao S, Chen H, Yang J, Narasimhan K. WebShop: towards scalable real-world Web interaction with grounded language agents. In: Proceedings of the 36th Conference on Neural Information Processing Systems. 2022, 20744&#8722;20757</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-4" href="#footnote-anchor-4" class="footnote-number" contenteditable="false" target="_self">4</a><div class="footnote-content"><p>Huang J T, Lam M H, Li E J, Ren S, Wang W, Jiao W, Tu Z, Lyu M R. Emotionally numb or empathetic? Evaluating how LLMs feel using emotionbench. 2024, arXiv preprint arXiv: 2308.03656</p><p></p></div></div>]]></content:encoded></item><item><title><![CDATA[AI Is Eating the Jobs]]></title><description><![CDATA[Hundreds of vertically integrated AI startups are busy replacing entire teams. At the same time, starting and scaling a billion dollar business has never been easier.]]></description><link>https://melvintercan.com/p/ai-is-eating-the-jobs</link><guid isPermaLink="false">https://melvintercan.com/p/ai-is-eating-the-jobs</guid><dc:creator><![CDATA[Melvin Tercan]]></dc:creator><pubDate>Mon, 09 Dec 2024 15:30:59 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!DI2k!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa6601fe8-2c07-4e12-8cc7-87d47ae5811f_1024x1024.webp" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!DI2k!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa6601fe8-2c07-4e12-8cc7-87d47ae5811f_1024x1024.webp" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!DI2k!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa6601fe8-2c07-4e12-8cc7-87d47ae5811f_1024x1024.webp 424w, https://substackcdn.com/image/fetch/$s_!DI2k!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa6601fe8-2c07-4e12-8cc7-87d47ae5811f_1024x1024.webp 848w, https://substackcdn.com/image/fetch/$s_!DI2k!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa6601fe8-2c07-4e12-8cc7-87d47ae5811f_1024x1024.webp 1272w, https://substackcdn.com/image/fetch/$s_!DI2k!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa6601fe8-2c07-4e12-8cc7-87d47ae5811f_1024x1024.webp 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!DI2k!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa6601fe8-2c07-4e12-8cc7-87d47ae5811f_1024x1024.webp" width="1024" height="1024" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a6601fe8-2c07-4e12-8cc7-87d47ae5811f_1024x1024.webp&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1024,&quot;width&quot;:1024,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:520142,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/webp&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!DI2k!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa6601fe8-2c07-4e12-8cc7-87d47ae5811f_1024x1024.webp 424w, https://substackcdn.com/image/fetch/$s_!DI2k!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa6601fe8-2c07-4e12-8cc7-87d47ae5811f_1024x1024.webp 848w, https://substackcdn.com/image/fetch/$s_!DI2k!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa6601fe8-2c07-4e12-8cc7-87d47ae5811f_1024x1024.webp 1272w, https://substackcdn.com/image/fetch/$s_!DI2k!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa6601fe8-2c07-4e12-8cc7-87d47ae5811f_1024x1024.webp 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>More than a decade ago, Marc Andreessen famously wrote that <a href="https://a16z.com/why-software-is-eating-the-world/">software is eating the world</a>. Back then, the idea that digital platforms would disrupt entire industries seemed bold. Yet time proved him right: software solutions did spread through every sector, transforming the way we consume media, shop online, and conduct business. <strong>Now, AI is not just eating the world; it&#8217;s eating the jobs along with it.</strong></p><p>As we speak, countless AI startups are building vertical solutions that go beyond assisting human workers. They&#8217;re aiming to replace them entirely. <strong>No job is safe in this wave of automation, from the simplest data entry position to the most senior data analyst.</strong> We&#8217;re talking about entire teams being replaced by systems that learn continuously and never clock out.</p><p>Big Tech CEOs often downplay these concerns. They&#8217;ll tell you that people will find meaning in creative and artistic work, that new types of jobs will emerge requiring everyone to be retrained, or that society will adopt a wealth dividend so nobody has to work anymore. But none of these ideas hold up under scrutiny. AI is already replacing creative and artistic work, producing everything from paintings and music to screenplays and advertising campaigns. There&#8217;s no clear evidence that the pace of new job creation will match the speed at which AI displaces old ones. And the wealth dividend? It&#8217;s more a hypothetical discussion than a concrete plan, with no clear path for implementation. <strong>The reality is this: the only way to thrive in this new era is to embrace it and start a business yourself, leveraging AI as the powerful tool it is becoming.</strong></p><p>Just as the SaaS boom of the 2000s created a new generation of tech giants by lowering barriers for software distribution, AI will be able to handle the same work (or better) without the need for large teams of people. <strong>For every iconic SaaS success story, there could soon be an $300B+ AI-driven agent company.</strong> The difference? These businesses don&#8217;t just save on software costs; they thrive with small, efficient teams while achieving massive scale.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!3Iu0!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe8663196-02cc-415c-aeb4-30b508c2b85e_1024x1024.webp" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!3Iu0!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe8663196-02cc-415c-aeb4-30b508c2b85e_1024x1024.webp 424w, https://substackcdn.com/image/fetch/$s_!3Iu0!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe8663196-02cc-415c-aeb4-30b508c2b85e_1024x1024.webp 848w, https://substackcdn.com/image/fetch/$s_!3Iu0!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe8663196-02cc-415c-aeb4-30b508c2b85e_1024x1024.webp 1272w, https://substackcdn.com/image/fetch/$s_!3Iu0!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe8663196-02cc-415c-aeb4-30b508c2b85e_1024x1024.webp 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!3Iu0!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe8663196-02cc-415c-aeb4-30b508c2b85e_1024x1024.webp" width="1024" height="1024" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/e8663196-02cc-415c-aeb4-30b508c2b85e_1024x1024.webp&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1024,&quot;width&quot;:1024,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:523450,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/webp&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!3Iu0!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe8663196-02cc-415c-aeb4-30b508c2b85e_1024x1024.webp 424w, https://substackcdn.com/image/fetch/$s_!3Iu0!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe8663196-02cc-415c-aeb4-30b508c2b85e_1024x1024.webp 848w, https://substackcdn.com/image/fetch/$s_!3Iu0!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe8663196-02cc-415c-aeb4-30b508c2b85e_1024x1024.webp 1272w, https://substackcdn.com/image/fetch/$s_!3Iu0!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe8663196-02cc-415c-aeb4-30b508c2b85e_1024x1024.webp 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Much like the early days of SaaS&#8212;when the idea of subscribing to software over the internet felt fresh&#8212;AI-based services are still in their infancy. But every three months, these models become smarter, faster, and more reliable, unlocking new verticals ripe for takeover.</p><p><strong>We&#8217;re at a point where, if you look around for any boring, repetitive administrative jobs, you&#8217;ll likely find a billion-dollar AI opportunity.</strong> These tedious chores are exactly what AI excels at, and a determined founder can leverage these advanced tools to build a lean, high-margin business that runs on autopilot. We&#8217;ve already seen early wins. There are AI services <a href="https://momentic.ai/">handling full-scale testing automation</a>, <a href="https://trysalient.com/">voice-based outreach for auto loan collections</a>, and <a href="https://www.kapa.ai/">specialized chatbots that replace entire developer relations teams</a>. Instead of humans spending their days on repetitive work, AI takes over, freeing individuals to do something else&#8212;or leaving them out of the loop entirely.</p><p>Of course, the go-to-market strategy for these AI startups looks very different from what we saw during the software boom. <strong>If you try to sell these AI solutions to the very team they&#8217;re going to replace, you&#8217;ll face fierce resistance.</strong> Those employees, knowing the automation threatens their roles, have no incentive to adopt the technology that makes them redundant. <strong>Instead, successful AI startups focus on high-level decision makers&#8212;executives who see the big picture and care about bottom-line efficiency.</strong> From their vantage point, fewer employees and more output is a no-brainer. And as AI agents deliver quantifiable results&#8212;no salaries, no HR overhead, no burnout&#8212;these decision makers will choose the technology that streamlines their operations.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!u0N7!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F02d5d8d7-9615-4c2b-b91c-f8b38b0b5ade_1024x1024.webp" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!u0N7!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F02d5d8d7-9615-4c2b-b91c-f8b38b0b5ade_1024x1024.webp 424w, https://substackcdn.com/image/fetch/$s_!u0N7!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F02d5d8d7-9615-4c2b-b91c-f8b38b0b5ade_1024x1024.webp 848w, https://substackcdn.com/image/fetch/$s_!u0N7!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F02d5d8d7-9615-4c2b-b91c-f8b38b0b5ade_1024x1024.webp 1272w, https://substackcdn.com/image/fetch/$s_!u0N7!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F02d5d8d7-9615-4c2b-b91c-f8b38b0b5ade_1024x1024.webp 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!u0N7!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F02d5d8d7-9615-4c2b-b91c-f8b38b0b5ade_1024x1024.webp" width="1024" height="1024" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/02d5d8d7-9615-4c2b-b91c-f8b38b0b5ade_1024x1024.webp&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1024,&quot;width&quot;:1024,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:452634,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/webp&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!u0N7!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F02d5d8d7-9615-4c2b-b91c-f8b38b0b5ade_1024x1024.webp 424w, https://substackcdn.com/image/fetch/$s_!u0N7!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F02d5d8d7-9615-4c2b-b91c-f8b38b0b5ade_1024x1024.webp 848w, https://substackcdn.com/image/fetch/$s_!u0N7!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F02d5d8d7-9615-4c2b-b91c-f8b38b0b5ade_1024x1024.webp 1272w, https://substackcdn.com/image/fetch/$s_!u0N7!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F02d5d8d7-9615-4c2b-b91c-f8b38b0b5ade_1024x1024.webp 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>This new dynamic means that old constraints on company size and management complexity will break down. With vertical AI agents at the helm, companies no longer need armies of staff to achieve massive scale. <strong>A small founding team, equipped with the right AI systems, can run an enterprise that serves millions of customers.</strong> A business that once demanded offices, HR teams, and multiple management layers can now exist almost entirely in the cloud, orchestrated by a handful of AI models that coordinate production, distribution, marketing, and sales. <strong>It&#8217;s the birth of the billion-dollar, one-person company&#8212;a vision that sounds wild today but could become routine tomorrow.</strong></p><p>In the long run, AI will likely remain vertical and specialized rather than consolidating into a few dominant platforms. Just as SaaS products proliferated into highly specific niches, AI agents will become ever more tailored, filling tightly defined roles. From legal document review to dental insurance claim processing, each narrow market offers a chance for AI to replace human effort with near-perfect, always-on service. <strong>Vertical agentic systems will pave the way for entirely new categories of companies.</strong></p><div><hr></div><p><em>Personal Note:</em></p><p>&#128075; I&#8217;m <a href="https://www.linkedin.com/in/melvintercan/">Melvin Tercan</a>. I&#8217;m the Head of Engineering at <a href="https://www.octagonai.co/">Octagon AI</a>, an vertical agentic AI startup with a mission to build agentic systems that empower financial analysts to do more impactful work. <strong>Over the past three years,</strong> <strong>I&#8217;ve gained deep insights into the design of agentic systems and how they&#8217;re reshaping industries.</strong></p><p>If you&#8217;re interested in learning how to design and build agentic systems, subscribe for free to this newsletter. Let&#8217;s build the future together!</p><div><hr></div><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://melvintercan.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading <em>Designing Agentic Systems</em>! Subscribe for free to explore the future of Agentic AI systems &amp; learn how to automate complex workflows.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><div><hr></div><p></p>]]></content:encoded></item></channel></rss>