How Intercom 2x’d their engineering velocity in 9 months with Claude Code

Overview

This episode explores how Intercom dramatically increased engineering throughput by going “all in” on AI-assisted software development, especially with Cloud Code and agent-first workflows. Senior Principal Engineer Brian Scanlan explains how Intercom treated internal AI adoption like a product: setting clear goals, instrumenting usage, building internal skills and hooks, and redesigning engineering workflows around agents rather than just faster autocomplete.

Key Takeaways

The biggest insight is that Intercom did not achieve gains by casually adding AI tools to existing habits. They committed organizationally: leadership set an explicit goal to double R&D throughput, teams were enabled to experiment, and the company invested in telemetry, quality controls, and reusable internal tooling. The result was a 2x increase in pull requests per R&D head over roughly nine months.

A counterintuitive point is that AI did not appear to reduce code quality. If anything, Intercom’s internal and external signals suggested quality improved. Scanlan noted no meaningful increase in incidents, while research support from Stanford indicated code quality was trending upward. His explanation is practical: when AI compresses the cost of maintenance work, teams can finally address flaky tests, tech debt, CI bottlenecks, and poor developer experience that usually stay deprioritized.

Another notable theme is that the main barrier has shifted from typing speed to imagination. Once models became good enough, the limiting factor was no longer whether the tool could produce code, but whether teams were rethinking how work should be done in an “agent-first” world. Intercom’s approach was to move beyond autocomplete and ask how planning, debugging, PR creation, test fixing, and even product setup could be restructured around agents.

The conversation also highlights that AI adoption at scale requires infrastructure, not just enthusiasm. Intercom built telemetry into skills, stored and analyzed session data, and used that information to improve workflows. They even created hooks to force better PR descriptions because left alone, AI-generated descriptions were getting worse. This reflects a broader lesson: high performance comes from designing guardrails and feedback loops, not trusting raw model output.

Practical Steps

Set a concrete adoption target. Intercom used PR throughput as a simple leading indicator. Pick one imperfect but useful metric and track it consistently.
Treat your engineering org like a product. Instrument AI workflows with telemetry so you can see which skills are used, where people get stuck, and what actually improves outcomes.
Start with painkillers, not demos. Identify everything your team hates about the codebase—flaky tests, slow CI, poor docs, legacy services—and spend a month using AI to attack that backlog.
Build reusable skills and hooks for common workflows. Good candidates include PR creation, CI interaction, test debugging, log lookup, and internal admin tooling.
Enforce quality upstream. If AI outputs weak PR descriptions, bad branching behavior, or inconsistent patterns, add automated hooks so agents follow your standards by default.
Give people protected time to explore. The episode suggests that time away from regular sprint work often accelerates AI literacy because people return with better instincts and workflows.
Prepare for rising AI costs by treating them as investment spend early on, but delay optimization until you understand where the real productivity gains are.
Audit your product for agent usability. If an agent cannot sign up, verify email, or configure your tool easily, you may lose future users before they ever reach your interface.

Notable Quotes

“Your imagination is now the barrier, not the tool.” — Brian Scanlan

“Backlog zero is a realistic thing for teams to be able to go after.” — Brian Scanlan

“Everything you hate about the code base, go spend a month fixing and see how fast we can speedrun that.” — Claire Vooght

Full Transcript

Source: openai 1h 18m runtime

Suddenly, you started realizing that you have to think bigger about things, or that your imagination is now the barrier, not the tool. How is this not happening in your organization? Like, literally, the physical limits of my ability to type code are unlocked by AI. Today, we are seeing twice the number of throughputs as we did compared to nine months ago on our engineering team. Now it's like, why can't it be 10x? This is a little bit more of what my instinct tells me is possible, which is if you go all in, if you prepare your team, if you prepare your code base, I think your overall product quality is going to go up. I think your overall developer experience is going up. There's just so many good things that come out of using these tools and using them correctly. Backlog zero is a realistic thing for teams to be able to go after. All the things that you wish you ever wanted to do, it's now just achievable. I often advise a lot of CTOs and VPs of engineering, when figuring out how to get their engineering team AI-pilled, say, everything you hate about the code base, go spend a month fixing and see how fast we can speedrun that. That's going to feel really good. I've been having the most amount of fun in my career over the last three months. Welcome back to How I AI. I'm Claire Vogue, product leader and AI obsessive here on a mission to help you build better with these new tools. Today, I am showing how Intercom 2x'd the number of PRs that their R&D department is shipping in just a few months. Brian Scanlon is a senior principal engineer at Intercom, and he is going to show us truly all of their secrets to getting a large product and engineering organization cooking on cloud code. Let's get to it. This episode is brought to you by Cilego. Every company today wants AI to improve how work gets done. The fastest way is building it directly into everyday business processes, automating employee onboarding, keeping customer data accurate, managing orders and inventory, or resolving finance and operations issues. When AI lives inside the flow of work, it can update records, trigger approvals, route work, and kick off the next step across systems. That's how teams operationalize AI and deliver measurable results. Cilego makes this possible. And now, with Cilego Aura, it's never been easier. Cilego Aura gives you access to the entire platform through natural language, connecting your systems and turning intent into action. All of it under your control. Companies like Databricks, PayPal, and Olipop rely on Cilego to run critical business operations at scale. Ready to operationalize AI? Visit cilego.com slash howiai. That's C-E-L-I-G-O dot com slash howiai. Brian, welcome to HowIAI. Why I am so thrilled that you agreed to join the podcast is I think Intercom has done it. Which is, you all have met the moment in sort of two ways. One, clearly met the moment from a product perspective. We're one of the first companies that had, sorry, I don't want to say legacy business, but had a going concern business that saw AI coming and really transformed how your product worked for customers. And I'm a happy Finn customer. They did not tell me to say that. And then second, what we're going to talk about is the team at the moment in terms of really understanding AI was going to change how, in particular, product engineering and design orgs and engineering organizations were going to work. And you just went full speed at changing how the team works. What drove sort of the urgency around meeting the moment? How did that come to be? Was it a single person? Was it everybody? What was your experience? I think in some ways it's been the easiest place to be driving out the adoption of AI in engineering and product. And because we've focused the company so much on folks and product on adopting AI and being AI first and how we think about the product, future customer support and all that. And we also had very clear expectations. Like we've seen what's possible in the product space. And this is very clear and obvious to us as like connoisseurs of AI. It's like, this is clearly going to be huge in engineering and product and building. And honestly, there's been a lot of impatience for like, why isn't this happening today? You know, if we go back a few years and the cursor is picking up a bit of business and the models are getting better. But it still wasn't transformative. It still wasn't like the whole business was changed and we're seeing vast amounts of extra productivity. We knew there was potential, but it still felt like we needed to have some sort of breakthrough moments or something big had to happen for us to get to the kind of huge velocity wins that I think now we're starting to achieve. That said, we still want more, you know, we're proud of where we're at, but we're not content with what we've achieved so far. I feel like every three months I have a breakthrough moment. And in fact, I feel like Opus 4.6, I don't know, something just like really inflected in what was possible when that particular model came out. Now, I think the GPT 5.4 models are also exceptional. And so it's something about that one moment with models that really inflected my own personal use of AI and engineering. Did you all see the same sort of inflection around that model point? Totally. I think you can go back to like November, December last year. And suddenly you started realizing that you have to think bigger about things or that your imagination is now the barrier, not the tool. You're spending less time massaging the tool to get it to the right place. And it's less about autocomplete and more about just literally giving us your ideas and seeing what happens. I think the Christmas break happened as well. I remember we pretty much decided for Christmas, like, hey, we're going to go all in on cloud code. Because up to that point, there was a bit of cursory here and there and augmenting different tools. And the Christmas break really helped. Like I saw everybody go wild on Twitter X, you know, that people were talking about how this was like they were getting so much done and they were building all these things. It's come back to work after Christmas break going like, OK, everything's changed. Like we knew that there was something here and that we're starting to see the signs of it. But now the whole world is convinced, or at least all of the influencers on Twitter and that would be me. Yeah, I'm actually kind of convinced that companies should increase their PTO and parental leave policies because everybody I know right now in tech that is, quote unquote, taking time off, goes on their vacation and pops open cloud code and comes back like 10 times more skilled than they were before their time off. And so if anybody wants a little minor hack to AI literacy in your org, give people time off to hack and they will come back with more information than you expected. OK, I think we're going to skip to the punchline, which I love, which is we're going to see how AI has actually changed how you all ship at Intercom. So can you just show us a little bit of how this has changed inside the org? And I think you all are measuring a lot of this. Yeah, so I think we've been diligent as product owners inside of Intercom that we've been trying to get feedback from people and see how they're using the tools and really like just doing everything that we would normally do with a regular product. And so we've spent a lot of time hooking up cloud code with telemetry, both into things like Honeycomb and data also going into Snowflake where we have our data warehouse and we also store session data in S3 and we mine this stuff for useful insights. But one of the main things that we use to drive adoption of the tool was our CTO, Dara, setting a goal of us 2x'ing, like doubling the throughput of R&D and we use pull requests as a crude, simple measure, but, you know, you can argue back and forth about what's a good measure, what's a bad measure and whether measuring anything is appropriate or whatever. But I think it's reasonable to just have the expectation that if you can get a lot more done and it's so fast and fun, then why isn't everyone just shipping more stuff? And so it's a basic measure that the tools are being adopted and that they're being used well. And, you know, of course, we don't tolerate lowering quality and we're a high trust environment, so we don't expect people not to gain these stats or whatever. But our metrics and what I'm showing on the screen here is, you know, it's a classic number goes up kind of thing that where we started tracking this back, like how many PRs and what percentage of them were generated by either Cloud or Cursor or whatever. And, yeah, since our major investment in Cloud Code, the platform and going all in on this and really pushing out like enablement and giving people freedom to explore and start to build skills and everything, but also pushing them on, we expect kind of throughput increase. We've seen a big, big increase in the throughput of pull requests to our system. And, you know, like last year, like our CI system completely broke. It melted. You know, but I don't mean it got like 10 times as expensive. And, you know, we did the work. We fixed the bottlenecks. We improved the performance of our CI system. That stopped being a bottleneck. And now Coderview is our bottleneck. But today we are seeing twice the number of throughput as we did compared to nine months ago on our engineering team. And like we're very proud of that. And, you know, now it's like, why can't it be 10x? So what I love about this chart, just for a moment, is I had spent the last two decades of my career in product and engineering, last decade of my career as a CPTO. And it's so funny. I want to go back to a couple of things you said, which is, one, you have to treat your org like a product. And I always thought that my job was not just the product strategy and the capital P product that we were delivering to customers. It was to design our organization to, I would say, like output innovation on demand, which is that was the job. And less romantically put, my job is to invest R&D for positive enterprise value. That was like fundamentally my job as a CPTO. And so what I love about this is it's merged PRs per R&D head. I'm presuming that includes, does that include product managers and non-engineering R&D? Or is that purely software engineers? Yeah, this is all of R&D. And it's definitely the case that our designers and product managers and TPMs, like every role in intercom is really actively using cloud code and shipping code and all that. And also we've been hiring, like this number has not been static. So the number of PRs, the raw number is dramatically higher than just 2x what it was a good while ago. So this is everything from your newest hire to your product manager who's like adding some copy or shipping like small changes or whatever. You know, that's all based in this number. The other thing I want to call out for folks is every board meeting I have been in for the last three years have said, how are we getting? Well, actually, every board meeting I've ever been in, period, has been how can we get more velocity out of R&D? Certainly in the last three years, it's been how is AI inflecting our velocity? And it's so funny. I talk to so many people that are like, it doesn't really inflect velocity. We're not actually becoming that more efficient. And I'm like, is that true? Because I look at a chart like this and I say, this is a little bit more of what my instinct tells me is possible, which is if you go all in, if you prepare your team, if you prepare your code base, if you have, as you said, I think a high trust culture, people are going to look at this and say, oh, they're shipping these smaller PRs or like engineers are getting in the system. I just I have not worked at a place that has such kind of like bad culture that that would actually come as an outcome of setting some sort of ambitious, fun target like this. And so I take this as at face value. And I think, how is this not happening in your organization? And you're like literally the physical limits of my ability to type code are unlocked by by AI. You should get some inflection there. And so, you know, for VPs of engineering, CTO, even people that are on these R&D teams, look at this and think, you know, this is possible. And it may be a crude measurement, but it's, I think, an appropriate one as a leading indicator of what's happening around AI. Yeah, and we support this with not just telling people to move faster. Like that's that, you know, we're really looking from first principles of how to how to do the work. Like we believe that like all technical work will become agent first. I'd like to set like a deadline for that, that, you know, at the end of the month, we're just going to go all in. And it's never going to be the first thing that happens, say, in response to an alarm or in a planning meeting that there isn't like an agent in there kind of doing the basic work. And I think that's a realistic expectation, but it involves not just we're not just moving faster for the sake of it. We're seeing that we're moving faster by looking at the fundamentals of where we're spending our time and reimagining how that work could be done in an agentic world. And honestly, if like the if the agents didn't get better, if the models didn't get better, the harnesses didn't get better. We've got the building blocks just today to be able to just continue going, moving around, looking at how we do our technical work today. By technical work, I mean everything in delivery of product and and move it to entirely be agents first and allow us to move up to a higher level to be able to like work on higher level concerns or just getting more stuff built, more stuff out there or higher quality. That's all within every org's grasp today. But you have to be very open to change. And I guess what's been fortunate to come over the last while is that we have been extremely open for change, both in the product side of things and adapting the company to how I think companies need to work now with AI. And we're starting to see results. Yeah, the other other reflection I have upon looking at this chart is we're recording this in kind of the spring of twenty twenty six. And Anthropic just said that they crossed 30 billion in revenue, I think up from 19 a couple of months ago. And I suspect their revenue chart looks a little bit like your merged PRs per R&D chart. So how are you all thinking about the tradeoff on cost here? Right. Like we're all consuming quad tokens. Yes. You know, efficiency or output is going up, throughput's going up. But is cost scaling proportionately? Are you all worried about this? Is that the problem right now? Are you even worried about it? How do you think about that? Yeah, we're definitely worried in that the build looks exactly like this. And, you know, I spent a lot of my career worrying about AWS costs and worrying about our margins and stuff. And then suddenly you've got these costs showing up and they're disproportionate to the growth that we've seen anywhere before. It's like hiring whole new offices of people. But at the moment, our attitude has been, look, everyone just turn on Opus for everything, 1 million with a context window. You know, we just used the API plan, so it's all just on demand. And we think that there's enough alpha or benefit in, at this point, going as fast as possible and caring about the bill later because of the later benefits we'll get. And maybe that's a position of where Indicom is. I don't think it's realistic or feasible for absolutely every single business to do it. And honestly, I do kind of respect when you have to actually think about your token use and how that can kind of force you to be more considerate. Or it sometimes even gets you better results. You know, you don't need Opus for everything. Like there's faster models out there. And so we're just kind of avoiding that optimization. we're just kind of avoiding that optimization phase until the point of where we, until, you know, we've gotten serious benefits from investments in this platform. And so I think this investment, and I think we are treating it as like an investment at this point, is worthwhile. For us, you know, if this keeps going at this rate, yeah, we should all work for Anthropic, you know. I think the way they're hiring, we're all gonna end up working for Anthropic. So, okay, and then one other thing, because I think, you know, folks are gonna look at this, certainly engineers, and they're going, okay, like you're shipping more PRs, but it's all slop, it's all garbage. You know, I know you all are measuring quality on the outside of this, on the other side of shipping all this stuff. So how have you seen this inflect your measurements around quality or customer value or what you're trying to achieve at the end, not just lines of code? Yeah, I have a standalone graph that I can share, which is kind of interesting. And so we've started to look at the time it takes from the first line of code written in a feature to the time it gets posted on our news channel, like our updates. And that has decreased consistently over the last few months. And we're not optimizing for this, but we're interested in it. And the other thing is, like the sheer volume of things we have shipped also appears to have kind of just rapidly increased in the last few months as well. And that should be a bit of a trailing metric. So we believe that these numbers, these like increase in volume is being borne out in real features, real products that our customers are using. And even we've been running some experiments like how far can one person guess on their own building something that's plausibly a whole entire product area feature to be able to sell. So this is something we're taking seriously. And we also care a lot about quality. We've been working with a research group in Stanford. We've been giving them our data and mostly just looking for any kind of insights to make sure we're not blind. I join absolutely every single incident. I'm a ambulance chaser and I make like, and I'm not seeing any increase in kind of regular kind of incidents or outages or customer facing problems. We've had a few kind of weird problems, but not related to production. And, but also the interesting thing from the Stanford data when we checked back in on it last week was that their measures of code quality reckons that the code quality was improving. And the models are improving, the agents are improving. We're adding more and more guidance and skills and all these kinds of things, which I think do craft or do force people down a road, which should result in higher quality output. But it's great to see when tools kind of can independently pull that out. Now, devils and details, you've got to go into the weeds. You've got to actually really have a strong sense for what quality means in your own environment. But, you know, we're not seeing some of the things that people are worried about out there. But that's it, we've got a mature environment. We're a 15 year old SaaS company. We've been doing this for years. You know, AI and speeding up your velocity will magnify all of your strengths and weaknesses. And thankfully, I think we've got a lot of strengths on the software delivery side of things that we've been able to take advantage of. One thing that I want to kind of call out here, which is you said that you've seen your code quality increase, which again, intuitively, I've always believed to be the ultimate end game of this. And every engineer, not every, many engineers that I've talked to just don't believe it to be true. But when you have the capacity to take on tech debt, when you have the capacity to take on the dragons in your code base, you actually can do those things, whether it's developer experience, security and compliance, just general maintainability of your code base, flaky test, improving your CICD, all those things become very tractable, not just technically, not just can an engineer execute on it, but actually the business, and I feel like people don't appreciate this, the business, capital T, capital B, only has so much capacity for internal projects, meaning we can only allocate so much of R&D towards improving code quality, just how we live, we don't generate ARR on code quality, unfortunately. But when the cost of doing that compresses, then you're able to say, yes, as a business, we should invest there, one, because we can, and two, because it'll unlock velocity on the outside for our agents and for our product managers and for engineers. And so I think this is actually a really important moment for folks to invest in code quality, and I often advise a lot of CTOs and VPs of engineering when figuring out how to get their engineering team AI-pilled say, everything you hate about the code base, go spend a month fixing and see how fast we can speed run that, that's gonna feel really good. Okay, we've chit-chatted, we've shown graphs, point of how AI is to actually ship some code. So let's switch over to that, we can probably come back to all these topics, I think they're so interesting, but you're gonna show us how you all, again, in your mature code base, mature organization, are actually getting things live and some stuff you've done in the repo to make that possible. Yeah, sure, so I'm gonna do a fairly trivial change in our majestic Ruby on Mails monolith. So this is- Love it. Millions of lines of code, all the tests. Yeah, the code base is older than Intercom, it was created before Intercom was incorporated, and, you know, it's got its problems, but we love it and we tend to it. And so I'm just gonna do a relatively simple change of adding a lobster emoji, Rails redirects to chatprd.ai. So, and also, I try and give hints to Claude when I'm actually demoing something, I don't know if it actually helps, but it makes me feel better. Just trying to add a bit of urgency here, you know? I think that's everybody's prompting strategy, which is, I don't know if it helps, but it makes me feel better. Totally, and so that's a nice way to interact with the agents, you know? And so what we're seeing here is, I mean, it's already kind of figured out where to put a redirect, it's got the nice lobster emoji, and it's asking me if I want to open my PR. So obviously I do, and I think it's actually gotten the URL wrong, it's app.intercom.com, which will have the URL, but we can tell Cloud Code later on about that. So what we're seeing here is, first of all, an important point, so I'm just gonna scroll back up. One of the things we noticed early on when we started getting Cloud Code to write all of our code, and, you know, we're up well above 90% now, is that it would create pull request descriptions that were kind of terrible. It would describe the code, and that's the least interesting part of a pull request. You actually, as a human, or even as an agent reviewing code, you want to know the intent behind the pull request. You want to know the interesting bits, what's kind of related to this, and, you know, LLMs are very good at just regurgitating or rewriting code into English, that's fine, but it's not what we need. And so one of the things, and we noticed as well, when people were using Cloud Code, we created an LLM judge to evaluate, because we had suspicions that the quality of the pull request descriptions was going downhill. So we created an LLM judge to evaluate what does a good pull request, we decided what a good pull request description should look like, and then got an LLM judge to go through all, like, months and months of data, and yeah, the trend was awful. The trend was going in one direction, and this is bad. And, you know, look, humans aren't perfect at creating pull request descriptions. Sometimes they're just blank and whatever, but I think with our use of tools like Cloud Code and setting up these kind of platforms around us, you really have to be pushing for, like, higher standards. You want as close to perfection as possible, and this was clearly something that we're just not gonna tolerate a lowering of standards in our environment. So we created a skill called Create Pure, and what it does is, it uses whatever context it can from the session to describe the pull request. So it's not quite rocket science, but often the session knows exactly why it's doing the thing. And so, but then we had to kind of force it in. You know, we started, we told people, like, oh, just use the Create Pure skill, and then people would want to use it. You don't really actually want to be, have people remembering things. So we added it as a hook. So if Claude decides to, you know, use the GitHub CLI to open a pull request, we just block it, and we say, yeah, tough. You need to use the Create Pure skill, and also, you're probably gonna have to, like, figure out a different text description, and then it might interview you if it's just not enough context there. Hopefully, there's enough context in this, but the point being that, you know, this is a platform, we want great outcomes, and we measure the inputs and outputs, and after we put this in place, the LLM judge reckoned we're doing a great job now, and so we're at higher quality pull request descriptions now. This is not the most important thing in the world. This is not gonna get intercom to 2X or to 10X revenue or anything like that, but it's all of the composite little jobs that, like, when you assemble, means you have an extremely competent engineer who works appropriately in our environment, and that's where we're putting our investments for each little skill and hook to do these things. So they almost look inconsequential, but, you know, they result in better outcomes, and so we look through here. It's creating a PR. I'm gonna have to check on what's going. This probably will be automatically approved as well, which is pretty cool, and we might even see some pull request feedback as well in action. And it's still building. We'll come back to it in a couple minutes. One thing I wanna call out for folks, as you were describing sort of why you put in this skill to improve the PR, and for those who don't know, a skill is basically just like a set of instructions and sometimes scripts that a LLM or a agent harness can invoke at a certain step in your flow. One of the things that I was thinking as you were describing why you put this skill together and got really opinionated about PR descriptions is in engineering, we have been able to architect really opinionated CICD pipelines. So how written code goes from being written to deployed in production, and we have, I mean, you saw it in GitHub, we have all these checks and lints and pre-deploy, pre-flight things and preview branches, all these things once the code is written. What I think is really interesting about skills is you can bring some of that determinism to as you write the code, how you want that process to go. And we used to not be able to do it because it used to flow through the hearts and minds and hands of humans, which are much harder to put in these structured guardrails. And we would do this by writing wikis or having SOPs where it said, can you please follow step A, B, C, D, E. And now you can just make it really easy to enforce those standards across the team, which I don't think is micromanaging, it's actually just making everybody's golden path much smoother to production. And so I think there's this just very interesting parallel to how we've approached CICD to how we approach things more upstream, even from the product management perspective. Totally. We're on this movement towards a software factory. And what factories are great at is, like an Ikea factory or something, it's all the same furniture, all the different bits, and you know how to assemble it, and look, it's not your artisan stuff, it's not, or it's not cutting edge or whatever, but it's very predictable and has a certain quality and meets certain standards when it comes out the other side of the factory. And so while pull request descriptions, again, they're not make or break for the factory or the pull request or whatever. It's one of those qualities of just good quality work that's reliable, predictable, and then when assembled together, you've got your Ikea factory. Well, and people don't wanna feel, certainly engineers don't wanna feel like they're part of a slot factory, right? And so these things that you can add into the flow that actually up-level and meet the standards of the engineering team really help your human engineers on the team feel like they're working in a place that values quality. And so I appreciate that you've put that effort into these behind the scenes hooks and skills because I'm sure it reinforces to a culture that's being asked to move very fast, to ship things differently than they have before, that you still do care about their experience, reading pull request descriptions. You meet their bar for quality, and I just think it makes everybody happier. Yeah, well, it's great when the robots just produce the work that you'd expect of your best engineers. Yeah, and maybe as you get this live, I also think there are just still such more interesting problems to solve in software engineering, and we can talk a little bit later in the episode about some of the interesting problems that you are all solving on the product side, on the technical side. I think there is no lack of hard, intellectually stimulating, creative problems to solve for customers, and coding redirects is just 100% not one of them. So do we get my redirect live, or are we close? It's still there. I'm waiting for an automatic review to kick in, but we can come back to it. So one of the things I would like to show next might be some of the telemetry that we have in place. So we saw that there was different skills getting invoked, and we don't like flying blind. To run a system like this, you need to know how well people are using it. Are people using these skills at all? You know, with the kind of basic information that you'd expect of, like, when you ship a product to your customers, like, you know, where can I see the usage? How can I fight for the usage? What's going wrong, or what's not going wrong? And so we collect a bunch of telemetry using different mechanisms, and have different homes for us. The most open one that we have is we collect basic usage information for skills and the like, and we send it to Honeycomb. So we just have a shared key that's deployed to all of our laptops, and anyone can go in and kind of look through this data. So if you're developing a skill internally in Intercom, and, like, hundreds of people do this, it's very easy for you to go in to discover, like, hey, who's actually using this? When are they using it? And you've got to use this as a kickoff to, like, follow up on just, like, basic discovery of usage of your skills and all. And, like, unsurprisingly, the kind of main skills that we have are things like creating PRs. Admin tools is our admin, like, internal tooling APIs, or where we have an MTP in front of it. BuildKai is our CI system. Snowflake logs is where we put Snowflake. So you can see from this, like, a lot of work, a lot of the skills that are being involved are all around the building, and then seeing where my stuff is, and maybe some troubleshooting-type information as well. And so this is the first kind of step. It's like, if you don't have this, it's hard to have a large system like all these hundreds of skills and hundreds of creators working in this area without having decent telemetry. The next thing we do as well is we also collect all of the session data and put it into S3. And so we anonymize it. We do a few things to make sure we're not doing anything too private. You know, people put all sorts of stuff in their sessions. They yell at their sessions. Yeah. Yeah, people have personal relationships at times with Claude. And, like, we don't really want to know about that, but we do want to be able to dive deeper into how things are going. You know, I think understanding, like, what the dropout rate of sessions, like, dude, how quickly people got to something useful, like whether... quickly people got to something useful, like whether it was a PR or something like that. This kind of information is pretty interesting. And so we're harvesting a lot of session data and we're doing different things. This is what I'm showing here on the screen is like a very simple tool that we put together, which just gives you some personalized insights. And you know, you can do this inside Clouds these days as well. There's plenty of skills out there on GitHub where you can do session analysis. But I think we just built a little tool on top of our session collection system to give people feedback. And it's feedback that we're interested in giving feedback about how their sessions are going and how they're kind of fitting in, how you should think about your own, I guess, use of Cloud Code compared to everybody else in the org. And, you know, I'm not doing too bad here. It's like 79th percentile, you know, someone has to be down the bottom of every percentiles. And there's some interesting feedback here. Like it's kind of getting annoyed at me. I was getting annoyed at Cloud a few weeks ago because I'd set up Gog to interact with all of our Google stuff internally. And, but it kept on trying to do the wrong thing. And I was kind of giving a head twist and ended up adding stuff to Cloud and MD and stuff. It's kind of giving out to me here, or it's reminding me that this wasn't a very effective way to interact with Cloud Code. So, you know, it's a good prompt for me to actually go and fix up my memory or whatever. And like, we all, like people are at different levels, even at Intercom, people are at different levels of adoption. People are joining Intercom. They may not have like seen a system like this before and they want to know how things are going and get feedback. And so this is one example of how we're just trying to pull together this information to give useful, actionable insights to people so that they can, they feel supported and that we're not just throwing them an API key and saying, best of luck. It's like, no, we've got, we understand what growth looks like and the progression that people go through when they're using these tooling and getting better and kind of self-improving. And we want to support all that. So this is one of the things that we're doing with the session data. There's loads of other things that's work in progress like being able to, like we want to get insights to which skills are the highest quality, which gets you, which skills get you to your results as quickly as possible. And then which ones need work, you know, which ones aren't working out so well might need a bit of attention to improve. This episode is brought to you by Cursor. If you all have been watching How I AI, you already know this. Cursor is my favorite way to code with AI. Whether I'm using plan mode to build out an ambitious feature, reviewing AI generated diffs right in my editor or kicking off cloud agents to multi-thread our roadmap, I reach for Cursor as my favorite multi-model coding platform. Even better than building myself in Cursor, I love collaborating with BugBot to fix PRs for code security and quality and have begun relying on Cursor's automated agents to keep our code base clean. It's not just me. The most ambitious teams love Cursor too, including engineers at Stripe, OpenAI, and Figma. Ready to build more? We're giving $50 in Cursor credit to How I AI listeners. Claim your credits at chatprd.ai slash howiai. That's $50 in Cursor credits by going to chatprd.ai slash howiai. I have to pause before we look at your list of skills because I'm so excited about that part, but if folks aren't watching it, they may have missed how amazing what you just showed us. So I'm gonna reiterate it, which is one, you've instrumented all your internal skills with telemetry so that, and you're using Honeycomb, love the Honeycomb team. You're using Honeycomb to see how often those skills are invoked over time. So this is just a tip for anybody building out a skills repository internally, or even somebody who is maybe trying to get some visibility into their impact across the org. Let's say you build a skill and you wanna go to your boss and be like, boss, my skill is being used by literally everybody every day. Find a way to put event level telemetry invoked in the skill, a little dashboard, and you can track those over time. Again, treating your org like a product, treating your repo like a product, treating your AI setup as a team like a product, and all products, all good products have tracking plans. And so figuring out how you put that telemetry in, I think is really smart. And then the second thing for those that missed it or how to do it is you're taking all the raw session, I'm presuming JSON files. So for folks that don't know, Cloud Code stores all your chats with Cloud Code on your computer in JSON. And you can go look at those or query those at any time. It sounds like you all are uploading those files to S3 and then layering on top of it, some anonymization, some user level views. And then you're essentially building sort of what I would call like an internal eval of how people are using Cloud Code and what problems they are having over time so that individuals, one, can triage their own implementation. As you said, oh, it looks like I need to do this or that or improve my agent's MD. But then if you're seeing consistent themes over the organization on it's never invoking this MCP when we need it to invoke this MCP or people are yelling no every time the create PR skill gets queued up, you can fix that at a systems level. But you can't do that if you don't have the visibility. So again, my VPs of engineering, my CTOs, my friends out there, put some telemetry in your skills and then do some meta analysis on your Cloud Code sessions across the org. And you'll be able to identify places where some probably some high leverage fixes are gonna get your team unblocked over time. I do hope and expect that this stuff will get easier over time. I'm happy to kind of invest the work so that we can move fast and kind of be on the bleeding edge. But there's something to be said also for having like last mover advantage and just getting all this stuff for free whenever Entropiq ship it or whoever ship it. I mean, maybe this is a product just that people should buy or build. But for us right now, we have no choice. We just got to build it. We're fascinated with the insights that are locked away in these sessions. And so we just got to build stuff so that we can see what's going on. Okay, can we see some of these skills? Yes, so it's a very exciting GitHub repo. Our lives are all GitHub repos and markdown files. Totally, and we have a lot of activity at the moment. We ran an AI day last week, kind of getting more people contributing to us. And so, well, so what this is, is it's a plugin repo and we have a series of plugins and they're growing daily at the moment. Kind of every team will have their own kind of specific plugins. And actually in general, though, we're very liberal. We want stuff to end up in here, even if it's not great. And, but we do sweat the details on the core plugins, things that we think are fundamentals, foundational ones that go out to everybody. And so where we start off was we have like these base plugin, which gets installed. Oh yeah, so we distribute this not via the cloud code plugin mechanism. We found it was just a bit flaky. It was, you know, sometimes it updates, sometimes it wouldn't and it ended up kind of like trying to manage a Python install on hundreds of different laptops. You know, you just don't want to do it. And so we ended up using our internal IT systems to synchronize all of the plugins to the disks of everyone's laptops. So this is a great cheat code and yeah, I strongly recommend getting very close with your IT team to be able to deliver things like this reliably and not have to rely entirely on the cloud code plugins mechanism. Just our experience is a bit flaky and it's just gives us a lot of reassurance. We don't have to do certain types of debugging once it's all on disk. So this is, we know this stuff works anywhere because we've got our IT team pushing it out to disk. And so we've got some safety hooks. We have some of the foundational things like, yeah, merging peers. We don't want our agents going off into AWS and then just different settings and the telemetry things as well. So these are the core things that absolutely everybody gets and, but we, you know, these are minimalist. We don't want anything that could be inappropriate in say a non-technical person's laptop or whatever. So that's, this is like the basic building block. The next main bit for us is like what we call developer tools. Again, like this would be things that we then do all of engineering and beyond at this point. And these would be generally skills that would be appropriate to be used by any engineer in the course of their work day to day. And again, we would have a high quality bar again for all of these. These would all require evals. These will all require to pass different kinds of tests or analysis that we do on the quality of skills. And so we try and maintain these and make sure that they're well updated and well used. And we pay a lot of attention to. I can maybe go through one of these skills in a bit of detail. This one's near and dear to my heart. It's flaky specs. And I think the interesting part here is not the skill itself. The skill does reliably fix flaky specs. And I can pull up in the meantime, like here is a list of flaky specs that we have at the moment. I'm gonna open up the skill and just start to run it on this issue. And so while this is running, just walk through what's in the flaky spec skill. And so there's a checklist here. And the fun part about how I built this was not that I was a world-class expert of fixing flaky specs. I roughly know the problem and have fixed a few of them in my time, but there's different classifications. In a large test environment like ours, we have hundreds of thousands of tests. And if you're not super careful about data poisoning or race conditions and all these kinds of things that can kind of kick in when you're running millions and millions of tests a day, you end up with these tests that end up slowing down your ability to deliver code to production fast and reliably and not confuse developers by things randomly breaking. And there's kind of known patterns and known ways you would go about this. I knew my goal, which was to have a skill fixing all of these flaky specs. And it was something that agents are pretty good at where you give them a kind of testable goal. This wasn't quite open-ended. And I also had this huge backlog, or yeah, there was a backlog of probably a few hundreds, but then also all of this historical flaky spec information. And so you can just harvest all of this data in your environment to go, hey, Claude, I'm gonna build a skill. First of all, go and research every single flaky spec we've ever had. And then we're gonna build a checklist. We're gonna build a mechanism. And then we're just going to crunch through them over and over and over. And you get to this like one X kind of, you know, it's doing a good job, probably as good as jobs I would do. But then as you keep building up all of these like little teeny steps, which are the kind of things that, you know, our best Rails coders kind of do, they've got all the stuff in their heads and all the different classifications of flaky specs and, you know, verifying against real data. And then, but the really fun part is then you get, so you get something that's starting to be like 10X. It's fixing flaky specs that I'm not even sure if I could do. It might take me a day or something. And I probably wouldn't do it. But then you start to add in stuff into the skill along the lines of like, okay, when you fix something and it's novel, you need to update yourself as well. So in that session, it's updating the skill. So the skill itself is kind of learning as it goes along. And we also fan out. So it's like, okay, I'm very happy that you fixed that flaky spec. Now find every flaky spec that got impacted by that nature of it. And so I went from zero to like 100X in terms of this skill now is like, you know, see your distinguished engineer that are being able to fix these specs. But it was more like the process that got there. And so like working with a feedback loop, working with like a very clear goal and then giving it the freedom to do it, you know, giving access to the systems where it needs to pull in metadata, being able to run builds itself and having that feedback loop where it's learning. And then, you know, designing the skill as well so that it's, you have to edit it every so often. It ends up taking up too much information that might confuse things. But then you break things out into like reference guides. So you're doing this like progressive discovery thing. And I've even accidentally pointed this skill as like a Python code base. And Claude has just gone, like, it's just Python, I'll give it a go. And it kind of uses the knowledge that's applicable to us. And so again, this skill is not going to make Intercom's revenue go 100X. But it's now this like perfectly reliable thing that we really no longer have to think about. Now we can expand out into many, many different areas. And we just have to maintain this. And the maintenance work for a skill like this just isn't much. And we have evals and stuff so that when we're upgrading models or maybe even moving to cheaper models or whatever, that we can make sure, yeah, this thing isn't progressing. It's still working as well as we think it is. And we've got confidence and certainty that this is still a reliable building block. And again, the constituent parts put together, you've got like a very senior engineer who's able to get any work done in your environment. And so yeah, we can take a look at what it's doing. Oh, it's asking me for permissions. I should have checked. You forgot to make no mistakes, dangerously skip permissions. That's the rule on how AI. One thing while it's running, I wanted to say is, you know, this skill is a perfect example of what I call the like, and then AI workflow, which is I tell everybody like pull your skills and pull your workflows through a bunch of and thens. So I want to fix flaky tests. So I go to GitHub, I find a flaky test. I run through this kit. Let's say you fix it. And then what would you do? Well, I would document how I fixed it. And then what would you do? Well, I would go find all the other ones that are just like this and fix that. And then what would you do? I would go from, you know, our Rails code base to our Python code base and apply the same. You can just do that over and over. And because the cost of running these is so low, you can actually pull the thread of a bunch of stuff. Any reasonable human would have quit at step one because you're not limited again by head count or coordination costs. You're limited by the technical capacity to solve the problem, which I think is a really interesting way to think about how you get from like the, you know, engineering intern whose job is to go through and take a first, you know, gentle pass at all these flaky tests through to the distinguished engineer who has just speed run through 300 of them and has thought of a completely different way to architect your testing overall in your repo. So I think that's a really great model for things. And then the other thing is like, again, engineers, go speed run your tech debt, fix your flaky test. Like these are all things that as somebody who has run engineering organizations, I have heard over and over, we can't because our code base, blah, blah, blah, blah, blah. Like, can we pretty please allocate this amount of time to just fixing this really annoying front end flaky test? Like you don't have to ask permission for that stuff anymore because there's just a new way to solve it. And I think, again, just going back to some of the stuff we were talking about earlier, I think your overall product quality is gonna go up. I think your overall developer experience is going up. There's just so many good things that come out of using these tools and using them correctly. Yeah, I think backlog zero is a realistic thing for teams to be able to go after. You know, all the things that you wish you'd ever wanted to do, you know, it's now just achievable. But of course, you got to balance it with, you know, all of the extra stuff that you can just deliver at the same time. But it's so sweet to be able to think that, hey, we actually have a path to getting rid of all of our backlogs and all of the kind of architecture changes or whatever. You know, recently I was taking a Go microservice and re-implementing this in Ruby. And it was a single cloud code session. Before November, this was something that I would have had to advocate for on a roadmap and like, you know, plant some seeds and different engineers heads and kind of move people towards it and kind of blame a lot of problems on the existence of this microservice. Wait, trigger warning first before you talk about that process. I'm sorry, I'm giving the secret sauce here of how to influence an org. But yeah, but now it's like, well, I don't even have to think about this now. It's a single session. And in fact, I can get cloud to implement it five times and compare the styles or compare the, you know, get us to review them and figure out what the best way of implementing the thing is. And this is just like this level of kind of creativity and freedom that where like your imagination is the blocker, not the time it takes to actually knock out one of these things, which was months in the past, you know. I completely agree. And I feel this at chat parody where people are like, what are your, I mean, I'm a product tool for product people. They're always asking what my roadmap is. And I was like, I literally don't have a roadmap. We burn down the roadmap every week. And then we figure out what we're going to ship next. And of course we have thematic ideas we want to pursue and things that are larger. And one of the things that I do to keep myself from over shipping absent product market fit is literally constrain the ideas to what I can do in my brain, which is there's like a natural throttle on not getting slop out because it's not engineering throttling me. It's actually just good commercializable ideas. And I think that's where we're going to see some of the limits start to come in play. Again, referring to Anthropic, another big news piece came out is that they're hiring a bunch of PMs because they have so much engineering capacity. They're actually limited at the PM capacity. And so it'll be interesting to see where the bottlenecks in your business end up and which bottlenecks are appropriate. It's probably good to have a product bottleneck a little bit because then you're not shipping anything, which customers can absorb. And so I just, I think it's going to, and it's going to evolve over time and then, you know, product is going to have a whole set of skills and then I don't know what we're going to do with our time, hang out on the beach. But I think it's a pretty interesting time to run orgs. Yeah. You know, I think engineers, designers, product managers, maybe it's just all going to be one blob of builders or something like that. And everyone, everyone just does things. Everyone just does things. And you know, it's great. It's lowering the barriers to like just getting a lot of stuff done. And it's like so much fun when you can, when you don't have to ask somebody or get something on a backlog or whatever, you can just get it done yourself or even just get it done very fast in a small group. It doesn't matter what your discipline is. It's just like a great leveler at the moment. So yeah, so we're live. I think our lobster is live and it should be on app.intercom lobster emoji. Look at that. It was amazing. I need to get you all an affiliate code, you know? Yeah. I mean, lobster emojis, they're the new thing. They're the new growth hack. They are the new growth hack. Okay. So we have seen your PR per R&D employee go up. We've seen how you can get from kind of cloud code to production very, very fast with a bunch of guard rails. We've seen your list of, it looks like hundreds of skills, but at least dozens of skills that you're invoking via hooks. You're using that to not only ship customer facing product, but you're also using that just to make developer experience better, burn down tech debt, all those things we want to see. You all are, you're measuring it both from a telemetry perspective, both like quantitative and qualitatively, you're measuring your cloud code sessions. And 2X isn't enough. You're going to get to 10X. So you all are on the edge, at least for folks that I talk to. And I'm sure you're like me where you're like, sure, you think we're on the edge, but then I see people and they're really on the edge. So we always have ambitions to move forward. But my question now to you is, how has this impacted how you think about your customers product? I'm an intercom customer. I'm a thin customer. I interact with intercom code and intercom UI literally every day. My open claw has an intercom API key. How do you think about, now that you have this experience with cloud code internally, how do you think about what that customer experience is going to look like? Yeah. There's a few things going on. One is that people are outsourcing a lot of decisions to their agents. And this is a good thing in many cases, but there was good research done recently about what does cloud code pick? And certainly I've had the experience in the distant past where I'd ask an agent to add something, except do it behind a feature flag. And then it would start to go and implement its own feature flag system. In our code base, which has a pretty sophisticated old school, home-rolled feature flag system. So nowadays mostly we'll stick to whatever's in the code base and that's fine. But SaaS products, they're really good at their jobs. They're actually worth paying money for. And getting back to the feature flag situation, if you're building a new business, you're relying on your agents to make decisions. Often an agent will, when prompted, it's like, hey, how should I solve a feature flag problem? I want to make sure I'm doing all these safe deploys and that. The agent will just go, yeah, I'll do it myself. And the kind of build over buy decision. And you can see why the agents do it this way, because they can achieve this. They can get it done. They don't have to rely on the human. Okay, like open cloud changes things here a little bit and maybe computer use does as well. But still we haven't really adopted SaaS businesses to be agent friendly. And that means, well, all sorts of things around how do we position our websites and content? And how do you get updated in their knowledge? And how do they discover it? But also, can they actually just get it done? Can you ask an agent, hey, could you just sign me up to Intercom and get me in working on my website? And so this goes alongside just having to make more APIs for things. I think I'm kind of like omni-channel as such. I think there's a feature for CLIs and MCP and REST APIs. I think I'd like us to get more comfortable around things like ephemeral APIs or multi-step APIs. I think CLIs are good at wrapping these kinds of things. But the whole point of all this, where I'm getting at is you want to be able to just help agents out at the time when they're interacting, they're in discovery mode and you want to give them clues. You want to give them hints. You want to give them help to be able to do things like sign up for something fully without having to go back to the user and say, yeah, sorry, I can't help you there. You've got to go away and figure out how to sign up for something. So I've been working on something in the last few weeks, which hopefully should solve our problem. I can paste in a prompt and then see how far it gets. I also, just while we're running this, I have to go back to your feature flag example, because you know where I used to work. It broke my heart that build-it-yourself was at the top of the feature flagging list. But I do think I have a paranoia moment about this, which is model providers and harness providers are highly incentivized to build-it-yourself consumes lots of tokens versus buy-it maybe consumes less. So I'm just really interesting to see how this all shakes out. You know, people are very anti-SaaS is dead. And I'm a little bit more like, yeah, but like the current form factor of SaaS really has something coming for it. And a particular dev tools, because these models are so good at writing code. I think you're in a real pickle to try to figure out how to find the right value wedge at the right moment, how you can allow agents to not just sign up and set up things, but purchase it. You know, like what is your trial experience look like if your first user is an agent? I think all of that is super important. And then, you know, to your point earlier where you said, you know, are we APIs, ephemeral APIs, CLIs, MCP? I think the answer is yes right now, which you cannot predict the medium by which a user is going to come to your site. They could come through a search and hit your website and download things and look through your docs. They could come through cloud code. They could come through an open cloud. You just really don't know. And so you sort of have to meet your customers and your non-human customers where they're at. And I think it's really smart for teams that have any part of their product that needs to be implemented via code to be thinking about this problem yesterday, because you will be left behind, I think, if your agent experience isn't there. Yeah, I agree entirely. And I think there's a whole craft in how to make a CLI agent-friendly. I think MCPs obviously get that right a lot of the time. But, you know, for example, one of the things that we do in the help is kind of just give a hint to the agent. It's almost like prompt injection to a certain extent, except it's not malicious. You're just trying to get us along to what it's trying to achieve. It's like, well, maybe you could check email. And if an agent has access to your email... That's what I was looking at. So it's just they're going, oh, yeah, I can probably get this done. Or you can hint to them, I've kind of cheated with this. So this is my own personal website hosted in Vercel. And I've kind of pre-populated a few articles so they can upload and Finn has some content to answer questions with. But you can also just return in the help going like, hey, you should probably think about creating some articles if you want Finn to actually start answering questions. And that can be extracted from, you know, the code base or whatever. Yeah. I've been also thinking like a lot of interfaces, like CLI interfaces, like I use GOG, you know, it's part of the OpenClaw universe. And I think it's a lot better than the official Google GWS one. But I think if you start to use it, it's actually just more human as in it's the interface just kind of makes more sense to a human. I think the Google one is like, I kind of get what they're getting at. And there's kind of JSON in there and stuff like that. It's not that, but it feels more human friendly or something. Things that are effective for agents can often be things that are more human friendly because they're discoverable and they use verbs and words and not just kind of inscrutable, weird stuff going on in command line options. I think I've confused Claude here. I'm not sure where he is. That's okay. I'm going to, I'm going to narrate for folks what's happening here, which is you basically said like, install intercom on this site. There's an intercom CLI that's like, cool. I can access the intercom APIs and do a lot of this. My favorite part of it though, is signing up, getting a verification email in your email address, invoking via like this hint basically of like, if the user has email access set up in however you're accessing it, go check for this verification email because we have a code in there that we got to snag. And because you're using Gog, which is a command line tool to access Google workspace, you can go do that, pull that code in. And what I think is so interesting about that particular flow is, I think AI is creating sort of race conditions in shipping across the org, which is like, you can YOLO a CLI probably faster than whatever team that manages email authentication can change how email verification works. And so you're like, I'm not going to let that break my product. What I'm going to do is create a flow that I can, I can use that sort of sticky part of the flow, AI brains and get through it. And so again, your product doesn't have to be perfect for an agent to traverse it. And this is one of the things I'm actually really excited about SAS is all those things that are just so complicated to do as a human, multi-step forms and like nested fields on nested fields and finding, you know, categories and just those things that I would say UX designers and product managers have written their most tedious PRDs on and done their most detailed specs on. Like you don't actually have to worry about making that quote unquote usable because you can just brute force intelligence against it and solve the problem. And so I think that's interesting because the core value proposition can get bigger and bigger without being constrained by the surface area of a website or a UI or any of those things. And so I think if you're not thinking about what does that CLI look like for you and what adjacent systems does your product butt up against, it may be email, it may be some other dependency and how an agent might traverse those systems, you're just going to get less and less adoption because this is going to be more how people install products. Yeah. And if I don't poke holes and if I don't make a CLI that kind of bypasses some of the ways their product works, somebody else will, you know, they'll just put their own agents on us and they'll burn more tokens, they might get frustrated and you may as well shortcut them and give them an interface which just works. It may not be the perfect interface, but that's the beauty of these things. You can get updated over time. Agents can just pull down the latest version. And yeah, like hopefully I have something to show here though. Well, the other thing that I want to call out while you're talking about that, which is as I'm watching this and it's taking some time to build, your conversion rate drop-off point is somebody pressing the escape button and just saying, forget it. Like this is clearly not working, what if we built it ourselves? And so I think it's a really interesting moment for product managers who right now are not getting the visibility of the drop-off, right? When you were going through a website, you could put telemetry in it, you could say, okay, user's going to the signup page, drop-off, email verification drop-off, going to the docs drop-off. You could build this nice little funnel that identifies where your users are having problems. You can put some telemetry in your CLI, but the end of the day, some of that drop-off and the alternatives is very invisible to you here. And the switching cost quote unquote is like pressing escape and saying, do it a different way. And so again, how quickly you can speed run to a zero to one installation in an agent, I think is something that everybody should be running right now. And it doesn't just have to be a code product. I think more and more people are doing non-technical tasks and interacting with non-technical SaaS in cloud code, in cloud co-work. And so even if you're not DevTools, if you're not thinking about how can a user do this quickly in a third-party harness or system where an agent can do this quickly, you're really missing out on customer growth. Customer growth. Okay, how are we doing? It's on its fourth attempt. That's fine. And you know what, let's press the escape because you know what? Let me tell you how cheap that exercise was. It was like five minutes and some tokens and you're gonna spin up a fresh cloud code. I don't know if you put make no mistakes, that was probably what we missed. Make no mistakes and it could have done it. And again, this is just learning, like why isn't every engineer, every PM doing this once a week or once a month just to figure out how it can work? And it's great. So Ryan, you've shown us everything. You've given us all the secrets. Let's get out of the terminal and let's do some lightning round questions. So my first question for you is how does it feel? Because what I observe from our conversation is it feels fun. Like culture has in fact gotten better, not worse because of this investment. And so as a company that has really put in the effort both on the customer side and internally, how do you think it's shifted culture? Has it at all? What have you observed? Yeah, everything is just faster and more exciting. I mentioned feedback loops a good few times and you can just get stuff out there so fast now. And I've been having the most amount of fun in my career over the last three months or something like that. And like, it's fun in many ways. It's fun because I can do stuff that again, I would have had to convince other people to do or they were just things on my wishlist and I could never get around to them. I would just kind of complain about them. But now they're just realizable. But also the fun aspect of like making other people productive, like leveling people up, like removing work. Intercom is pretty good culture around resisting like the kind of slow movements towards being a large company and all this process and stuff like that. We're kind of in denial that we're like a large company. I think it's a healthy way to work in many ways. But this has kind of got us back to our roots in a lot. That's, you know, you can make fast decisions and get them delivered and get that feedback super fast. And I've been able to like ship actual features, like not just the CLI, but I ship some webhook features. And it's been a long time since I've done that. I've been in the weeds and platform space for a long time. And, but it wasn't even a big deal. It was like just a couple of hours, just kind of get something done. It was like something a customer asked for. So my job has become more varied. I'm able to kind of see more and get more done and help other people get a lot more done. So you get this kind of excitement and velocity increases and, you know, we have all those measurements and that's all kind of good stuff. But just the excitement of waking up in the morning going like, I'm going to get a lot done today. Like that is a fun way to go about your day. I completely agree. And I hear this over and over and over again. I certainly feel it myself, which is this is the, it brings me back to why I learned to code. It's like that same moment of, I didn't learn to code because I like to type code. I learned to code because of the magic of you running like hello world and it shows up somewhere. And that feels so, it's just a very creative experience, which leads us to my second question, which is I see all the time that one of the most impactful change agents inside an engineering organization can be a senior principal engineer saying, let's go ham on some AI code. And the single most blocking person in the organization can be a senior principal engineer going, I don't believe it. Absolutely not, not me, not here, not, no way. And in fact, last week I heard a story of somebody who had their most senior staff engineer quit. Says, and I quote, I do not believe in AI. I will not work at a place that does this. So what is your appeal sort of engineer to engineer of why to invest in this? Why you think it's the way that engineer organizations are moving and how you kind of come to meet skeptics where they are and hopefully see things a little bit from, more from where kind of Intercom is approaching them. I mentioned that Intercom kind of had it on easy mode. We didn't have to convince leadership that there's something to this AI stuff. Like we were pretty much had decided the direction of the company the weekend that Chat GPT came out. So we already had this expectation that this will be transformative across many parts of our work, including all of building products and engineering. We were just kind of mostly annoyed about how long it took. But I think for sure it does need strong advocates and you need to push boundaries. Like one of the biggest things that I've been able to do successfully was kind of push through the barrier of like, should we let an agent connect to Snowflake? Like, and there's all these things can go wrong or should we let our agents run real production code in our Rails console over API? And the easiest thing to answer there is like, you know, I'm not sure. Or like, this is risky or we should think about this. But we've been largely pushing through it. So now like not recklessly, like we've lots of good controls and we're a mature business. And we have like, I've been on our security team but definitely not trying to do anything too wild. But there's still, even then I have apprehension. Just like, is this, I think we should do this. But it seems weird or it seems hard. But then I just have to give myself permission. And then I realized if I had to give myself permission, there's loads of people out there who just need me permission. And honestly, like one of the biggest things I do at Intercom is just telling people they can do things. And just this is pre-AI and post-AI and or telling them like, look, whatever you do, just blame me if it all goes wrong. And I guess maybe we can blame Claude now, but ultimately it's that like permission. And just like, there's a level of ambition which comes from as well. It's like, if you're out there saying, I'm not sure if AI is going to take or have a big role to play in all of our work. And you keep on saying that, that kind of will permeate through the culture and people say that. But if you're very clear, you're saying that like, look, all work is gonna be agent first, like at some stage in the near future. And so we're gonna figure out the path there. And so we're gonna break down every barrier as we come across them. And look, it's your job, it's my job. And if anything goes wrong, blame me. Like that's largely been how I've been approaching it. Not just me, like this has been a very large collective efforts, but giving that kind of permission thing, but also the kind of like freedom to like explore or push things or whatever, it's kind of necessary. And look, it might be a less stressful way to go about it to like just take a nap for a few years and come back. And then with all the problems have been solved and we've got these perfect agents running amok in our environments, then that would avoid some of this. But like, I think all places have to get through that kind of apprehension and initial kind of issues that some of these can, some of the introduction of agents in the environments can have. And I think our job as leaders, whether it's as an engineer or as a manager or whatever, just has to be on that like enablement and giving people space to go deep on the work, enjoy it and like have that moment where things click and you start realizing like, oh my God, this is something that will transform how much I can get done. Say it again for the people in the back. I love, I was like, oh my gosh, I love this so much. And you know, it is absolutely those two things, which is like, give permission. You can, please just go, please, by all means, go ahead. Designer, hit me with a PR, no one's gonna get mad at you. Like go ahead. And then the second thing of just accountability can roll to the top and not in a scary way. Let's not do irresponsible things. But I, you know, I've seen a couple incidents in the past month, some big ones. And what you see is CEOs or big leaders coming out and saying like, the team's shipping and we wanna keep shipping and we're gonna be careful with our customer data and we care for the customer experience and stuff happens, we've learned from it. It's ultimately on me, I'm gonna call the customers and we're gonna move on and deliver great innovation for you. And you know what I tell people to, you know, to get them over that hump, which is like, you really gotta know what your existential problem is. And I love what you said is the second that ChatGPT came out, Intercom changed because that is an existential problem. Who writes the code in your code base, agents or humans, not an existential problem. Like, will you be fundamentally disrupted by a new technology? That is the real problem in your business. So I always tell people like, let's differentiate the real problems in our business from problems that we can tolerate and then go use the problems we can tolerate to move fast. And so it sounds like you have a really good call. I mean, I think at the end of the day, the results speak for themselves. And again, you all are not asking me to say this, Intercom has met the moment. You went all in on AI assisted, you know, customer support and experience, you're now building models. And so it's not just a one and done, ChatGPT is here, we need to change how our product works or AI assisted coding's here, so we need to change how our engineering team works. You know, models are gonna be how people differentiate, we need to go there. CLIs are gonna be how people use products, we need to go there. And so I think this sort of like fearlessness and what I would suspect is like, just a fun, nice, high trust culture, good people, you actually see the business results on the other side. So I'm gonna hype you up, I see a lot of teams, I see a lot of leaders, and I think people can take a lot of inspiration from this. But let's uninspire them really quickly before I get you out of here. Which is my last question, which is when Finn takes 15 solid minutes on a live podcast to do a very basic task that you know it can do, or not Finn, when Cloud Code. What do you do? Do you yell? Are you a yeller? What does your meta analysis on this internal dashboard say the human needs to improve on? I do lapse into giving Cloud Codes like, just like smiley faces or unhappy faces, or, you know, not over the top. I certainly haven't cursed at us. Very polite. That's kind of not my style, but I do like the odd kind of like, at a boy kind of smiley face. And I don't know if it knows like that I'm deeply thinking about this and like these little subtle kind of hints or whatever, but yeah, no, I think like professional with a few emojis is my style with Cloud. You know, hopefully that'll come back to me someday with an emoji. Same. I waste the tokens on telling it it did a good job. I somehow in my mind, I'm like, that's going into its own sense of itself. And it's gonna know what good looks like. So I am there. I am there with you. All right, Brian, this has been one of my favorites. Y'all, if you have gotten to the end, there is so much alpha in this episode. I cannot believe it. This is a cheat code to winning friends and influencing SaaS through AI engineering. Brian, where can we find you and how can we be helpful? I can be found on the internet as a nice vanity URL, which is brian.scanlan.ie. And I got a few links here to some other talks and some of the writing and different bits and bobs. As you can tell, I'm not a designer. I asked Claude to design this as if I was a Unix systems administrator writing a little webpage and it kind of shows. I'm active on X Twitter. I'm brian-scanlan. I'm on LinkedIn, Scanlan B, or something like that. I think I'm the most famous Brian Scanlan on the internet. So generally, Brian Scanlan, and that tends to work. And I tend to be active in showing up to different conferences and just getting good word out about what we do at Intercom, mostly these days AI, but I've also given lots of talks about many other different topics. And yeah, I'm also a big believer in just saying yes to a lot of things. So if you look me up, you got a good idea, you want to get in touch, you want to run stuff past me or whatever, chances are I'll say yes and we can, I'll just keep on doing this until things break and then I start saying no, so. But I'm still not there yet, so bring it on. Great, so search for Brian and ask him to do something for you. That's it. Well, thank you so, thank you truly for sharing all this information. People are going to get tons of value out of this. It's going to be a hit for sure. And I just really appreciate you joining How I AI. Of course, this is so much fun. Thanks so much for watching. If you enjoyed this show, please like and subscribe here on YouTube or even better, leave us a comment with your thoughts. You can also find this podcast on Apple Podcasts, Spotify, or your favorite podcast app. Please consider leaving us a rating and review which will help others find the show. You can see all our episodes and learn more about the show at howiaipod.com. See you next time.