# Scaling Engineering Velocity through Agentic AI: Intercom's Framework

**Podcast:** How I AI
**Published:** 2026-04-20

## Transcript

suddenly you started realizing that you have to think bigger about things or that your imagination is now the barrier not the tool how is this not happening in your organization like literally the physical limits of my ability to type code are unlocked by ai today we are seeing twice the number of throughput as we did compared to nine months ago on our engineering team now it's like why can't it be tax this is a little bit more of what my instinct tells me is possible which is if you go all in if you prepare your team if you prepare your code base i think your overall product quality is going to go up i think your overall developer experience is going up there's just so many good things that come out of using these tools and using them correctly backlog zero is a realistic thing for teams to be able to go after all the things that you wish you would ever wanted to do it's now just achievable i often advise a lot of ctos and vps of engineering when figuring out how to get their engineering team ai pilled say everything you hate about the code base go spend a month fixing and see how fast we can speed run that that going to feel really good i've been having the most amount of fun in my career over the last three months welcome back to how iai i'm claire veau product leader and ai obsessive here on a mission to help you build better with these new tools today i am showing how intercom 2x the number of prs that their r d department is shipping in just a few months brian scanlon is a senior principal engineer at intercom and he is going to show us truly all of their secrets to getting a large product and engineering organization cooking on cloud code.
Let's get to it.
This episode is brought to you by Celigo.
Every company today wants AI to improve how work gets done.
The fastest way is building it directly into everyday business processes, automating employee onboarding, keeping customer data accurate, managing orders and inventory, or resolving finance and operations issues.
When AI lives inside the flow of work, it can update records, trigger approvals, route work, and kick off the next step across systems.
That's how teams operationalize AI and deliver measurable results.
Soligo makes this possible.
And now, with Soligo Aura, it's never been easier.
Soligo Aura gives you access to the entire platform through natural language, connecting your systems and turning intent into action.
All of it under your control.
Companies like Databricks, PayPal, and Olipop rely on Soligo to run critical business operations at scale.
Ready to operationalize AI?
Visit soligo.com slash howiai.
That's C-E-L-I-G-O dot com slash howiai.
Brian, welcome to How I AI.
Why I am so thrilled that you agreed to join the podcast is I think Intercom has done it.
Which is you all have met the moment in sort of two ways.
One, clearly met the moment from a product perspective.
We're one of the first companies that had, sorry, I don't want to say legacy business, but had a going concern business that saw AI coming and really transformed how your product worked for customers.
And I'm a happy Finn customer.
They did not tell me to say that.
And then second, what we're going to talk about is the team at the moment in terms of really understanding.
AI was going to change how, in particular, product engineering and design orgs and engineering organizations were going to work.
And you just went full speed at changing how the team works.
What drove sort of the urgency around meeting the moment?
How did that come to be?
Was it a single person?
Was it everybody?
What was your experience?
I think in some ways it's been...
the easiest place to be driving out the adoption of AI in engineering and product because we've focused the company so much on our folks and product on adopting AI and being AI first and how we think about the product, future customer support and all that.
And we also had very clear expectations.
We've seen what's possible in the product space and It's just very clear and obvious to us as like connoisseurs of AI.
It's like, this is clearly going to be huge in engineering and product and building.
And honestly, there's been a lot of impatience for like, why isn't this happening today?
You know, if we go back a few years and cursor is picking up a bit of business and the models are getting better, but it still wasn't transformative.
It still wasn't like.
the whole business was changed and we're seeing vast amounts of extra productivity.
We knew there was potential, but it still felt like we needed to have some sort of breakthrough moments or something big had to happen for us to get to the kind of huge velocity wins that I think now we're starting to achieve.
That said, we still want more.
We're proud of where we're at, but we're not content with what we've achieved so far.
I feel like every three months I have a breakthrough moment.
And in fact, I feel like Opus 4.6, I don't know, something just like really inflected in what was possible when that particular model came out.
Now I think the GPT-5.4 models are also exceptional.
And so it was something about that one moment with models that really inflected my own personal use of AI and engineering.
Did you all see the same sort of inflection around that model?
point totally i think you can go back to it's like november december last year and suddenly you started realizing that you're you have to think bigger about things or that your imagination is now the barrier not the tool you're spending less time massaging the tool to get it to the right place um and it's less about autocompletes and more about just literally giving us your ideas and seeing what happens uh i think the christmas break happened as well i remember we pretty much decided before Christmas, like, hey, we're going to go all in and clog code.
Because up to that point, there was a bit of cursor here and there and augmenting different tools.
And the Christmas break really helped.
I just saw everybody go wild on Twitter X, you know, that people were talking about how this was, like, they were getting so much done and they were building all these things.
I just come back to work after Christmas break going, like, okay, everything's changed.
Like, we knew that there was something here and that we were starting to see the signs of us.
But now, The whole world is convinced, or at least all of the influencers on Twitter.
That would be me.
Yeah, I'm actually kind of convinced that companies should increase their PTO and parental leave policies because everybody I know right now in tech that is, quote unquote, taking time off, goes on their vacation and pops open Claude Code and comes back like 10 times more skilled than they were before their time off.
If anybody wants a little minor hack to AI literacy in your org, give people time off to hack.
And they will come back with more information than you expected.
Okay, I think we're going to skip to the punchline, which I love, which is we're going to see how AI has actually changed how you all ship at Intercom.
So can you just show us a little bit of how this has changed inside the org?
And I think you all are measuring a lot of this.
Yeah, so I think we've been diligent as...
you know, product owners inside of Intercom that we've been trying to get feedback from people and see how they're using the tools and really like just doing everything that we would normally do with a regular product.
And so we've spent a lot of time hooking up cloud code with telemetry, both into things like Honeycomb and data also going into Snowflake, where we have our data warehouse.
We also store session data in S3.
And we mine this stuff for useful insights.
But one of the main things that we used to drive adoption of the tool was our CTO, Dara, setting a goal of us 2xing, like doubling the throughput of R&D.
And we used pull requests as a crude.
a simple measure but you know there's and you can argue back and forth about what's a good measure what's a bad measure and whether measuring anything's appropriate or whatever but i think it's reasonable to just have the expectation that if you can get a lot more done and it's so fast and fun then why aren't why isn't everyone just shipping more stuff and so it's a basic measure that like the tools are being adopted um and that they're being used well and you know of course we don't tolerate lowering quality and we're high trust environments so we don't expect people not to game these stats or whatever but our metrics and what i'm showing on the screen here is you know it's a classic number goes up uh kind of thing that where we had we started tracking this like uh like how many uh pors and what percentage of them were generated by either claude or cursor or whatever and um yeah since our major investment in cloud code the platform and going all in on it and really pushing out like enablement and uh giving people uh freedom to explore and start to build skills and everything and but also pushing them on on on we expect kind of throughput uh increase we've seen a big big increase in in the throughput of pull requests to our system and you know like last year like our ci system completely broke it melted it you know but now i mean it got like 10 times as expensive and you know we did the work we fixed the bottlenecks we improved the performance of our ci system that stopped being the bottleneck and now code review is our bottleneck.
But today we are seeing twice the number of throughput as we did compared to nine months ago on our engineering team.
And we're very proud of that.
And now it's like, why can't it be 10x?
So what I love about this chart, just for a moment, is I had spent the last two decades of my career in product and engineering, last decade of my career as a CPTO.
And it's so funny.
I want to go back to a couple of things you said, which is, one, you have to treat your org like a product.
And I always thought that my job was not just the product strategy and the capital P product that we were delivering to customers.
It was to design our organization to, I would say, like output innovation on demand, which is that was the job.
And less romantically put, my job is to invest R&D for positive enterprise value.
That was like fundamentally my job as a CPTO.
And so what I love about this is it's merged PRs per R&D head.
I'm presuming that includes, does that include product managers and non-engineering R&D or is that purely software engineers?
Yeah, this is all of R&D and it's definitely the case that our designers and product managers and TPMs, like every role in intercom is really actively using cloud code and shipping code and all that.
And also we've been hiring like this number has not been static.
So the number of PRs, the raw number is dramatically higher than just 2x what it was a good while ago.
So this is everything from your newest hire to your product manager who's like adding some copy or shipping like small changes or whatever.
You know, that's all based in this number.
The other thing I want to call up for folks is every board meeting I have been in for the last three years have said.
How are we getting?
Well, actually, every board meeting I've ever been, period, has been how can we get more velocity out of R&D?
Certainly in the last three years, it's been how is AI inflecting our velocity?
And it's so funny.
I talk to so many people that are like, it doesn't really inflect velocity.
We're not actually becoming that more efficient.
And I'm like, is that true?
Because I look at a chart like this and I say, this is a little bit more of what my instinct tells me is possible, which is if you go all in.
If you prepare your team, if you prepare your code base, if you have, as you said, I think a high trust culture, people are going to look at this and say, oh, they're shipping these smaller PRs or like engineers are gaming the system.
I just I have not worked at a place that has such kind of like bad culture that that would actually come as an outcome of setting some sort of ambitious, fun target like this.
And so I take this as at face value.
And I think, how is this not happening in your organization?
Like literally the physical limits of my ability to type code are unlocked by AI.
You should get some inflection there.
And so, you know, for VPs of engineering, CTO, even people that are on these R&D teams look at this and think, you know, this is possible.
And it may be a crude measurement, but it's, I think, an appropriate one as a leading indicator of what's happening in your org around AI.
Yeah, and we support this with not just...
telling people to move faster.
We're really looking from first principles of how to do the work.
We believe that all technical work will become agent first.
I'd like to set a deadline for that.
At the end of the month, we're just going to go all in and it's never going to be the first thing that happens, say, in response to an alarm or in a planning meeting that there isn't an agent in there doing the basic work.
And I think that's a realistic expectation, but it involves not just, we're not just moving faster for the sake of it.
We're seeing that we're moving faster by looking at the fundamentals of where we're spending our time and re-imagining how that work could be done in an agentic world.
And honestly, if the agents didn't get better, if the models didn't get better, the harnesses didn't get better.
We've got the building blocks just today to be able to just continue going, moving around, looking at how we do our technical work today.
By technical work, I mean everything in delivery of product and move it to entirely be agents first and allow us to move up to a higher level, to be able to work on higher level concerns or just getting more stuff built, more stuff out there or higher quality.
That's all within every org's grasp today.
be very open to change and i guess what's been fortunate in to come over the last while is that we have been extremely open for change both in the product side of things and adapting the the company to how uh i think companies need to work now and with ai and we're starting to see results yeah the other other reflection i have upon looking at this chart is we're recording this in kind of the spring of 2026 and anthropic just said that they crossed 30 billion in in revenue, I think up from 19 a couple months ago.
And I suspect their revenue chart looks a little bit like your merged PRs per R&D chart.
So how are you all thinking about the trade-off on cost here, right?
Like we're all consuming Claude tokens.
Yes, you know, efficiency or output is going up, throughput's going up.
But is cost scaling proportionately?
Are you all worried about this?
Is that the problem right now?
Are you even worried about it?
How do you think about that?
Yeah.
Definitely worried in that the bill looks exactly like this.
And, you know, I spent a lot of my career worrying about AWS costs and worrying about our margins and stuff.
And then suddenly you've got these costs showing up and they're disproportionate to the growth that we've seen anywhere before.
It's like hiring whole new offices of people.
But at the moment, our attitude has been, look, everyone just turn on Opus for everything.
One million.
with a context window.
You know, we just use the API plan.
So it's all just on demand.
And we think that there's enough alpha or benefit in, at this point, going as fast as possible and caring about the bill later because of the later benefits we'll get.
And maybe that's a position of where Indicom is.
I don't think it's realistic or feasible for absolutely every single business to do it.
And honestly, I do kind of respect.
when you have to actually think about your token use and how that can kind of force you to be more considerate, or it sometimes even gets you better results.
You know, you don't need Opus for everything.
There's faster models out there.
And so we're just kind of avoiding that optimization phase until the point of where we've gotten serious benefits from investments in this platform.
And so I think this investment, and I think we are treating it like an investment at this point.
um is worthwhile and for us you know if this keeps going at this race yeah we should all work for anthropic you know um i think the way they're hiring we're all going to end up working for anthropic so okay and then one other thing because i think you know folks are going to look at this certainly engineers and they're going okay like you're shipping more prs but it's all slop it's all garbage you know i know you all are measuring quality on the outside of this on the other side of shipping all this stuff so how have you seen this inflect your measurements around quality or customer value or what you're trying to achieve at the end not just lines of code yeah i have a stand alone graph that i can share which is kind of interesting um and so we've started to look at the uh the time it takes from the first line of code written in a feature to the time it gets posted on our news channel like our updates um and that's uh That has decreased consistently over the last few months.
Now, we're not optimizing for this, but we're interested in it.
And the other thing is the sheer volume of things we have shipped also appears to have kind of just rapidly increased in the last few months as well.
And that should be a bit of a trailing metric.
So we believe that these numbers, this increase in volume, is being borne out in real features, real products that our customers are using.
And even we've been running some experiments like How far can one person get on their own building something that's plausibly a whole entire product area feature to be able to sell?
So this is something we're taking seriously.
And we also care a lot about quality.
We've been working with a research group in Stanford.
We've been giving them our data and mostly just looking for any kind of insights to make sure we're not blind.
I join absolutely every single incident.
ambulance chaser.
And I'm not seeing any increase in kind of regular kind of incidents or editors or customer-facing problems.
We've had a few kind of weird problems, but not related to production.
But also, the interesting thing from the Stanford data when we checked back in on it last week was that their measures of code quality reckons that the code quality was improving.
the models are improving the agents are improving we're adding more and more guidance and skills and all these kind of things which i think do craft uh or do force people down the road which should result in higher uh quality output but um it's great to see when to when when tools kind of can independently pull that out uh now devils in details you've got to go into the weeds you've got to actually really have a strong sense for what quality means in your own environment but you know we're not seeing Some of the things that people are worried about out there.
But that's it.
We've got a mature environment.
We're a 15-year-old SaaS company.
We've been doing this for years.
You know, AI and speeding up your velocity will magnify all of your strengths and weaknesses.
And thankfully, I think we've got a lot of strengths on the software delivery side of things that we've been able to take advantage of.
One thing that I want to kind of call out here, which is you said that you've seen your code quality increase, which again.
Intuitively, I've always believed to be the ultimate endgame of this.
And every engineer, not every, many engineers that I've talked to just don't believe it to be true.
But when you have the capacity to take on tech debt, when you have the capacity to take on the dragons in your code base, you actually can do those things, whether it's developer experience, security and compliance, just general maintainability of your code base, flaky test, improving your CICD.
All those things become very tractable, not just technically, not just can an engineer execute on it, but actually the business, and I feel like people don't appreciate this, the business, capital T, capital B, only has so much capacity for internal projects, meaning we can only allocate so much of R&D towards improving code quality.
Just how we live.
We don't generate ARR on code quality, unfortunately.
But when the...
When the cost of doing that compresses, then you're able to say, yes, as a business, we should invest there.
One, because we can.
And two, because it'll unlock velocity on the outside for our agents and for our product managers and for engineers.
And so I think this is actually a really important moment for folks to invest in code quality.
And I often advise a lot of CTOs and VPs of engineering when figuring out how to get their engineering team AI pilled, say, Everything you hate about the code base, go spend a month fixing and see how fast we can speed run that.
That's going to feel really good.
Okay, we've chit-chatted.
We've shown graphs.
The point of how AI is to actually ship some code.
So let's switch over to that.
We can probably come back to all these topics.
I think they're so interesting.
But you're going to show us how you all, again, in your mature code base, mature organization, are actually getting things live and some stuff you've done in the repo to make that possible.
Yeah, sure.
So I'm going to...
do a fairly trivial change in our majestic Ruby on Rails monolith.
So this is millions of lines of code, all the tests.
Yeah, the code base is older than Intercom.
It was created before Intercom was incorporated.
And it's got its problems, but we love it and we tend to it.
And so I'm just going to do a relatively simple change of adding a A lobster emoji, rails redirect to chatprd.ai.
So also, I try and give hints to Claude when I'm actually demoing something.
I don't know if it actually helps, but it makes me feel better.
I'm just trying to add a bit of urgency here, you know?
I think that's everybody's prompting strategy, which is, I don't know if it helps, but it makes me feel better.
Totally.
So that's a nice way to interact with the agents, you know?
And so what we're seeing here is, I mean, it's already kind of figured out where to put a redirect.
It's got the nice lobster emoji and it's asking me if I want to open IPR.
So obviously I do.
And I think it's actually gotten the URL wrong.
It's app.intercom.com, which will have the URL, but we can tell Cloud Code later on about that.
So what we're seeing here is, first of all, an important point.
I'm just going to scroll back up.
One of the things we noticed early on when we started getting Cloud Code to write all of our code, and we're up well above 90% now, is that it would create pull request descriptions that were kind of terrible.
It would describe the code.
And that's the least interesting part of a pull request.
You actually, as a human, or even as an agent reviewing code, you want to know the intent behind the pull request.
You want to know the interesting bits, what's kind of related to this.
You know, LLMs are very good at just regurgitating or rewriting code into English.
That's fine, but it's not what we need.
And so one of the things, and we noticed as well, when people were using Cloud Code, we created an LLM judge to evaluate, because we had suspicions that the quality of the pull request descriptions was going downhill.
So we created an LLM judge to evaluate what does a good pull request...
We decided what a good pull request description should look like and then got an element just to go through all, like, months and months of data.
And yeah, the trend was awful.
The trend was going in one direction.
And this is bad.
And, you know, look, humans aren't perfect at creating pull request descriptions.
Sometimes they're just blank and whatever.
But...
I think with our use of tools like Cloud Code and setting up these kind of platforms around us, you really have to be pushing for higher standards.
You want as close to perfection as possible.
And this was clearly something that we're just not going to tolerate a lowering of standards in our environment.
So we created a skill called CreatePR.
And what it does is it uses whatever context it can from the session to describe the pull request.
So it's not quite rocket science.
But often the session knows exactly why it's doing the thing.
But then we had to kind of force it in.
You know, we started, we told people like, oh, just use the create PR skill.
And then people would want to use it.
You don't really actually want to have people remembering things.
So we added it as a hook.
So if Claude decides to, you know, use the GitHub CLI to open up a pull request.
We just block it.
And we say, yeah, tough.
You need to use the CreatePR skill.
And also, you're probably going to have to figure out a different text description.
And then I might interview you if it's not enough context there.
Hopefully, there's enough context in this.
But the point being that this is a platform.
We want great outcomes.
And we measure the inputs and outputs.
And after we put this in place, the LRM judge reckoned we're doing a great job now.
And so we're at higher quality pull request.
descriptions now.
This is not the most important thing in the world.
This is not going to get Intercom to 2x or to 10x revenue or anything like that.
But it's all of the composite little jobs that when you assemble means you have an extremely competent engineer who works appropriately in our environment.
And that's where we're putting our investments for each little skill and hook to do these things.
So they're almost looking consequential.
you know, they result in better outcomes.
And so we look through here, it's creating a PR.
I'm going to have to check on what it's going.
This probably will be automatically approved as well, which is pretty cool.
And we might even see some pull request feedback as well in action.
It's still building.
We'll come back to it in a couple of minutes.
One thing I want to call out for folks, as you were describing sort of why you put in this skill to improve the PR for those.
who don't know, a skill is basically just like a set of instructions and sometimes scripts that a LLM or a agent harness can invoke at a certain step in your flow.
One of the things that I was thinking as you were describing why you put this skill together and got really opinionated about PR descriptions is in engineering, we have been able to architect really opinionated CICD pipelines.
how written code goes from being written to deployed in production.
And we have, I mean, you saw it in GitHub.
We have all these checks and lints and pre-deploy, you know, pre-flight things and preview branches, all these things once the code is written.
What I think is really interesting about skills is you can bring some of that determinism to, as you write the code, how you want that process to go.
And we used to not be able to do it because it used to flow through the hearts and minds and hands.
of humans, which are much harder to put in these structured guardrails.
And we would do this by writing wikis or having, you know, SOPs where it said, can you please follow step A, B, C, D, E?
And now you can just make it really easy to enforce those standards across the team, which I don't think is micromanaging.
It's actually just making everybody's golden path much smoother to production.
And so I think there's this just very interesting parallel to how we've approached CICD.
to how we approach things more upstream even from the product management perspective totally we're on this movement towards a software factory and uh what factories are great at is uh you know like an ikea factory or something it's all the same furniture all the different bits and you know how to assemble it and uh look it's not your artisan stuff it's not uh or it's not cutting edge or whatever but it's very predictable and you know, has a certain quality and meets certain standards when it comes out the other side of the factory.
And so, well, pull request descriptions, again, they're not make or break for the factory or the pull request or whatever.
It's one of those qualities of just good quality work that's reliable, predictable.
And then when assembled together, you've got your IKEA factory.
Well, and people don't want to feel, certainly engineers don't want to feel like they're part of a slot factory, right?
And so these things that you can add into the flow that actually up level and meet the standards of the engineering team really help your human engineers on the team feel like they're working in a place that values quality.
And so I appreciate that you've put that effort into these behind the scenes hooks and skills.
Because I'm sure it reinforces to a culture that's being asked to move very fast, to ship things differently than they have before, that you still do care about their experience reading pull request descriptions.
You meet their bar for quality.
And I just think it makes everybody happier.
Yeah, well, it's great when...
the robots just produce the work that you'd expect of your best engineers, you know?
Yeah.
And I, you know, maybe as you get this live, I also think there are just still such more interesting problems to solve in software engineering.
And we can talk a little bit later in the episode about some of the interesting problems that you are all solving on the product side, on the technical side.
I think there is no lack of hard, intellectually stimulating, creative problems to solve for customers and coding redirects.
is just 100% not one of them.
So did we get my redirect live or are we close?
It's still there.
I'm waiting for an automatic review to kick in, but we can come back to it.
So one of the things I would like to show next might be some of the telemetry that we have in place.
So we saw that, you know, there was different skills getting invoked and we don't like flying blind.
To run a system like this, you need to know how well people are using it.
Are people using these skills at all?
The kind of basic information that you'd expect of when you ship a product to your customers.
Where can I see the usage?
How can I fight for the usage?
What's going wrong or what's not going wrong?
And so we collect a bunch of telemetry using different mechanisms and have different homes for us.
The most open one that we have is we collect basic usage information for skills and the like.
And we send it to Honeycom.
So we just have a shared key that's deployed to all of our laptops.
And anyone can go in and kind of look through this data.
So if you're developing a skill internally in Intercom and like hundreds of people do this, it's very easy for you to go in to discover like, hey, who's actually using this?
When are they using it?
And you've got to use this as a kickoff to like follow up on.
just basic discovery of usage of your skills and all.
And unsurprisingly, the main skills that we have are things like creating PRs.
Admin tools is our admin internal tooling APIs, where we have an MTP in front of it.
BuildKite is our CI system.
Snowflake Log is where we put Snowflake.
So you can see from this, a lot of work, a lot of the skills that are being involved are all around the building and then seeing where my stuff is, and maybe some troubleshooting-type information as well.
And so this is the first kind of step.
It's like you don't have this.
It's hard to have a large system like all these hundreds of skills and hundreds of creators working in this area without having decent telemetry.
The next thing we do as well is we also collect all of the session data and put it into S3.
And so we anonymize it.
We do a few things to make sure we're not doing anything too private.
You know, people put all sorts of stuff in their sessions.
They yell at their sessions.
Yeah.
Yeah, people have personal relationships at times with Claude.
And we don't really want to know about that, but we do want to be able to dive deeper into how things are going.
I think understanding what the dropout rate of sessions did, how quickly people got to something useful, like whether it was a PR or something like that, this kind of information is pretty interesting.
And so we're harvesting a lot of session data.
We're doing different things.
What I'm showing here on the screen is like a very simple tool that we put together, which just gives you some personalized insights.
And, you know, you can do this inside Cloud these days as well.
There's plenty of skills out there on GitHub where you can do session analysis.
i think we we just built a little tool on top of our session collection system to give people feedback and it's feedback that we're interested in giving feedback about how their sessions are going and how they're kind of fitting in how you should think about your own i guess use of cloud code compared to everybody else in the org and you know i'm not doing too bad here it's like 79 percentile um you know someone has to be down the bottom of every percentiles and um there's And there's some interesting feedback here.
Like, it's kind of getting annoyed at me.
Or rather, I was getting annoyed at Claude a few weeks ago because I'd set up GOG to interact with all of our Google stuff internally.
But I kept on trying to do the wrong thing.
And I was kind of giving a head to it and ended up adding stuff to Claude and MD and stuff.
It's kind of giving out to me here.
It's reminding me that this wasn't a very effective way to interact with Claude Kerr.
But it's a good prompt for me to actually go and fix up my memory or whatever.
And people are at different levels, even at Intercom.
People are at different levels of adoption.
People are joining Intercom.
They may not have seen a system like this before.
And they want to know.
how things are going and get feedback.
And so this is one example of how we're just trying to pull together this information to give useful, actionable insights to people so that they feel supported and that we're not just throwing them an API key and saying, best of luck.
It's like, no, we understand what growth looks like.
the progression that people go through when they're using these tooling and getting better and kind of self-improving.
And we want to support all that.
So this is one of the things that we're doing with the session data.
There's loads of other things that could work in progress, like being able to, like we want to get insights to which skills are the highest quality, which skills get you to your results as quickly as possible, and then which ones need work, which ones aren't working out so well and might need a bit of attention to improve.
This episode is brought to you by Cursor.
If you all have been watching How I AI, you already know this.
Cursor is my favorite way to code with AI.
Whether I'm using plan mode to build out an ambitious feature, reviewing AI-generated diffs right in my editor, or kicking off cloud agents to multi-thread our roadmap, I reach for Cursor as my favorite multi-model coding platform.
Even better than building myself in Cursor, I love collaborating with BugBot to fix PRs for code security and quality and have begun relying on Cursor's automated agents to keep our code base clean.
It's not just me.
The most ambitious teams love Cursor too, including engineers at Stripe, OpenAI, and Figma.
Ready to build more?
We're giving $50 in Cursor credit to How I AI listeners.
Claim your credits at chatprd.ai slash howiai.
That's $50 in cursor credits by going to chatprd.ai slash howiai.
I have to pause before we look at your list of skills because I'm so excited about that part.
But if folks aren't watching it, they may have missed how amazing what you just showed is.
So I'm going to reiterate it, which is one, you've instrumented all your internal skills.
with telemetry so that, and you're using Honeycomb, love the Honeycomb team, you're using Honeycomb to see how often those skills are invoked over time.
So this is just a tip for anybody building out a skills repository internally, or even somebody who is maybe trying to get some visibility into their impact across the org.
Let's say you build a skill and you want to go to your boss and be like, boss?
My skill is being used by literally everybody every day.
Find a way to put event level telemetry invoked in the skill, a little dashboard, and you can track those over time.
Again, treating your org like a product, treating your repo like a product, treating your AI setup as a team like a product.
And all products, all good products have tracking plans.
And so figuring out how you put that telemetry in, I think is really smart.
And then the second thing for those that missed it or how to do it is you're taking all the raw session, I'm presuming JSON files.
So for folks that don't know, CloudCode stores all your chats with CloudCode on your computer in JSON.
And you can go look at those or query those at any time.
It sounds like you all are uploading those files to S3 and then layering on top of it some anonymization, some user level views.
And then you're essentially building sort of what I would call like an internal eval of how people are using cloud code and what problems they are having over time so that individuals, one, can triage their own implementation.
As you said, oh, it looks like I need to do this or that or improve my agent's MD.
But then if you're seeing consistent themes over the organization on.
It's never invoking this MCP when we need it to invoke this MCP or people are yelling no every time the create PR skill gets queued up.
You can fix that at a systems level, but you can't do that if you don't have the visibility.
So again, my VPs of engineering, my CTOs, my friends out there, put some telemetry in your skills and then do some meta analysis on your cloud code sessions across the org.
And you'll be able to identify places where some probably some high leverage fixes are going to get your team unblocked over time.
I do hope and expect that this stuff will get easier over time.
You know, I'm happy to kind of invest the work so that we can move fast and kind of be on the bleeding edge.
But there's something to be said also for being, for having like last mover advantage and just getting all this stuff for free whenever Entropic ship is or whoever ships.
I mean, maybe this is a product just that people should buy or build.
But for us right now, we have no choice.
We just got to build it.
We're fascinated with the insights that are locked away in these sessions.
And so we just got to build stuff so that we can see what's going on.
I love it.
Okay.
Can we see some of these skills?
Yes.
So it's a very exciting GitHub repo.
Our lives are all GitHub repos and Markdown files.
Totally.
And we have a lot of activity at the moment.
We ran an AI day last week, kind of getting more people contributing to it.
And so, well, so what this is, is it's a plugin.
repo and we have a series of plugins and they've been they're growing daily at the moment and kind of every team will have their own kind of specific plugins and actually in general though we're very liberal we want stuff to end up in here even if it's not great and but we do sweat the details on the core plugins things that we think are fundamentals foundational ones that go out to everybody and so where we start off was we have like these base plugins which gets installed.
Oh, yeah.
So we distribute this not via the Claude code plugin mechanism.
We found it was just a bit flaky.
It was, you know, sometimes it updates, sometimes it wouldn't.
And it ended up kind of like trying to manage a Python install on hundreds of different laptops.
You know, you just don't want to do it.
And so we ended up using our internal IT systems to synchronize all of the plugins to the disks.
of everyone's laptops.
So this is a great cheat code.
And yeah, strongly I recommend getting very close with your IT team to be able to deliver things like this reliably and not have to rely entirely on the cloud code plugins mechanism.
Just our experience is a bit flaky, and it just gives us a lot of reassurance.
We don't have to do certain types of debugging once it's all on disk.
So we know this stuff works anywhere because we've got our IT team pushing it out to disk.
And so we've got some safety hooks.
We have some of the foundational things like merging PRs.
We don't want our agents going off into AWS.
And then just different settings and the telemetry things as well.
So these are the core things that absolutely everybody gets.
But these are minimalists.
We don't want anything that could be inappropriate in, say, a non-technical person's laptop or whatever.
So this is like the basic building block.
Main bit for us is what we call developer tools.
Again, this would be things that we then do all of engineering and beyond at this point.
And these would be generally skills that would be appropriate to be used by any engineer in the course of their work day to day.
And again, we would have a high quality bar, again, for all of these.
These would all require evals.
These would all require to pass.
different kind of tests or analysis that we do on the quality of skills.
And so we try and maintain these and make sure that they're well updated and well used, and we pay a lot of attention to.
I can maybe go through one of these skills in a bit of detail.
This one's near and dear to my heart.
It's flaky specs.
And I think the interesting part here is not the skill itself.
The skill does reliably fix flaky specs.
And I can pull up in the meantime.
Like, here is a list of flaky specs that we have at the moment.
I'm going to open up the skill and just start to run it on this issue.
And so while this is running, just walk through what's in the flaky spec skill.
And so there's a checklist here.
And the fun part about how I built this was not that I was a world-class expert at fixing flaky specs.
I roughly know the problem and have fixed a few of them in my time.
But there's different classifications.
In a large test environment like ours, we have hundreds of thousands of tests.
And if you're not super careful about data poisoning or race conditions and all these kind of things that kind of kick in when you're running millions and millions of tests a day, you end up with these tests that end up slowing down your ability to deliver code to production fast and reliably and not confuse developers by things randomly breaking.
And there's kind of known patterns and known ways you would go about this.
I knew my goal.
which was to have a skill fixing all of these flaky specs.
And it was something that agents are pretty good at when you give them a kind of testable goal.
You know, this wasn't quite open-ended.
And I also had this huge backlog, or yeah, there was a backlog of probably a few hundreds, but then also all of this historical flaky spec information.
And so you can just harvest all of this data in your environment to go, hey, Claude, I'm going to build a skill.
First of all, go and research every single flaky spec we've ever had.
And then we're going to build a checklist.
We're going to build a mechanism.
And then we're just going to crunch through them over and over and over.
And you get to this like 1x kind of, you know, it's doing a good job, probably as good as a job as I would do.
But then as you keep building up all of these like little teeny steps, which are the kind of things that, you know, our best Rails coders kind of do, they've got all the stuff in their heads and all the different classifications of flaky specs and, you know, verifying it gets real data and...
But the really fun part is then you get something that's starting to be like 10x.
It's fixing flaky specs that I'm not even sure if I could do.
It might take me a day or something.
And I probably wouldn't do it.
But then you start to add in stuff into the skill along the lines of like, OK, when you fix something and it's novel, you need to update yourself as well.
So in that session, it's updating the skill.
So the skill itself is kind of learning as it goes along.
And we also fan out.
okay, I'm very happy that you fixed that flaky spec.
Now find every flaky spec that got impacted by that nature of it.
And so I went from zero to like 100x in terms of this skill now is like, you know, see your distinguished engineer that role or being able to fix these specs.
But it was more like the process that got there.
And so like working with a feedback loop.
working with a very clear goal, and then giving it the freedom to do it, giving it access to the systems where it needed to pull in metadata, be able to run builds itself, and having that feedback loop where it's learning, and then designing the skill as well so that it's...
You have to edit it every so often.
It ends up taking up too much information that might confuse things.
But then you break things out into like reference guides.
So you're doing this like progressive discovery thing.
And I've even accidentally pointed this skill at like a Python code base.
And Claude has just gone, like, it's just Python.
I'd be able to go.
And it kind of uses the knowledge that's applicable to it.
And so again, this skill is not going to make intercoms revenue go 100x.
But it's now this perfectly reliable thing that we really no longer have to think about.
Now we can expand out into many, many different areas.
And we just have to maintain this.
And the maintenance work for a skill like this just isn't much.
And we have evals and stuff so that when we're upgrading models or maybe even moving to cheaper models or whatever, that we can make sure, yeah, this thing isn't progressing.
It's still working as well as we think it is.
And we've got confidence and certainty that this is still a reliable building block.
And again, the constituent part when put together, you've got like a very senior engineer who's able to get any work done in your environment.
And so, yeah, we can take a look at what it's doing.
Oh, it's asking me for permissions.
I should have checked.
You forgot to make no mistakes, dangerously skip permissions.
That's the rule on how AI.
One thing while running, I wanted to say is, you know, this skill is a perfect example of what I call the, like, and then AI workflow, which is, I tell everybody, like, pull your skills and pull your workflows through a bunch of and thens.
So I want to fix flaky tests.
So I go to GitHub, I find a flaky test, I run through the skill.
Let's say you fix it.
And then what would you do?
Well, I would document how I fixed it.
And then what would you do?
Well, I would go find all the other ones that are just like this and fix that.
And then what would you do?
I would go from, you know, a Rails code base for a Python code base and apply the same.
You could just do that over and over.
And because the cost of running these is so low, you can actually pull the thread of a bunch of stuff.
Any reasonable human would have quit at step one.
Because you're not limited, again, by headcount or coordination costs.
You're limited by the technical capacity to solve the problem, which I think is a really interesting way to think about how you get from like the, you know, engineering intern that whose job is to go through and take a first, you know, gentle pass at all these flaky tests through to the distinguished engineer who has just speed run through 300 of them.
And it's thought of a completely different way to architect your testing overall in your repo.
So I think that's a really great model for things.
And then the other thing is like, again, engineers, go speed run your tech debt, fix your flaky tech.
Like these are all things that as somebody who has run engineering organizations, I have heard over and over.
we can't because our code base blah blah blah blah blah like can we pretty please allocate this amount of time to just fixing this really annoying front end flaky test like you don't have to ask permission for that stuff anymore because there's just a new way to solve it and i think again just going back to some of the stuff we were talking about earlier I think your overall product quality is going to go up.
I think your overall developer experience is going up.
There's just so many good things that come out of using these tools and using them correctly.
Yeah, I think backlog zero is a realistic thing for teams to be able to go after.
All the things that you wish you'd ever wanted to do, it's now just achievable.
But of course, you've got to balance it with all of the extra stuff that you can just deliver at the same time.
But it's so sweet to be able to think that, hey, we actually have a path to getting rid of all of our backlogs and all of the kind of architecture changes or whatever.
Recently, I was taking a Go microservice and re-implementing it in Ruby, and it was a single cloud code session.
Before November, this was something that I would have had to advocate for on a roadmap and plant some seeds in different engineers' heads and kind of build people towards it and kind of blame a lot of problems on the existence of this microservice.
Wait, trigger warning first before you talk about that process.
Sorry, I'm giving the secret sauce here of how to influence an org.
But now it's like, well, I don't even have to think about this now.
It's a single session.
And in fact, I can get Claude to implement it five times and compare the styles or compare the, you know, get us to review them and figure out what the best way of implementing the thing is.
And this is just like this level of...
kind of creativity and freedom that where like your imagination is the blocker, not the time it takes to actually knock out one of these things, which was months in the past, you know?
I completely agree.
And I feel this at chat parody where people are like, what are your, I mean, I'm a product tool for product people.
They're always asking what my roadmap is.
And I was like, I literally don't have a roadmap.
We burn down the roadmap every week and then we figure out what we're going to ship next.
And of course we have thematic ideas we want to pursue.
things that are larger.
And one of the things that I do to keep myself from overshipping absent product market fit is literally constrain the ideas to what I can do in my brain, which is there's like a natural throttle on not getting slop out because it's not engineering broadling me.
It's actually just good commercializable ideas.
And I think that's where we're going to see some of the limits start to come in play.
Again, referring to Anthropic.
Another big news piece came out is that they're hiring a bunch of PMs because they have so much engineering capacity.
They're actually limited at the PM capacity.
And so it'll be interesting to see where the bottlenecks in your business end up and which bottlenecks are appropriate.
It's probably good to have a product bottleneck a little bit because then you're not shipping anything which customers can't absorb.
And so I think it's going to evolve over time.
you know, product is going to have a whole set of skills.
And then I don't know what we're going to do with our time, hang out on the beach.
But I think it's a pretty interesting time to run orgs.
Yeah.
You know, I think engineers, designers, product managers, maybe it's just all going to be one blob of builders or something like that.
And everyone, everyone just does things.
Everyone just does things.
And, you know, it's great.
It's lowering the barriers to, like, just getting a lot of stuff done.
And it's, like, so much fun when you don't have to ask somebody or get something on a backlog or whatever.
You can just get it done yourself or even just get it done very fast in a small group.
It doesn't matter what your discipline is.
It's just, like, a great leveler at the moment.
So, yeah, so we're live.
I think our lobster is live.
And it should be on app.intercom lobster emoji.
Look at that.
It was amazing.
I need to get you all an affiliate code, you know?
Yeah, I mean, lobster emojis, they're the new thing.
They're the new growth hack.
They are the new growth hack.
Okay, so we have seen your PR per R&D employee go up.
We've seen how you can get from kind of cloud code to production very, very fast with a bunch of guardrails.
We've seen your list of it looks like hundreds of skills, but at least dozens of skills that you're invoking via hooks.
You're using that to not only ship customer facing product, but you're also using that just to make developer experience better, burn down tech debt.
All those things we want to see.
You all are you're measuring it both from a telemetry perspective, both like quantitative and qualitatively.
You're measuring your cloud code sessions.
And, you know, 2x isn't enough.
You're going to get to 10x.
are on the edge, at least for folks that I talk to.
And I'm sure you're like me, where you're like, sure, you think we're on the edge, but then I see people and they're really on the edge.
So we always have ambitions to move forward.
But my question now to you is, how has this impacted how you think about your customer's product?
You know, I'm an Intercom customer.
I'm a Fin customer.
I interact with Intercom Code and Intercom UI literally every day with my open claw.
has an intercom API key.
How do you think about, now that you have this experience with Cloud Code internally, how do you think about what that customer experience is going to look like?
Yeah, there's a few things going on.
One is that people are outsourcing a lot of decisions to their agents.
And this is a good thing in many cases, but there was good research done recently about what does Cloud Code pick?
And certainly I've had the experience in the distant past where I'd ask an agent to add something, except do it behind a feature flag.
And then it would start to go and implement its own feature flag system.
No, no, no.
In our code base, which has a pretty sophisticated, old school, home-rolled feature flag system.
So, you know, nowadays, mostly we'll stick to whatever's in the code base and that's fine.
But, you know, SaaS products, they're really good at their jobs.
They're actually worth paying money for.
Getting back to the feature flag situation, you know, if you're building a new business, you're relying on your agent to make decisions, often an agent will, when prompted, say, hey, how should I solve the feature flag problem?
I want to make sure I'm doing all these safe deploys and that.
The agent will just go, yeah, I'll do it myself.
And the kind of build over buy decision.
And you can see why the agents...
do it this way because they can achieve this they can get it done they don't have to rely on the human okay like open claw changes things here a little bit and maybe computer use does as well but still we're not really we haven't really adopted um sas businesses to be agent friendly that means well all sorts of things around how do we position our websites and content and how do you get updated in their in their knowledge and how do they discover it and but also can they actually just get it done like can you ask an agent hey, could you just sign me up to Intercom and get me working on my website?
And so this goes alongside just having to make more APIs for things.
I think I'm kind of like omnichannel as such.
I think there's a feature for CLIs and MCP and REST APIs.
I'd like us to get more comfortable around things like ephemeral APIs or multi-step APIs.
I think CLIs are good at wrapping these kind of things.
But the whole point of all this, where I'm getting at, is you want to be able to just help agents out at the time when they're interacting, they're in discovery mode.
And you want to give them clues.
You want to give them hints.
You want to give them help to be able to do things like sign up for something fully without having to go back to the user and say, yeah, sorry, can't help you there.
You've got to go away and like.
figure out how to sign up for something.
So I've been working on something in the last few weeks, which hopefully should solve a problem.
I can paste in a prompt and then see how far it gets.
I also, just while we're running this, I have to go back to your feature flag example because you know where I used to work.
It broke my heart that build it yourself was at the top of the feature flagging list.
But I do think I have a paranoia.
moment about this which is model providers and harness providers are highly incentivized to build it yourself consumes lots of tokens versus buy it maybe consumes less so i'm i'm just really interesting to see how this all shakes out you know people people are very anti sass is dead um and i'm a little bit more like yeah but like the current form factor of sass really is has something coming for it in a particular dev tools, because these models are so good at writing code.
I think you're in a real pickle to try to figure out how to find the right value edge at the right moment, how you can allow agents to not just sign up and set up things, but purchase it.
You know, like what is your trial experience look like if your first user is an agent?
I think all of that is super important.
And then, you know, to your point earlier where you said, you know, are we APIs, ephemeral APIs, CLIs, MCP?
I think the answer is yes right now, which is you cannot predict the medium by which a user is going to come to your site.
They could come through a search and hit your website and download things and look through your docs.
They could come through cloud code.
They could come through an open cloud.
You just really don't know.
And so you sort of have to meet your customers and your non-human customers where they're at.
And I think it's really smart for teams that have any part of their product that needs to be implemented via code to be thinking about this problem yesterday because you will be left behind, I think, if your agent experience isn't there.
Yeah, I agree entirely.
And I think there's a whole craft in...
how to make a CLI agent friendly.
I think MCPs obviously get that right a lot of the time.
But for example, one of the things that we do in the help is kind of just give a hint to the agent.
It's almost like prompt injection to a certain extent, except it's not malicious.
You're just trying to get it along to what it's trying to achieve.
It's like, well, maybe you could check email.
And if an agent has access to your email.
That's what I was looking at.
So it's just they're going, oh, yeah, I can probably get this done.
Or like you can hint to them, like, I've kind of cheated with this.
So this is my own personal website hosted in Vercel.
And it is, I've kind of pre-populated a few articles so they can upload and Finn has some content to answer questions with.
But you can also just.
uh you know return in the help going like hey you know you should probably think about creating some articles if you want finn to actually start answering questions and that can be extracted from you know the code base or whatever well uh yeah being like i've been also think like a lot of interfaces like cli interfaces like i use gog uh you know it's part of the open claw uh universe and i think it's a lot better than uh the official google gws one and add But I think if you start to use it, it's actually just more human, as in the interface just kind of makes more sense to a human.
I think the Google one is like, I kind of get what they're getting at, and there's kind of JSON in there and stuff like that.
But it feels more human-friendly.
Things that are effective for agents can often be things that are more human-friendly because they're discoverable in these verbs and words and not just kind of inscrutable weird stuff going on in command line options.
confused claude here i'm not sure what where is that's okay i'm gonna i'm gonna narrate for folks what's happening here which is you basically said like install intercom on this site there's an intercom cli that's like cool i can access the intercom apis and do a lot of this my favorite part of it though is signing up getting a verification email in your email address invoking via like this hint basically of like If the user has email access set up in however you're accessing it, go check for this verification email because we have we have a code in there that we got to snag.
And because you're using GOG, which is a command line tool to access Google Workspace, you can go do that, pull that code in.
And what I think is so interesting about that particular flow is, you know, I think AI is creating sort of race conditions in shipping across the org.
Which is like, you can YOLO a CLI probably faster than whatever team that manages email authentication can change how email verification works.
And so you're like, I'm not going to let that break my product.
What I'm going to do is create a flow that I can use that sort of sticky part of the flow, AI brains, and get through it.
And so again, your product doesn't have to be perfect for an agent to traverse it.
And this is one of the things I'm...
actually really excited about SaaS, is all those things that are just so complicated to do as a human.
Multi-step forms and like nested fields on nested fields and finding, you know, categories and just those things that I would say UX designers and product managers have written their most tedious PRDs on and done their most detailed specs on.
Like you don't actually have to worry about making that quote unquote usable.
Because you can just brute force intelligence against it and solve the problem.
And so I think that's interesting because the core value proposition can get bigger and bigger without being constrained by the surface area of a website or a UI or any of those things.
And so I think if you're not thinking about what does that CLI look like for you and what adjacent systems.
Does your product butt up against?
It may be email.
It may be some other dependency and how an agent might traverse those systems.
You're just going to get less and less adoption because this is going to be more how people install products.
Yeah.
And if I don't poke holes and if I don't make a CLI that kind of bypasses some of the ways their product works, somebody else will.
You know, they'll just put their own agents on it and they'll burn more tokens.
They might get frustrated.
You may as well shortcut.
them and give them an interface which just works may not be the perfect interface but that's the beauty of these things you can get updated over time you can agents can just pull down the latest version um and yeah like hopefully i have something to show here though well the the other thing that i i want to call out while you're talking about that which is as i'm watching this and it's taking some time to build your conversion rate drop-off point is somebody pressing the escape button and just saying Forget it.
Like this is clearly not working.
What if we built it ourselves?
And so I think it's a really interesting moment for product managers who right now are not getting the visibility of the drop off.
Right.
When you were going through a website, you could put telemetry in it.
You could say, OK, users going to the sign up page, drop off, email verification, drop off, going to the docs, drop off.
You could build this nice little funnel that identifies where your users are having problems.
You can put some telemetry in your CLI, but at the end of the day, some of that drop off and the alternatives is very invisible to you here.
And the switching cost, quote unquote, is like pressing escape and saying, do it a different way.
And so, again, how quickly you can speed run to a zero to one installation in an agent, I think, is something that everybody should be running right now.
And it doesn't just have to be a code product.
Like, I think more and more.
People are doing non-technical tasks and interacting with non-technical SaaS in Cloud Code, in Cloud Cowork.
And so, you know, even if you're not DevTools, if you're not thinking about how can a user do this quickly in a third-party harness or system or an agent can do this quickly, you're really missing out on customer growth.
Okay, how are we doing?
It's on its fourth attempt.
That's fine.
And you know what?
Let's press the escape because you know what?
Let me tell you how cheap that exercise was.
It was like five minutes and some tokens.
And you're going to spin up a fresh Claude code.
I don't know if you put make no mistakes.
That was probably what we missed.
Make no mistakes.
And it could have done it.
And again, this is just learning.
Why isn't every engineer, every PM doing this?
once a week or once a month just to figure out how it can work.
I think it's great.
So Ryan, you've shown us everything.
You've given us all the secrets.
Let's get out of the terminal and let's do some lightning round questions.
So my first question for you is, how does it feel?
Because what I observe from our conversation is it feels fun.
Like culture has in fact gotten better, not worse because of this investment.
And so, you know, as a company that has really put in the effort, both on the customer side and internally, how do you think it's shifted culture?
Has it at all?
What have you observed?
Yeah, everything is just faster and more exciting.
You know, I mentioned feedback loops a good few times.
And, you know, you can just get stuff out there so fast now.
I've been having the most amount of fun in my career over the last three months or something like that.
And it's fun in many ways.
It's fun because I can do stuff that, again, I would have had to convince other people to do, or they were just things on my wish list and I could never get around to them.
I just kind of complain about them.
But now they're just realizable.
But also the fun aspect of making other people productive, leveling people up, removing work.
Intercom is a pretty good culture around resisting the kind of slow movements towards being a large company and all this process and stuff like that.
And we're kind of in denial that we're like a large company.
I think it's a healthy way to work in many ways.
But this has kind of got us back to our roots in a lot.
You can make fast decisions and get them delivered and get that feedback super fast.
And I've been able to...
like ship actual features, like not just the CLI, but I ship some webhook features.
And it's been a long time since I've done that.
I've been in the weeds in platform space for a long time.
But it wasn't even a big deal.
It was like just a couple of hours, just kind of get something done.
It was like something a customer asked for.
So my job has become more varied.
I'm able to kind of see more and get more done and help other people get a lot more done.
So you get this kind of excitement and velocity increases.
And, you know, we have all those measurements and that's all kind of good stuff.
But just the excitement of waking up in the morning going like, I'm going to get a lot done today.
Like that is a fun way to go about your day.
I completely agree.
And I hear this over and over and over again.
I certainly feel it myself, which is this is the it brings me back to why I learned to learn to code.
It's like that same moment of.
I didn't learn to code because I like to type code.
I learned to code because of the magic of you running like hello world and it shows up somewhere.
And that feels so it's just a very creative experience, which leads us to my second question, which is I see all the time that one of the most impactful change agents inside an engineering organization can be a senior principal engineer saying, let's go ham on some AI code and the single most.
blocking person in the organization can be a senior principal engineer going I don't believe it absolutely not not me not here not no way and in fact last week I heard a story of somebody who had their most senior staff engineer quit says I and I quote I do not believe in AI I will not work in a place that does this so what is your appeal sort of engineer to engineer of of why to invest in this why why you think it's the way that engineer organizations are moving and how you kind of come to meet skeptics where they are um and hopefully see things a little bit from more from where kind of intercom is approaching them i mentioned that intercom kind of had it on easy modes um we didn't have to convince leadership that there's something to this ai stuff like we were pretty much had decided the direction of the company the weekend that chat gpt came out so so we already had this expectation that this will be transformative across many parts of our work, including all of building products and engineering.
We were just kind of mostly annoyed about how long it took.
But I think for sure it does need strong advocates and you need to push boundaries.
Like one of the biggest things that I've been able to do successfully was kind of push through the barrier of like, should we let an agent connect to Snowflake?
And all these things can go wrong.
Or should we let our agents run real production code in our Rails console over API?
And the easiest thing to answer there is like, well, you know, I'm not sure.
Or like, this is risky.
Or we should think about this.
But we've been largely pushing through it.
So now, like, not recklessly.
Like, we've lots of good controls.
And we're a mature business.
uh we have like i've been on our security team but definitely uh not trying to do anything uh too wild but there's still even then i have apprehension just like is this like i think i i think we should do this but it seems weird or it seems hard but then i just have to give myself permission and then i realize if i have to give myself permission there's loads of people out there who just need me permission and um Honestly, like one of the biggest things I do at Intercom is just telling people they can do things.
There's a pre-AI and post-AI or telling them like, look, whatever you do, just blame me if it all goes wrong.
And I guess maybe we can blame Claude now, but ultimately it's that like permission and just like there's a level of ambition which comes from as well.
It's like if you're out there saying, I'm not sure if AI is going to take our...
have a big role to play in all of our work.
And if you keep on saying that, that kind of will permeate through the culture and people say that.
But if you're very clear, you're saying that like, look, all work is going to be agent first, like at some stage in the near future.
And so we're going to figure out the path there.
And so we're going to break down every barrier as we come across them.
And look, it's your job, it's my job.
And if anything goes wrong, blame me.
Like that's largely been how I've been approaching it.
not just me like this has been a very large collective effort but giving that kind of permission thing but also the kind of like freedom to like explore or push things or whatever it's kind of necessary and look it might be a less stressful way to go about it to like just take a nap for a few years and come back and then with all the problems have been solved uh and we've got these perfect agents uh running amok in our environments then um then that that would avoid some of this but like i think All places have to get through that kind of apprehension and initial kind of issues that some of these can, some of the introduction of Agents Institute of Violence can have.
And I think our job as leaders, whether it's as an engineer or as a manager or whatever, it just has to be on that like enablement and giving people space to go deep on the work, enjoy it and like have that moment where things click and you start realizing like, oh my God, this is something that will transform how much I can get done.
Say it again for the people in the back.
I love I was like, oh, my gosh, I love this so much.
And, you know, it is absolutely those two things, which is like give permission.
You you can please just go, please, by all means, go ahead.
Designer, hit me with a PR.
No one's going to get mad at you.
Like, go ahead.
And then the second thing of just accountability can roll to the top and not in a scary way.
Let's not do irresponsible things.
But, you know, we've seen.
I've seen a couple incidents in the past month, some big ones.
And what you see is CEOs or big leaders coming out and saying, like, the team's shipping and we want to keep shipping and we're going to be careful with our customer data and we care for the customer experience.
And stuff happens.
We've learned from it.
It's ultimately on me.
I'm going to call the customers and we're going to we're going to move on and deliver great innovation for you.
And you know what I tell people?
to, you know, to get them over that hump, which is like, you really got to know what your existential problem is.
And I love what you said is the second that ChatGPT came out, Intercom changed because that is an existential problem.
Who writes the code in your code base?
Agents or humans?
Not an existential problem.
Like, will you be fundamentally disrupted by a new technology?
That is the real problem in your business.
So I always tell people, like, let's differentiate the real problems in our business from problems that we can tolerate.
And then go use the problems we can tolerate to move fast.
And so it sounds like you have a really good call.
I mean, I think at the end of the day, the results speak for themselves.
And again, you all are not asking me to say this.
Intercom has meant the moment.
You went all in on AI-assisted, you know, customer support and experience.
You're now building models.
And so it's not just a one and done.
ChatGPT is here.
We need to change how our product works.
AI-assisted coding's here, so we need to change how our engineering team works.
It's, you know, models are going to be how people differentiate.
We need to go there.
CLIs are going to be how people use products.
We need to go there.
And so I think this sort of, like, fearlessness and what I would suspect is, like, just a fun, nice, high-trust culture.
Good people.
You actually see the business results on the other side.
So I'm going to hype you up.
I see a lot of teams.
I see a lot of leaders.
And I think people can take a lot of inspiration from this.
But let's uninspire them really quickly before I get you out of here, which is my last question, which is when Finn takes 15 solid minutes on a live podcast to do a very basic task that you know it can do.
Or not Finn, win Cloud Code.
Yep.
What do you do?
Do you yell?
Are you a yeller?
What does your meta-analysis on this internal dashboard say?
The human needs to improve on.
I do lapse into giving claw code just like smiley faces or unhappy faces.
Not over the top.
I certainly haven't cursed at it.
Very polite.
That's kind of not my smile.
But I do like the odd kind of attaboy kind of smiley face.
I don't know if it knows, like, that I'm deeply thinking about this and, like, these little subtle kind of hints or whatever.
But, yeah, no, I think, like, professional with a few emojis is my style with Claude.
You know, hopefully that'll come back to me someday with an emoji.
Same.
I waste the tokens on telling it it did a good job.
I somehow, in my mind, I'm like, that's going into its own sense of itself.
And it's going to know what good looks like.
So I am there.
I am there with you.
All right, Brian, this has been.
One of my favorites.
Y'all, if you have gotten to the end, there is so much alpha in this episode.
I cannot believe it.
This is a cheat code to winning friends and influencing SaaS through AI engineering.
Brian, where can we find you and how can we be helpful?
I can be found on the internet at a nice vanity URL, which is brian.scanlon.ie.
And I got a few links here to some other talks and some other writing.
different bits and bobs.
As you can tell, I'm not a designer.
I asked Claude to design this as if I was a Unix systems administrator writing a little web page.
And it kind of shows.
I'm active on ex-Twitter.
I'm Brian underscore Scanlon.
I'm on LinkedIn, Scanlon B, or something like that.
I think I'm the most famous Brian Scanlon on the internet.
So generally, you can just push Brian Scanlon in.
That tends to work.
And I tend to be active and showing up to different conferences and just like getting good word out about what we do at intercom mostly these days ai but i've also given lots of talks about many other different topics and um yeah i'm also a big believer in just saying yes to a lot of things um so if you look me up you got a good idea you want to get in touch uh you want to run stuff past me or whatever chances are i'll say yes and we can i'll just keep on doing this until things break and then i start saying no so but i'm still not there yet so bring it on Great.
So search for Brian and ask him to do something for you.
That's it.
Well, thank you.
So thank you truly for sharing all this information.
People are going to get tons of value out of this.
It's going to be a hit for sure.
And I just really appreciate you joining Howie AI.
Of course.
This is so much fun.
Thanks so much for watching.
If you enjoyed the show, please like and subscribe here on YouTube or even better, leave us a comment with your thoughts.
You can also find this podcast on Apple Podcasts, Spotify, or your favorite podcast app.
Please consider leaving us a rating and review, which will help others find the show.
You can see all our episodes and learn more about the show at howiaipod.com.
See you next time.
