Overview
This episode of How I AI features Brian Levin (designer at Notion AI) explaining how Notion’s design team is shifting toward code-first prototyping to “encounter reality” earlier—especially for AI-driven product experiences that can’t be meaningfully validated in static mockups. Brian walks host Claire Vo through Notion’s internal “Prototype Playground,” a lightweight system that centralizes prototypes, speeds iteration with AI coding tools, and makes sharing and reuse across the team easier.
Key Takeaways
A core thesis is that great B2B SaaS design improves when it meets real constraints sooner: loading states, responsiveness, interaction timing, error cases, and “feel” become obvious only in a browser. AI coding tools (especially Claude Code) change the cost curve by enabling designers to prototype rapidly—sometimes close to production fidelity—without needing to hand off to engineering just to test fundamentals.
Prototype Playground’s power is less technical novelty than organizational leverage: it’s a single Next.js repo where each contributor has a namespaced folder, and each prototype is a standalone page with minimal shared structure. This creates visibility (you can browse what others are exploring) and reuse (you can “yoink” patterns/components from teammates). Notion also keeps shared “Notion-y” styles/components so prototypes resemble the product quickly.
Brian highlights a workflow principle: whenever the AI asks you to do something (check the browser, verify CI, lint, compare to Figma), you should try to teach it to do that itself. That philosophy drives automation via Claude plan mode, repository-level instructions (Claude.md), per-user local rules (Claude.local.md), and purpose-built slash commands/skills.
For AI product design specifically, Brian argues Figma is insufficient for simulating real model behavior. “Golden path” chat mockups break down when models stall, ask follow-ups, fail tool calls, or take minutes to respond—so designers need executable prototypes connected to real models/MCP tools to discover necessary states like progress indicators and error handling.
Practical Steps
- Create a centralized prototype repo (e.g., Next.js) with a simple convention:
/app/<person>/<prototype>/page.tsxplus lightweight metadata files to generate an index page of prototypes. - Provide shared UI scaffolds (sidebar, typography, icons, colors) so prototypes start “close enough” to your product without rebuilding basics each time.
- Add repo-wide AI rules in
Claude.md(stack choices, lint/test commands, file structure norms) and per-user rules in an uncommittedClaude.local.md(username, directory boundaries, “don’t edit others’ folders”). - Standardize repeatable flows with slash commands:
- “Create prototype” to generate folders/files/templates automatically.
- “Figma → code” that (1) checks MCP setup, (2) extracts the design, (3) implements, then (4) runs a verification loop comparing browser output to Figma until stable.
- “Deploy” to create branches/PRs, open the PR, and monitor CI until green—fixing failures automatically.
- Use “skills” for recurring failure modes (e.g., icon-name hallucinations): instruct the agent to search codebase synonyms and run scripts rather than dumping huge directories into context.
Notable Quotes
- Brian Levin: “You want your designs to encounter reality as early as possible.”
- Brian Levin: “Anytime the AI asks you to do something… try your best to see if you could teach the AI to answer that question for itself.”
- Brian Levin: “I don’t think you can design a good chat experience in Figma… you need to connect to real AI models and see where they break.”
Full Transcript
The way I think about designing B2B SaaS is you want your designs to encounter reality as early as possible. I've always been into prototyping. And then all of a sudden these AI coding tools come along and now I can prototype faster. I can prototype in production. So explain to us what this prototype playground is. It's just an XJS app. All of our prototypes are in one place. Seeing what other people are working on is really fun and interesting. And oftentimes you spot cool ideas and you're like, Ooh, I want to try that. The code is all in one place. It's just in one repo. And so I can just yoink cool ideas from other people's prototypes and incorporate them into mine. Every time somebody is like a little anti-AI assisted coding, I'm like, do you know that I used to have to walk uphill both ways for my CSS? It was not fun to do this. I mean, even just sitting here watching this, I still just find this magical. Welcome back to How I AI. I'm Claire Vo, product leader and AI obsessive here on a mission to help you build better with these new tools. Today, we have a designer centric episode with Brian Levin, designer at Notion AI, who's going to show us how he set up a prototype playground for the entire Notion design team to vibe code using Cloud Code, any prototype they need, this is a great one for someone looking to either shift their design organization into a code first prototyping mode, or learn some advanced techniques with Cloud Code. Let's get to it. This episode is brought to you by Work OS. AI has already changed how we work. Tools are helping teams write better code, analyze customer data, and even handle support tickets automatically. But there's a catch. These tools only work well when they have deep access to company systems. Your co-pilot needs to see your entire code base. Your chat bot needs to search across internal docs. And for enterprise buyers, that raises serious security concerns. That's why these apps face intense IT scrutiny from day one. To pass, they need secure authentication, access controls, audit logs, the whole suite of enterprise features. Building all that from scratch, it's a massive lift. That's where Work OS comes in. Work OS gets you drop-in APIs for enterprise features, so your app can become enterprise ready and scale up market faster. Think of it like Stripe for enterprise features. OpenAI, Perplexity, and Cursor are already using Work OS to move faster and meet enterprise demands. Join them and hundreds of other industry leaders at workos.com. Start building today. Brian, welcome to How I AI. What I am so excited about in terms of our conversation today is you're going to show us about how one of the best designed products out there, Notion, is being designed by people like you using these new AI tools like Cloud Code, so why did you make this shift to how you were doing design, what it meant to prototype, design, and build things, especially for a product and in a company who values design so highly? The way I think about designing B2B SaaS is you want your designs to encounter reality as early as possible. And, you know, if you imagine this gradient of like I'm scribbling on a napkin on one side to I'm shipping to production and showing customers on the other side, our goal as designers is to move up that gradient towards prod as quickly as possible. So I would say for most of my career, I've sort of biased towards being interested in programming, mostly at the prototyping level. I just find that when you're designing something in Figma and then you actually try it in the browser, in the browser, you notice a ton of problems. You know, all of a sudden you're clicking things, you notice loading states, you notice, ah, that didn't quite work on this screen size. So you encounter some version of reality sooner and you end up getting to a better design more quickly. So, you know, I've always been into prototyping and then all of a sudden these AI coding tools come along and now I can prototype faster. I can prototype in production. I can, uh, or what I most often do now at Notion is just prototype in a little internal tool we've built called Prototype Playground. And again, the idea is just like, how do we get something that's somewhat realistic in a kind of real environment, in our case, the browser, uh, as quickly as possible, and I think that just helps you move faster and end up with better designs. So explain to us what this Prototype Playground is and how you set it up and how you might use it. Okay. So Prototype Playground is nothing magical. It is just a Next.js project. So, uh, actually here to source apps and there's an app directory and you'll notice here in our app directory, where normally in a Next.js, uh, app, you would see pages, um, well, we've just namespaced every designer on the team or PM or engineer, whoever signs up and wants to use it, can just namespace, um, some directory. So here's Brian. And then every directory inside of that is some prototype. And so it's just the Next.js app, but each page is sort of a standalone. There's no global layout. There's no global, I don't know, like structure that you have to adhere to. And so what that looks like on the front end is this, this is what we call Prototype Playground. And it's just a list of prototypes ordered by who was working on stuff recently. So here's a few from December and then a bunch from November. And it's really cool because having everybody's prototypes in one place is useful on two dimensions. One, just from a visibility point of view, seeing what other people are working on is really fun and interesting. And oftentimes you spot cool ideas and you're like, Ooh, I want to try that. And then on the other dimensions, like if you spot a cool idea and you want to try it, the code is all in one place. It's just in one repo. And so I can just yoink cool ideas from other people's prototypes and incorporate them into mine. Usually, usually by just telling Claude to do that. Uh, yeah, I think before Prototype Playground, there was a lot of designers at Notion who prototyped in code. The difference was we were all creating our own repository, our own Next.js instance. And so it was hard to know where everyone's stuff was. Everyone was rebuilding it in different ways. Or if people were trying to recreate something that looked Notion-y, we were all doing it from scratch. So anyways, Prototype Playground, Next.js app, all of our prototypes are in one place. And then we have a few shared components and shared styles. So if you want to make something that looks Notion-y, you can do that pretty quickly. So for example, we have some templates here. I can show you like Notion UI is just a Notion-y sidebar. And actually this isn't even very Notion-y. I think at some point I slipped this new button in here, which obviously doesn't exist in the product. I don't think these things do anything, but it's close enough. If you're like, oh, I needed to prototype something with a Notion sidebar, I can just come in here and duplicate this template. And then we, of course, pull in a bunch of our colors, typography, icons, so that again, just getting to close enough Notion styles, uh, without a whole lot of effort. Yeah. And I want to call out for people a couple, many episodes ago, I showed how you could build a very similar Next.js app for yourself that had a combination of docs you were working on, Markdown docs you're working on, and prototypes in a very similar format where it was like, here's my folder of just stuff I'm working on. Very minimal shared components, very minimal shared styles. I like this too, because it's nice to have that team level organization. So you can pop in and see who your teammates are working with. I have a question from an operational perspective. Did you set this up? Like, was this like a passion project for you? Uh, did engineering set it up for you? Like, how did this actually get created? Yeah, I set it up with another engineer. I mean, it's just a Next.js app, but then operationally, just a few approvals. It's deployed on Vercel, so we had to like go through a little bit of process to get that project spun up, get a few people added as members. Otherwise, yeah, it's not that much. Again, it's just a pretty basic Next.js app, which you can literally use Cloud to like help me make a Next.js app, and it's just going to get you the default. I like keyboard hands. Everybody does the same keyboard hands motion where it's just this. I have one more question, which is, of the people now working in this repository, how many before were working in code versus this is their first repository that they've cloned to, you know, their desktop or deployed? Was the design team pretty technically adept already, so this was very natural, or were there some people that needed to be onboarded? I think so. I mean, to be honest, Prototype Playground is still really for me. Like, I think I use it the most. You can see here there's a bunch of other people that are creating things, but if you were to go through, I probably use it the most. I think there's maybe like five to ten people at Notion that use it quite a lot, and then a bunch of people who either have tried it and it didn't stick, and we can get into reasons why that is, or they're just not interested, or they prototype separately, right? Like, we still have people prototyping in Figma. We have people that prototype in their own code base still. They just prefer their own stack. Maybe they don't like Next.js. Maybe they don't like React, so they do something else. And I think all that is totally fine. In fact, one of the features I added recently was this ability to link to an external prototype. So if you prefer using v0, or lovable, or a Figma makefile, whatever it might be, you could just link to it here. And in fact, this is what it'll show up as in Prototype Playground. Just have this little external icon. And so you can click it and it'll open in a new tab. So in theory, this could be the Prototype Playground or repo for any prototyping tool. My hope is that over time, we make this thing useful enough that more people will want to prototype in it because it's just faster than those other tools. And we got to figure out how to lower the onboarding complexity for people who aren't technical before. So to answer your question, I don't know. I'd say some people who weren't technical made their very first code prototypes or AI-assisted prototypes in the Playground. But probably the majority of us that are still using it daily had some technical background. Got it. Perfect. Well, let's prototype something. I want to see how it actually works. Let's do it. OK, so there's a few ways to make new things in Next.js, right? Like we could be in Cursor and we could come in here and create a new folder and create a bunch of page.tsx and metadata files. And that sucks. I don't want to do it. So there's two ways around that. The first is when you're running in localhost, you can actually just click this button that says New, and you give your prototype a name and a description. I'll call this one How I AI. And then this is for fun. And I create that. And all that's doing under the hood, if we bounce back over to Cursor, is it just created those files for me on my computer. This is my favorite part. There's no back end for Prototype Playground. It's just all files on disk. And then we can just push all this to GitHub. So here we have a little metadata file. These get sort of collated to render the list on the homepage. We have an actual prototype file here with some code. And then this is kind of nice. Like it automatically gives you a button to open it in Cursor. So now I can just come in here and start prototyping. Now, typically my workflow is I just bust open Clod in the terminal. I know this isn't how you're supposed to use Cursor, but it's just how I do it. It's probably not even how you're supposed to use Clod code, but I just do it. We're just equal opportunity offending these two tools. I know. I know. Sorry, everybody. But this is how I like to work. And in fact, I have a little shortcut here where I can just press Caps Lock G. And then I can get these two things side by side in my computer. So I usually am clodding over here, reviewing the changes here, and then monitoring sort of the output over here. So let's see here. I want to make a prototype. And I don't know. Let's just come up with some contrived example. Like maybe you can help me think of a good use case. Can we make a prototype for, oh, like a little video and audio. This may be complicated. Video and audio like display module for my podcasts. Video and audio. So it's like video and then maybe like an audio player. Let's see. Opus 4.5. I think you could do it. Okay, let's try it. So normally, let's walk through like my actual workflow. There's sort of two steps. One is you can type a lot. That's not that fun. I do use this tool called Monologue where you can just talk to your computer. There's many products like this. I think Monologue is just nice and cute. So we can just talk and it's just much faster than typing our prompt. The second thing you'll notice with Cloud Code is I switched over to plan mode. I think it's really, really important to plan before doing anything. For whatever reason, you just get better outcomes. Now, the key thing about using plan mode is to actually read the plan. And I think this is where having a development background just gives you an edge because you can read the plan and be like, oh, that part actually doesn't look quite right. Whereas if you maybe don't have as much programming experience, it would be harder to tell that. But in either case, I still find that having the plan mode and creating some structure before actually writing code is better. So let's just do both of these things at the same time. So we're in plan mode and I'm going to invoke Monologue here and it's recording. And so let's say I want to build a new prototype in this howIAI directory. And we are a podcast and I want to build a detail page for a podcast episode that has both a video player and an audio player underneath. The page should have the title of the episode, a description. And how about if you hit play, there's little confetti that shoots up out of the player. And so we end that and now I will delete this and we plan. So I have to give you props on two things. One, I am also a plan mode slash like write your spec, write your PRD person, obviously. I think the second thing is I am still just such a read the code, read the outputs girl when it comes to AI. It's actually one of my challenges when I use something like Claude code or watch people use Claude code is if you don't do it inside a cursor or something that gives you this sort of I love your three pane window, you're like code window, your Claude window, your output window. Because I see people with like 17 tabs of Claude code going just accepting a bunch of changes and I have to read. I think this is also just like engineering development background where you can just spot things that make no sense in the moment as opposed to having to go back and debug something. So I am very much aligned with you on that. Yeah, it's helpful. And you know, this is probably obvious to a lot of people who are familiar with using Claude code. But maybe if you aren't like another piece that's really important here is getting the right context up front, right? Like we just typed in some prompts. But under the hood, I can show you we actually have some other files helping us out here. So we have a Claude.md file at the root of our project, with just some rough instructions around like the tooling that we use, like we use bun, we use tailwind. It has like a rough outline of the project structure. Another thing that we do is anytime someone runs the project locally, we create a Claude.local.md file. And that local file is not committed to the Git repo. So it's personal per computer. And it adds a little bit of extra context like, hey, this is my username in Prototype Playground. It tells Claude where my directory is. And it gives some instructions like, hey, you know, don't go around and mess with other people's files, like prefer to work in my directory. And a little bit more about the workspace structure and how like individual projects can be built. So a couple of those things are working under the hood here. And while you're accepting some of these Claude code changes and questions, I do want to call this out for folks, because I think people are pretty aware of the Claude.md global settings. But I think people forget that there are actually locally scoped versions of these that you can implement. And so it's really useful to get one version deployed to everybody that gives you your master rules for using Claude. And then you can set up your own custom one with your own particular preferences. And I think that's a really nice way to create a good collaborative environment where people are using a similar AI tool or agent to work in the repo. Totally, yeah. Okay, I don't know, we'll see how this goes. But it's gonna install some sort of confetti. It's gonna have a player, audio player. This is really awesome. Like it does a wireframe in the plan, which is crazy. And here, I don't know, we can just kind of skim this for the sake of this example. This looks fine. So let's auto accept edits. Now, I have a tip for people. Because I think when you spend enough time on Twitter or watching other people use these coding tools, people are always like, how do you get it to run for longer? You know, they find themselves constantly getting stuck, or the agent does the wrong thing, or it's asking for their input. And my philosophy on this has been, any time the AI asks you to do something, you should, before responding, try your best to see if you could teach the AI to answer that question for itself. There's a good example. Oh, wow, that was very fast. Ooh la la. Well, here, let's hold on that and see if the confetti works. Well, actually here, the example is, I've already taught Claude to like always lint itself after it's done, right? What's really annoying is when it builds a bunch of stuff, and then you go and look in your browser, and there's some error, right? So for example, I've taught Claude, hey, check your work. One, you can run commands like, what was this? Like eslint, right? And like, look for actual TypeScript errors. The second is you can give it access to MCP tools. So Playwright is one, the Chrome DevTools MCP is another one. And you can say, well, actually, you know, before installing this, Claude would say to you, hey, I've implemented your feature. Go take a look at it and let me know what you think. And remember, our rule is anytime Claude tells you to do something, ask if you can teach it to do that thing for itself. So I don't want to have to look at the browser every time to see if I did it correctly. So instead, I teach Claude, actually, you should be the one to go and open the browser. So it knows how to launch Chrome. It knows how to navigate here. It knows how to click the play button, look for confetti, make sure the audio is working, all that kind of stuff. And so now we were able to run this task for much longer without my input and actually get to something that is working well. I'm actually very impressed with this prototype. It's much more lovely than I thought it was going to end up. Much more robust. And the confetti looks great. The confetti looks great. Yeah. Well, here, I'll show you another example. This is, I think, where the power of MCP gets crazy. So let's actually clear this. We're just going to start a new conversation here. I'm going to just totally undo everything. Let's just start from scratch. So a couple other things that I've built in, I think, remember, I'm trying to make the onboarding flow as simple as possible for people on my team. So what Claude has is called slash commands. And you can just build these yourselves. And they're basically glorified prompts. But they can also run scripts. And so we have some slash commands in the project that help people get going really quickly. So I have one called Create Prototype. And then you can give it an optional name. So we'll call this one How I AI. And that's going to do the same thing as clicking the New button on the browser, which is what we did earlier. The difference, of course, is I don't have to click things. I kind of want to design this so that I basically live over in the terminal. And can you show us really quickly in your repo just how these commands get defined? Perfect. Thank you. Yeah, sure. So again, it's basically a glorified prompt. It has a name, a description, and then some instructions. So in our case, instructions so in our case we say Kind of how to come up with a name based on what the the user provides Tells it where to look to determine the current user's username how to create the new thing It actually provides some sample code to use for both creating the page and the metadata file. I think I need to Also approve this so it goes Let's just do Blank for now As well as creating the metadata, so, you know AI is Better with good context, but it's also really really good if you just provided examples of how to do things So the reason it's important to provide these code snippets is to show it what success looks like, right? Yep, if this was like Just instructions to create blank files. It wouldn't know what to create. So in our case, we're just showing it an example of success And we could probably simplify this. It's actually quite a long command, but but here we go So it created this and a blank piece of text. That's great So that's just one way to start you just type slash create a prototype and then and then that'll create but Maybe we have some design in Figma and we want to build this This might not work, but let's try it So we can connect to the figma MCP and I can just copy a link to this frame and say like let's build this this notion UI so Before you could just paste a link to a figma URL and try and manually invoke the figma MCP And it would sometimes ask clarifying questions and sometimes it would build it and then sort of stop halfway through I don't like any of that. So we actually built a command Called slash figma and it roughly does a couple of things The first is it actually checks that you have the MCP server installed and running, you know for people on the team who have never Done MCP stuff before they might not know how to do this And so it detects if you have it installed and if it doesn't if it finds that it's not installed it'll just teach you how to do it so it actually returns instructions to the user on how to set all this stuff up and Then it moves on to phase a designing Or extracting the design from figma, then it'll implement it And then the most important thing is we enter this third phase called the verification loop where it's going to open the browser and compare the implementation it created to the original figma file and I think my instructions are basically Keep looping until you've gone through like two loops where there were no more changes Oh, yeah here repeat until the implementation matches or after three iterations with no changes and then stop iterating so let's just see what happens this I would say it gets like 80% correct 80% of the time, but that's just that's just how AI is right now. I was about 60% Well, actually, you know I think it is 60% but this this command in this loop in these Instructions and like the pairing of the two MCPs actually gets us to 80% I want to call call this out for folks because one of the things that I find most Frustrating using MCPs even as a fairly sophisticated user is one You just have to use these like magic keywords to invoke the MCP and the right tool and the right thing And you know, sometimes I have one of the challenges I have is I have a lot of MCPs that use the same Tool names Because so much across SAS is named the same like everybody has the concept of projects Everybody has the concepts of pages or documents And so I like this idea of like force invoking a specific MCP Via a slash command and not even just force invoking that specific MCP But force invoking a specific set of tools in that MCP super super useful and then I will give you props for the Instructions at the top that teach somebody if you have no idea what you're doing here How do you even get this thing installed? That's such a nice piece to add in as user experience for a consumer of this Slash command that might not be you And so that's something that people should really really think about. Yeah. Yeah, I would say also It's funny because I've actually watched a bunch of these videos and looking even back at the ones from six months ago It's crazy How far the tooling has come and so I imagine that people who for whatever reason might be watching this video in six months We'll look at what we're doing here and be like, oh how naive it you know We've come so far MCP is no longer a thing or something like that, right? and I kind of feel that way now where MCP is It's like Not the best thing But it's the best we have so far right like it's very context inefficient Sometimes it runs forever. Sometimes it yeah, it just like blows up your context window, but it's the best we have right now So even just watching this right like here's our design that got built. This was literally just pasting the link to the Figma file No other custom instructions and now over here on the right. It should be I think I ran into an issue earlier. Yeah, but Something got busted with this. Let's try the Chrome Dev tools MCP again, I think I quit it midway through because it was detecting some conflict with the window Anyways, this is pretty good by default and then from here I'd iterate, you know Some things things you might notice would be like there's no hover states Some of these images are broken, but those are just an easy follow-up tasks. Well, and you're doing this from a kind of design perspective but think about how many engineers sit there and like pixel pullover Figma Pixel pullover Figma prototypes into the front end and you know, if you have a great design system Maybe that's easier to do but it's not what the 27 seconds that we just watched To scaffold stuff out. And so I just think you know the friction reduction in these You know asset to asset handoffs Which for my entire career 20 plus years in tech have been the most expensive part of implementing something where a designer gives you a design And then you have to get into the front end or the front end has to be hooked up to the back end all those little pieces can be Smoothed out and done much faster and then you can spend the time on the Optimizations the performance the how it feels how it how it works. And I think that's just really it's really fun from a builder perspective totally, it's so fun and Yeah, I mean even just sitting here watching this I still just find this magical right like Now that it's using the the Chrome DevTools MCP you like looped and fix the broken images and like created this checklist of stuff like, okay, everything appears to be right It's got this bottom bar. These things are obviously wrong, but we could go and fix those with the follow-up prompt But again, the goal is like can we get 80% in literally one prompt? I just pasted a link and it just iterated itself towards something that's roughly complete I know and every time somebody is like a little anti AI assisted coding I'm like, do you know that I used to have to walk uphill both ways for my CSS? Like yeah, it was not fun. I find this just Mesmerizing. This is so cool rising This is great This episode is brought to you by orcas the company behind open source Conductor the platform powering complex workflows and process orchestration for modern enterprise apps and agentic workflows Legacy business process automation tools are breaking down Siloed low-code platforms outdated process management systems and disconnected API management tools Weren't built for today's event-driven AI powered cloud native world Orcas changes that with orcas conductor you get a modern orchestration layer that scales with high reliability Supports both visual and code first development and brings human AI and systems together in real time It's not just about tasks. It's about orchestrating everything API's micro services data pipelines human in the loop actions and even autonomous agents So build test and debug complex workflows with ease add human approvals Automate back-end processes and orchestrate agentic workflows at enterprise scale all while maintaining enterprise-grade security compliance and observability Whether you're modernizing legacy systems or scaling next-gen AI driven apps Orcas helps you go from idea to production fast Orcas orchestrate the future of work Learn more and start building at orcas.io. That's ork es dot io Are there any other? Commands that you think are super useful. Yeah. Yeah, I can show you a couple So I want to scroll back up a little ways actually there was this step very early on Where you can see it was running over and over again the skill called Bunrun Claude skills find icon. What's that? Well, if you look over here in our design We actually have a bunch of very notion specific icons, right? Like we have this AI face. We've got home inbox We have all of the icons in our project. The problem is AI is really bad at estimating What the name of an icon should be? Or rather it uses like the most obvious name possible, which doesn't always match what's in code So for example, like this face icon There's no way AI would know what we call this or a very common one Is it will if you have like a search magnifying glass, right? It will just assume that it's called search icon when in fact in our code it's called magnifying glass icon and So this icon hallucination was getting really really annoying So I wrote a little skill called find icon and the skill basically says like anytime you're gonna implement an icon first go and actually look through the whole project, but also look for Synonyms or closely related words to the icon. So if you're gonna look up something called search icon Also try search for magnifying glass icon and it actually wrote a typescript Script to do this to just iterate through all of the the files in our icons directory, which is like 5,000, right? It's a lot So it actually be very inefficient for it to try and load all that up into context It needs to write itself a script to do more effective searching So in that that loop here Yeah, you can see it like Found it looked up Magnifying and found the magnifying glass icon it looked up in box and it looked up gear and trash in order to get all these things correctly Now this only this skill Had to exist after all of us on the team just got really really frustrated with it hallucinating over and over and over again It's sad because it obviously missed these bottom three. It didn't get them, correct But the fact that it got these on the first pass is a huge step up so the way I think about it is, you know, we have these commands that you run manually and skills are these capabilities that The AI should detect automatically and sort of use at the appropriate time and it'll know to do that based on The description and title you've given it So in this case find icon and then how to search for icons And of course the best part is just letting it do things programmatically on your computer by calling actual Coded scripts. So this was really helpful saves us a lot of time and just fixing imports and nope Search icon does not exist those kinds of annoying knowing steps Well, what I like about this is one This is exactly what you would do to like a junior designer or engineer onboarding You would like explain you'd like sometimes we call it search, but not really it's magnifying glass You just got to go find like the closest M&M and the ability to be able to describe that to an agent or a skill or a tool and then let it do it Programmatically for you is really useful We do have a how I how I a I episode on Claude skills But one piece we don't go into in detail Which I think is really important is Claude skills can be bundled with scripts. And so the ability to combine both you know natural language prompting which is in the skill dot markdown with a Set of programmatic tools in terms of scripts is a very powerful combination and Claude's very good at making these So, oh, yeah, like all this like I did not type a single line of code in this right? Like this is a hundred percent like hey I just need this problem to be solved create a skill for it and then Creating that skill also create a script so that you can work more effectively like this is a hundred percent prompted Show us your your last command because I think this is a really useful one. Okay, this is fairly new I think I merged this last week going back to sort of the problem with prototype playground. It's still a next JS app it's still react and typescript and git and branches and It's just a lot of concepts to throw at someone who maybe is used to only Prototyping in Figma or they're intimidated by a terminal or code and so I'm trying to figure out like how do we make this thing? more approachable, how do we Make it easier to onboard but also not Dumbed down right like I Want people to learn how to use computers? I want people to even subconsciously absorb the ideas of git and branching and pull requests and merging So, I don't know the best way to do that. But my first attempt is to create this skill called or this command called deploy and deploy does Basically two things the first is it like goes through prerequisites and makes sure that it has the github CLI tool installed on your computer and that you're authenticated and if you're not it like walks you through those steps how to do it and Then the second step is It will just walk you through Step by step how to get this prototype. You've just created Deployed so that you can share the link with someone on your team Let's see what happens I'm gonna try it now I'm gonna hit deploy and we'll see what happens There's a couple of really cool loops in here that I think save people a lot of time So we can see it going through the prerequisite steps here. It's making sure I'm logged into github Now the first thing here look it's looking to see if I'm on a git branch. It notices. I'm not I'm on main and It shouldn't be doing that right like we never want to push to main So, I think what it should do is help me create a new branch And We'll see if it actually does it correctly it's also trying to find some typescript errors and it's gonna run some tests I Basically told it to do all this stuff because it's really annoying if you push code to github Wait for all the checks there to pass if they fail then you got to come back to your computer fix stuff Okay, great. So it created a branch Now it's staging our changes branch name perfect Creating the commit I'll give I love this. This is a great idea I will also give my just like hack to learning git for anybody who hasn't used it I just love the git get github desktop app It just like it gives you buttons for all this you can see your divs you might create branches with buttons So I think this is awesome. And if you are intimidated by the command line There's like literally a beautifully designed desktop app that you can that's true. It's pretty nice Well now check this out so It's created the PR and in the instructions. I've told Claude Hey, whenever you create the PR open it in the users default browser. So now we have our PR open here and This check to deploy it to her cell will fail But that's okay because I give it one more step here and all this red looks scary, but it's not I Tell Claude to just monitor the CI every 30 seconds or every 60 seconds until all the checks pass And I tell them the specific checks that I care about and if any of the checks fail Just fix yourself and then push the changes. So, you know if people push something to github and there's a typescript error They'll see some error over here in the github UI, they'll take a screenshot They'll send it to me on slack and be like, why is my thing not working? I want to just avoid that entirely and You know going all the way back to my first principles Like if the AI is asking you to do something like check the PR or tell me the CI status You should really be thinking about how do I teach Claude to just do that for itself? So over here this slash deploy command Literally is just end-to-end. I just sit back and watch it loop over and over and over again Checking its its commit status its CI status making sure everything works and then when all of the check marks over here are green the script will stop I Think this is pretty awesome. I Feel I hope it lowers the barrier and like the intimidation factor of having to learn all these tools But at the same time, you know, if you are curious, you can just sort of read along and understand what's happening It's like instructed to communicate in clear English what it's doing My favorite part of this and it's not gonna be what people think I think The slash command is amazing. I think running through all the Pre-projects great. I love that. You just open it up in a browser window It's one of those things that you know Even if you created the branch created the pull request said it was ready to go people are like, okay Well now now what do I like now? What do I do and just forcing open the browser window and saying like this is where it lives on github. My question is Do you have to get your code reviewed in prototype playground or do you for prototype playground? No I mean people can always ask for it, but no we pretty much just yolo merge I think I mostly check for is like did my PR accidentally mess up someone else's prototype? Yeah, but again like that happened a couple of times and that was annoying So then we created this cloud local file that's like important do not do this You know and that seems to fix the the problem. So Yeah, a lot of Yolo enable auto merge and of course, it's not perfect. I don't know It's it seems to be hallucinating some stuff here like it thinks it thinks these past even though they haven't I don't know it It's close So I'm just gonna zoom out everything that we went over you created a shared repo for your entire team where you could have name Level directories no database. We're just using metadata JSON and and and Shared code to put different prototypes inside this repo You set it up with both global Claude rules as well as local Claude rules plus Claude commands and Claude skills to sort of guide people along common paths My favorite one is gonna be figma to code. It's so beautiful. It's so Good and then the number one rule that I've heard from you today is When asked to do something by Claude Teach Claude to do it. It's Yeah So you have this amazing prototype? Playground you've set all this stuff up How has let's let's just do a couple lightning round questions and get you on your way And my first one is how is this shift from? Doing things, you know Maybe exclusively in figma or in these lower fidelity prototype models to really leaning on things like Claude code Code based prototyping. How has that changed the design? Team has it changed a small part of the design team Do you feel like overall things in the organization are shifting in in a way? How do you feel like it's changing the way people work together? I still use figma. I probably still spend 60 to 70 percent of my time in figma, you know, there's just certain things that you're making That don't need to be in the browser. They don't need to be coded up You can just look at it and be like, yeah, that's roughly right. We should just ship that I Find that as You're designing for things that use AI that is not true, though, so for example if you were building a Chat bot or in my case I work on notion AI I Don't think you can design a Good chat experience in figma You can design what the chat input looks like you could design a little chat bubble and a send button and like a drop-down For model picker. I think all that's fine in figma But what you can't design in figma is what it actually will feel like to use that thing to use that thing. I probably should have said this at the very beginning, but the reason Prototype Playground existed is because when I started working on Notion AI, I was literally designing conversations in Figma. It was like, the user is going to say this, and then the AI is going to say this, and then it's going to work perfectly and create a page or a database. And you mock these golden paths in Figma, and then the engineers go and they build it, and then it just doesn't work that way, right? You send a message, the AI gets stuck, or it asks a follow-up question, or it does the wrong thing and you need to correct it. And Prototype Playground was, for me, a way to connect to real AI models and just start feeling out like, okay, how are the models going to work if I submit this kind of prompt? What happens if I connect it to the Notion MCP? It doesn't even know how to create a page. What happens if it runs into an error? Oh, right, we need to design an error state for this. What happens if the model is thinking for two minutes and the user's staring at an empty chat screen? What should we do in that intermediary time to help them feel confident that it's working, that it's doing the right thing? Is there any way to show incremental progress? I just found those things very, very hard to design in Figma. So to go all the way back to answering your question, as more and more people are designing apps that both are for AI or incorporate AI in some way, they're going to need some other native, code-first way of working to actually understand what the models can do. It feels honestly kind of bad. It feels like a lot of wasted time where every month the whole freaking industry has to learn like, oh, what are the new capabilities of this model 4.3.2 Max Pro? And then a month later, it's all irrelevant because the new thing has come out and then you learn that. It feels like a waste of time. Unfortunately, I think it's necessary because the model capabilities are still advancing quite steadily with each release. And it's really important as designers to understand what models are capable of doing so that we can create product experiences and designs that sort of live right at the edge of what the model is going to be able to do well. What's really frustrating is if you design something that's like, oh, a user is just going to ask for a cool website and it's going to be this perfect output website on the other side. Models can't do it, right? Or they require a bunch of fine tuning and sort of like intermediary prompting to get that right. Designers just have to know what's going on under the hood there to design something that's plausible and possible. So I suppose the more products incorporate AI, the more designers will have to shift to thinking sort of prototype first, but probably prototype first with actual code under the hood where you can incorporate modern models and see where they break and see where they're good and see where they're bad and actually form an opinion about which models are good for which things, that kind of stuff. So speaking of which models are good for which things, and you're using my current fave, my babe, Opus 4.5. Why Claude Code? Why cursor in this non-cursory configuration? Tell me how you arrived at this as your tool stack. I need to play with more of the cursor stuff. I actually think cursor agent mode is pretty awesome. I've like clearly tried it a little bit. I just haven't gotten that far. The thing that I still really appreciate about cursor, I actually technically use both. Like if I have, I don't know, like some file and there's there's like some error here, I still really appreciate being able to just hover over the error and there's a button that says fix in chat. That's still faster than like copying and pasting it down into Claude Code. So I actually use both a little bit. I just think Claude Code does the best work. I don't know how else to describe it. There's this weird feeling as you use all the different models for different things. It's like to different people, they just feel right. For me, Opus 4.5 is just insanely good at doing what I want and I like the way it approaches problems. I like the way it plans. I like the way it executes. I like the way it communicates back to me and the follow-up questions it asks. And then, you know, why not use Opus 4.5 in cursor versus in the terminal UI? I don't know. I think this is just purely personal preference. Like some people look at this and they're like, this looks like shit. Like give me buttons and UI and components and drop downs and things like that. And for me, I don't know, this just feels nice and easy. It just feels good. As our friends over at Every Say each model has a mouth feel. Yeah, yeah, exactly. Claude Code and Opus have a good one. Okay. And then my very last question, because you seem like an expert prompter, but when AI is not listening, when it's not listening, when it makes up, you know, CI checks that passed where it didn't actually pass, what is your prompting technique? Basically, I noticed there's a direct correlation with how good of things I can make and how tired I am. And if I ever get to the point where, man, Claude just sucks, it's doing the wrong stuff. And I go back and I reread the thing that I said, I realized I made no sense. And so the best solution for me to write better prompts is like, go to bed, try again tomorrow. Which, I don't know if that's a compound answer. It's not actually writing better prompts. But, you know, your output's just directly correlated with how good of context you give the thing. And if you're giving it sleepy, tired, lazy, please fix this type commands, it's going to do bad work. I don't know if this is what you intended, but you gave me very good both relationship and parenting advice there, which I'm thinking about. I was trying to ask my kid this morning to do something. I'm pretty tired. And I clearly, the inputs, we're not going to get the outputs that I want. Well, it's easy. I mean, just go take a nap. Can't you do that at any point that you need? I love that. You know, one of my favorite little agents, Devin, does have a sleep. You can send the agent to sleep. Maybe we just need, we need the agents to send us to sleep. Well, Brian, this has been awesome. Just a deep dive, I think, in a very forward-looking view into how design teams, as you say, especially ones that are going to be building AI products, are going to start doing their work. So where can we find you and how can we be helpful to you in Notion? You can find me, I'm mostly on Twitter or X, Brian underscore Levin, or my website, brianlevin.com. And then I work on Notion AI, and I think it's genuinely one of the few useful sort of knowledge work agents. So if you haven't tried it, try it and send me feedback. We're always trying to make it better, help it do more things better, faster. So try Notion AI. Yeah. And we're big fans of Notion AI, too, here at the podcast. So definitely give it a look and definitely give some feedback. And we will send it directly to Claude and put it in Prototype Playground. Brian, thank you for joining HowAI. Thank you for having me. Thanks so much for watching. If you enjoyed the show, please like and subscribe here on YouTube, or even better, leave us a comment with your thoughts. You can also find this podcast on Apple Podcasts, Spotify, or your favorite podcast app. Please consider leaving us a rating and review which will help others find the show. You can see all our episodes and learn more about the show at howiaipod.com. See you next time.