
Loading summary
A
It all comes down to having a very tight and manicured process which thankfully I have spent my 10 gajillion hours in midjourney and Nana Banana and everything to figure out exactly what that is. So you're not pulling your hair out prompting all day.
B
One of the things I like about the mood board is it's a visual language to explain to midjourney what you're trying to do. The picture is worth a thousand words. Like literally a picture to an LLM is worth a thousand words.
A
Mentioning like Vogue or high fashion or even like a different artist name is a great way to tell the model a ton of stuff. Stuff without actually having to tell a
B
ton of stuff in the past. Brand and creative directors or agencies will give you these photos and be like cool, call us and re up when you want more photos. What I love is that you're like look, you're going to value me for all this upfront work that I'm going to do to define the space, give you these codes, really give you reference images and then now you can go do this for yourself. It's just like a very different model of providing service and I think it creates a really positive collaboration between the client and the Creative director. Foreign welcome to How I AI I'm Claire Vo, Product leader and AI Obsessive, here on a mission to help you build better with these new tools. Today we have an aesthetic episode with Jamie Gannon who is an AI Creative Director and is going to show us how to create consistent, beautiful and unique brand assets using Midjourney, Nana Banana, Flora and more. This is a workflow we haven't seen yet and goes into incredible, incredible depth on how to create awesome brand assets that you can use to uplevel all of your designs. Let's get to it. As an AI founder, you're used to sprinting towards product, market fit, your next round or that first enterprise contract. But speed isn't enough for AI startups. Buyers expect security, compliance and transparency from day one. That's why serious AI startups you use Vanta with deep integrations and automated workflows built for fast moving AI teams. Vanta gets you audit ready fast and keeps you secure with continuous monitoring as your models infra and customers evolve. AI innovators like LangChain, Rider and Cursor scaled faster and closed bigger deals by getting security right early. With Vanta, listeners can claim a special offer of $1,000 off Vanta at Vanta.com/how I AI Jamie, thanks for joining How I AI I am having you on the show for a very selfish reason, which is I think I'm the only pink AI brand in all of SaaS. And when I saw your work, I was like, oh my God. I need this lady to teach me how to create brand imagery that is beautiful and fun and girly and whatever magic she has. I need. So talk to me about how we can get amazing images like what you're showing us right now. I think consistently is the most important part because I can get a one off image that's this great, but I can't get this brand portfolio. So you gotta tell me your sorcery.
A
Yeah, I mean, it all comes down to like unfortunately having a very tight and manicured process, which thankfully I have spent my 10 gajillion hours in mid journey and Nana Banana everything to figure out exactly what that is. So you're not pulling your hair out, prompting all day. And yeah, I would, I would love to show you.
B
Great. So where do we, where do we start? When you teach people how to do this, you know where, what's, what's step one.
A
So first thing that I always do is I'm going to start in either Pinterest or Cosmos and I'm going to create a mood board that is the general vibe of what I want. So for this exercise, I wanted to have like a very pink and cute, but still kind of like not super girly, very Internet kind of coded aesthetic. I like especially with AI doing like juxtaposition. I think that's really fun. So we have like an orange with piercing. We have like a fluorescent fruit dog on a computer, things where they shouldn't be. We have like a grungy unicorn. So I thought this was like a really cool aesthetic to start with. And I typically start. There's two ways that I usually start. I will either go in with a mood board in mid journey and you can basically just copy and paste your images in, or you can start by adding them as srefs, as you can see here. So srefs basically are style references and they kind of do exactly what they sound like. It just tells midjourney to take the overall style and coloring and camera treatment and vibe, if you will. And it like tells it to apply that. Now one of the first prompts that I tried to create this aesthetic is I use that mood board that you saw that I made. And in this part of my process, in the create part of my process, I'm just trying to get information. I'm trying to figure out like what, what are the Images telling AI. What is my prompting? Telling AI. What's the mood board telling AI? And I just want to generate very fast, so I'm not very precious when I'm doing these prompts. Like right here, I just have beautiful female model. I have astronaut. You can see I'm using that mood board here. But if we remember, like, the original mood board and those final images that you guys have a sneak peek of, we can see that we're, like, very far off from where we want to be. And. And I think where a lot of people that are starting to use AI and get kind of tripped up is like, if you've ever just, like, raw, dogged, generated something in, like, Mid Journey or chatgpt, like, this might be, like, great to you and like, some of these images standalone are really cool to me. But if we're working for, like, a client or we're trying to be consistent with the brand style, we need to be like, really, really honest with ourselves on, like, does this actually look like that vibe? And truthfully, it does not. So I think that there's going to be a better approach to get things going.
B
One thing I want to call out, if you could go, go back, is I think one of the things people lack when they're working with more of these truly creative generative AI tools is they lack language. And so one of the things I like about the mood board is it's a visual language to explain to Mid Journey what you're trying to do. What I like about the style references, which we've done a couple episodes that have reference style style references. In terms of Mid Journey, we've done one with Zach, the creative and design lead at Gamma. We've done one where we were looking at style refs for more photography styles. So these are just alternative languages to tell midjourney or another tool, something specific about a visual aesthetic, which I feel like a lot of people just aren't trained. If you're not trained as a designer, if you're not trained as a photographer, you don't have. And so I love this. A picture is worth a thousand words. Like literally, a picture to an LLM is worth a thousand words. Uh, the other thing that I want to do, if you go to your mood board versus what was generated, one of the things I want to call out about this, where you're comparing the mood board to the images, is somebody who has been a designer and has been around photographers. I bet you can see this and say, okay, like, the saturation and the contrast on these photos is not as high as they are in the generated images. Um, there's this like washed out vibe on some of the photography on the generated images. And so one of the tricks that I wonder if people might think about is you can actually upload this to like a, like a chat GPT or cloud and you could say, explain to me why the photos don't match the mood board. And so, you know, you are probably an expert at this and have language to figure out why it doesn't match. But if for folks that are trying to teach themselves language, that's just one trick I think is really useful, is throw this in and say, hey, ChatGPT, explain to me why these top four images aren't in the same style as the bottom. And that can sort of give you a seed. Seed to start.
A
I try and avoid prompting at all costs in my process. But like, for this example, we're going to go like super, super simple with like using srefs and mood boards. But yeah, if I'm in one of these, like I'm doing some insane editorial work for clients, it needs to be consistent across like a hundred images and like foolproof to like consumer eye. That's when like getting really into the details, especially with models like nanobanana can be super helpful.
B
Great. Okay, so this doesn't match. What do we do?
A
Basically what I kind of glean from this is like the mood board is not doing its job. It's not communicating the vibe properly. This is something that happens a lot with midjourney mood boards. There's not a ton of like documentation from midjourney on exactly how it works. But as creatives, you can tell like the more kind of consistent a mood board is. Let's say it's like five images of like fuzzy 3D cats. You're more likely to get an image of like a fuzzy whatever you prompt when we're doing more generalized vibe stuff like this midjourney can tend to average things out with the mood board. And I find that using srefs as the mood board instead essentially can give much better results. So this was sort of the next step in my process. I wanted to try the SREFs and see if that made it better. And you can tell we're definitely getting better contrast. We're getting a little bit more kind of aesthetic and edgy, a little bit more of that like 2025 aesthetic that we want. But it's pulling really, really green for me. So what I ended up doing is I just removed that green eye SREF this is something that comes with intuition. You know, I've been using Midjourney for, like, three years, so I kind of can, like, understand, like, where. I don't know. It feels like sorcery to me, but I can understand, like, where the LLM is pulling certain things. And I knew that if I removed this green, it would solve a lot of my problems, so I did. And as you can see, we're starting to get a little bit more neutral in tone. We're also, like, a little bit more zoomed out too, especially with, like, the people photos. So now I know I am on the right direction.
B
Can I ask a question real quick? When you say I'm using the srefs, where are you getting those from your mood board or how are you actually deciding what the SREFs are? Just for people who are less familiar
A
with Mid Journey, I'm just using literally the ones that were on that original Pinterest mood board. So you can just copy and paste that image and put it in there. And then as you copy and paste things in, it'll like, save in a library for you, like, forever. So if I wanted to bring that green one back in to keep trying stuff, it's, like, already there for me.
B
Got it. So instead of the SREF codes that a lot of people and we've talked about in the past, you're using literally the UI to just drag in images as style references, and that for you sometimes gets you better results than just using the general mood board process.
A
Yeah. Yeah.
B
Cool. And then I have one more question, which is do you have some go to, like, test prompts? Like, I love the astronaut as a prompt because there's a lot of ways you could generate an astronaut. Do you have some, like, go tos that you run through when you're doing a mood board, or does it really depend on your client and what you're working on?
A
Yeah, I would say I love doing, like, ethereal female model for some reason. I do, like, I do that a lot. Yeah, this is like a crazy silver one that I was doing. It tends to just like, give sort of like a. A more elevated vibe than just regular model. I think I do cats a lot because there's just a lot of, like, texture to work with.
B
The other thing I would say is that there is probably a lot of training data of cat pictures on the Internet. So Mid Journey can probably do a pretty good job with a cat.
A
Yeah, running too. I find I feel like I do that a lot. Oh, God. This is like my first generation Isn't that crazy? That scrolled all the way down there, but yeah, I'll do running a lot. Or like, runner. Yeah, anything with, like, astronauts fun too. Anything that's, like, specific enough to kind of like, give you the vibe. I'd say, like, vibe. It's like, kind of nonsense.
B
Broad enough that those different styles kind of apply. Okay, so. So you've used these style references. You're getting a little closer. What's the next step?
A
Yeah, so I consider this part of my, like, create step. And again, the goal is just to, like, get some, like, information. Take like, this should take like 10 or 15 minutes to start the create process. And then what I'm going to do next is I'm going to move on to iterating. And that's what we're going to start, like, getting a little bit more specific with our process, maybe slightly more technical, and then starting to combine more styles to get like, your own unique style and get better consistency. Um, so what I'm going to do, particularly in this process is I'm going to start to bring in some personalization codes and other mood boards. So what ended up helping me get these results is my personalization code that I call late 2025 aesthetic. Now, personalization codes, again, it's kind of like a little bit of mystery how it works exactly under the hood. But when you're creating a personalization code, majority is going to put you through this, like, endless flop matrix of images that you're either going to vote one or two or skip, and basically you're just telling it what you like. And you can have like, as many profiles as you want. So for like, this profile in particular, I was trying to think. I was trying to rate images that were of a 2025 aesthetic. So, like, more iPhone style. And we can see here, like, you'd pick that one. Yeah, all right. I skip a lot. That's another thing when I'm giving, like, advice to people on personalization codes is like, there's a lot out there on, you know, like Mitcherny says, like, don't skip that much. But some people are like, skip as much as possible. Some people, like, skip a medium amount. I tend to skip like a medium amount and only pick things that I would like if I generated it. But I do find there's kind of like style pleading. So, for example, let's say I like, like the quality and the colors of this image. If I like a bunch of images that look like this, I might get a heavily influenced style that, like, wants to be painted. Yeah. Or wants to be like this, like vintage or renaissance, whatever aesthetic. So that's something to consider as you're going through this.
B
One thing I want to call out for folks that is a little bit of a side of this, this particular flow, which is when you're building an AI tool yourself. I have not seen this, like this or that personalization flow in a lot of AI tools. And I think it's such a good way to like fine tune whatever you're going to provide to your end user. And so a lot of times we get these like end A B tests and like a ChatGPT prompt. But this is so interesting to kind of put this up front and say, okay, let's spend five minutes telling me what you like. Then we can be more confident that when you have a downstream experience in my, in my AI tool, it's going to look great. Okay. And you can create as many. I've never again, I have not used mood boards or personalized. I'm just like YOLO up in there in the main chat, so. And you can create as many of these as you want.
A
Yeah.
B
To, to use. Cool.
A
Yeah. Unfortunately you can't like go in and like edit them the same way that you can with mood boards, which is why you gotta be good at like naming, trying to understand like what you were doing six months ago when you spent two hours ranking images. But I have my own crazy system that's just a quick brief on like what personalization codes are. So what I started to do. Let me go back to these like other images. I, I just felt like these could use more of my own style and I felt that they were a little bit over stylized. Like I do enjoy the pink, but I was curious to see if like maybe we can get some like other influences in there. So by adding my personalization code, I was able to just get like more of a depth, I would say, also because I wanted it to be like very crisp and like modern. You know, you start to see like better skin.
B
That one to me looks like very a good match for some of your earlier ones. That one's the one that stands out to me.
A
I mean it's kind of, it's, it's nuanced, but I'm liking where this is going now.
B
Hold on. And I know it's just you and me and little astronauts in the middle.
A
Yeah. Yes. Yeah.
B
So you are combining some of your style refs and this like personalization model on top of each other. You're getting way closer to what you want in terms of iteration. So how do we take this to the next step?
A
Yeah, so I want to show you guys some, like, of the other mid journey techniques and ways again to like, start prompting and kind of like upping the ante on, like, what we're making. So as you can see here, these prompts are no longer just like, women or astronaut. We're actually starting to give some, like, aesthetic thought. So one prompt. I actually found this on, like, the explore page a couple weeks ago, but it's days editorial photo shoot. So Day is just like this really hip magazine if you're not familiar. So doing stuff like that, like mentioning, like, Vogue or high fashion or even like a different artist name is again, a great way kind of in the same line of like, a picture's worth a thousand words to tell the model a ton of stuff without actually having to tell a ton of stuff. So with like, a day's editorial is like a really famous, like, asap rocky cover. It's like super gritty, high contrast, all these words that, like, are really hard to find even if you are like a professional photographer. But when you just say days editorial, depending on how, like, famous the publication is, midjourney's gonna know what you're talking about. Or, you know, Vogue especially, it's gonna know kind of like the level of the highlights. And okay, we're gonna be doing fashion stuff. So I love doing that same thing with the word editorial. In this case, I say a woman, a woman in her mid-20s, deep in thought, up close macro photo. So not the most crazy prompt. I think everybody knows what a macro photo is up close. It's not like, you know, center frame, zoomed in. It's very human language that I'm using. So what I did for this prompt is I wanted it to really have the vibe of this original sref we have here. I wanted to make sure it was like, the same composition. So what I did is I used an image reference. Image references are kind of tricky because definitionally, they just structure your composition. But a lot of the times, like, the structure of an image and the composition kind of is the style. So I kind of don't like to separate style references and image references. If you have a woman posing, like, really sexily and you use as an image prompt that stylistically is going to influence a lot of what's happening in your image, it's going to immediately be more editorial or sensual, et cetera. But anyway, I use this as an image reference because I want that profile. But obviously as you can tell, were getting immediately, which I don't hate some of these. But my intention was more of like a thinking photo. So what I did is I literally just zoomed in, like in the Mid Journey ui, I cropped it and then I just like dragged it back in and then I ran the prompt again and then we got something very similar to what we were working with here. Especially in terms of like image quality, the colors better or worse. It's also kind of like mimicking her like bone structure, especially this one. You can see like we have like these contours. So image references will give you a lot in terms of style.
B
One thing I want to call out that I've noticed in two year flows is on the we're getting too much green. There was just such a compelling element in that green eyeshadow photo, which was like half of the eye was very green. It was clearly the most obvious thing about the photo. And with this image, the most obvious thing about the photo is she's blowing a bubble of gum. And so what I like is in both of those, you're like just boot the thing that is so obvious and so overwhelming. And you can either do that by kicking out the image or you could do that by cropping out the part of the image that is pulling the rest of the generations down. So that's a really clever technique and I love that. You're just like. You're like, I'm just going to screenshot and drag it over. I'm not going to like try to prompt it to say like, remove the bubble gum or whatever.
A
Yeah. Oh my God. You will spend all day in Mid Journey. You know, even if you're using like something you can do is like the no feature. You can say no bubble gum. You know, you can try and like say like lips are showing sometimes it'll work. Most of the time it won't. If you really want to get specific, you could take this into Nando Banana and say like, remove her bubble gum. But I don't think it's going to get.
B
Well, that's what I was going to say. And I know we're going to get to Nano Banana in a minute, but then also if you move it to Nano Banana, you are going to wait like 45 minutes for that photo to come back and like waste a bunch of credits and stuff that you just don't need. Right. This.
A
Yeah. That's another thing to like avoid doing all this SRAF and this, like this. The profile codes. I don't have to prompt, like 2025 esthetic, you know, like, model esque. Like, it already knows that I'm going to have, like, a very conventionally attractive blonde woman because I gave it a conventionally attractive blonde woman or vice versa. I don't have to type in the colors and the grading every time. I don't have to do those horrible JSON prompts that no one can use. So this is like the laziest way ever.
B
I love it.
A
To do prompting. Okay, let's go into some, like, more specifically prompting. My arch nemesis. So one thing I wanted to do again, I like the juxtaposition of, like, the images on the original mood board. I thought it might be fun to have, like, a deer in a New York City apartment. Kind of like bringing like, the forest into the city. And that could be cute. I got these images, like, they're definitely on par in terms of, like, general style, but I was kind of missing, like, the New York aspect of it. So instead of like going and maybe finding like a New York City S raf or like finding an image of an apartment, putting it as an image reference, which potentially could have worked, this is one time that I do actually do some prompting. So all I said was like, New York skyline can be seen in the window behind it. So again, not doing anything super crazy with the language. I'm literally just saying what I want to see. Another example, I thought the pink couch was a little bit too fantastical. So instead I just said on a matte black leather couch. And now we're starting to get. We know it's in New York. It's a little bit more kind of like realistic, I guess. Couple other things I tried was I said at night, I think kind of giving me at night, maybe sunset vibes. And then one, the craziest I'll ever get is I have a list. This is included in my course of, like, all these different cameras. So I have like, dslr, I have mirrorless, I have digital. I have filmed as kind of like quick shortcuts because that's really hard to remember all of them. And the aperture and stuff. I barely know what aperture means. So sometimes when I just want to try changing up the vibe or make things more realistic, make them have like a more 90s aesthetic, I'll just paste in like a camera that mimics that. So the Sony RX100, I think I probably generated this with like, ChatGPT. I'm assuming that this is a 90s digital camera. And this is kind of like what I would consider the final Image So we know maybe the legs need to be fixed, but we know it's in New York. We can see it's in like a high rise. We have that couch. It's a deer. Oh, another thing too. To know what the prompt is like saying luxury New York City apartment. Instead of saying on the fourth floor of a high rise in a new post war New York City apartment, you're saying luxury. Because like everybody knows and the LLM knows or not. Alum. The AI knows luxury is going to be like a big metal super high up and retrieve will live on the top. So that one word, luxury, just gives us everything that we need to know about the scene, which is very fun.
B
What is great about this particular prompt is it reminds me of what Ravi in an earlier episode did, talking about midjourney for generating images again. And he's like, you need the subject and you need the setting and you need the style. And so you have the subject, which is the deer, you have the setting, which is this luxury apartment at night, and you have the style. And he did a very similar thing, which is like cameras are cheat codes for styling. And again, what I appreciate about you're doing, and we love to hear on how AI is, everybody just wants to be lazier with their prompting. No one want. I mean, if you go to the Explore page on mid journey and you look through people's prompts like, people need a job, man. Like, this is too long. And so I love the idea of you're just trying to find shortcuts for yourself to make these shorter and shorter and shorter, but still get the same quality because you're generating a lot, a lot of images.
A
Oh yeah. Oh, yeah. Like sometimes like thousands a day, depending on what I'm doing. If I'm lucky, I nail things really fast. Like, like this aesthetic I happen to get like very quickly. Whole point of it being 30 minutes. But yeah, if you're trying to do something that needs like a lot of different aspects. Like I was doing a stock photo project recently. I'm doing like nature stuff, I'm doing people stuff, I'm doing skin. So, like in this case, like, I can't really have just like one mood board because there's a lot of different things that we need to do in terms of treatment. So, like being able to get like super, like what's prompt do I have here? Like CMYK highlights for me has been a big one that I've been using deep blacks, high contrast, and then we have just a really powerful srf, which actually Came from a previous midjourney generation I did that day. And then we get all this, like, cool stuff, I think. Yeah, the prompts, experiencing music. So we're starting to see, like, ears and like, you know, maybe body sensations. That's another tip too, is like when you're in especially the create phase, maybe even iterating phase, they kind of blend together. Sometimes I'll literally, if I'm doing like a startup deck or something, like a VC deck, sometimes I'll literally just like paste in the sentence and just like, see what it gives me. Like the full sentence. Like, this is our business model. Just to get me thinking. Just get the model thinking. Same thing. Like, I'll just do, like financial markets. Really vague stuff. Midjourney is like, very poetic, I would say. You can, like, write little poems to it and it'll actually give you some. It'll. It'll do what you want it to do. Like, this is experiencing music to me. You know what I mean? You feel the vibrations through your body versus, you know, if you were to describe this image, it's gonna sound like a whole bunch of nonsense. Even worse. Even worse. If you do it in like, Gemini or something, ask it to give you. You're just never gonna be able to get this. Which is why I love Midjourney so much, because it just feels like an extension of your. Of yourself. Basically.
B
Every time I see Midjourney, I have to tell people, you know, now I'm this host of this AI podcast. I get to see tools and I do a lot of things every day. And I. Mid Journey was the first tool that just switched my mind about what was possible with AI. It really is an inspirational tool. It's fun, it's accessible to not just people like you that are building a business off of this, but my kids love Mid Journey. It's such a creative space, and I feel like it's one of the more, you know, dare I say it, like, soulful AI experiences. And so I love the idea of just getting out of, like, the tactical and practical prompting of, like, I want this thing and this camera, blah, blah, blah, and just playing in a space with it to see if it can inspire. Inspire things for you. This episode is brought to you by Lovable. If you've ever had an idea for an app but didn't know where to start, Lovable is for you. Lovable lets you build working apps and websites by simply chatting with AI. Then you can customize it, add automations, and deploy it to a live domain. It's perfect for marketers spinning up tools, product managers prototyping new ideas, or founders launching their next business. Unlike no code tools, Lovable isn't about static pages. It builds full apps with real functionality. And it's fast. What used to take weeks, months, or even years, you can now do over the weekend. So if you've been sitting on an idea, now's the time to bring it to life. Get started for free at lovable dev. That's lovable.dev okay, so just to recap really quickly, on our mid journey journey, we have done mood boards. We have used those mood boards to kind of create some and get a sense of what's working well with the mood boards or not. We've pulled in those mood boards via style references. We've also pulled in specific images as image references. You've shown us how to go from like the most generic astronaut prompt to slightly more specific, but still pretty lazy. Dear New York City luxury prompt, now you're starting to get stuff that you want. How do you kind of like package this up and scale it out?
A
I'll basically just keep going with this same SREF stat and just continue to generate images across, you know, the subject matter. So for like this one, you know, I'm thinking about AI bubbles. I'm thinking about talking about technology, I'm thinking about talking about culture. So I literally just prompted AI bubble using the same prompt. But as you can see, we're getting to the dreaded like five finger thing. So in this case I'll hit very subtle or very strong and it'll help me like come up with a couple more generations of that aesthetic. So that's one technique I'll use. Kind of like fight those. I will also go and I'll like steal prompts or get inspiration for prompts from the Explore page, especially when I'm generating like really large kind of like stock photo sets of images. So like, here's like an edgy man. I don't have any edgy man photos yet. I thought the basketballs could be a cool motif in this aesthetic. So then I kind of like pick apart those prompts and I use them here. And then eventually I get to kind of like what we saw at the beginning, all these images that I'm very, very happy with. And I'll move on to sort of reinforcing my prompts if need be and editing images. Um, so one thing that I always try, especially once I have like the exact outputs that I want already, I'll go ahead and I'll make another Mood board again. Um, so you can literally just add from your gallery. You can just click. So I selected. I think this is about, like, 30 images that I liked. And then what you can do is you can use this again. Um, so what should we prompt? Like Turtle and the Sea.
B
I was thinking turtle. This is weird.
A
You're thinking turtle.
B
We're floating through mid journey together. I was like, maybe she'll do like a turtle.
A
Yes, very funny. So this is, like, one again, kind of back to the beginning. We're going to try the mood board, see if it works. If it doesn't, the good thing is we already know this SREF stuff works. Something else that I was trying. I'll take that new mood board with the images, and then I'll try with a different mood board that I have. So this one's called Real Skin. This is just what I'm trying to make, like, really realistic images of skin or of models. Obviously, it influences a little bit of what they look like too. So I will use both mood boards at the same time. I might just say model, and then we're able to kind of like, play with using rose aesthetics. Now I will say. So these are generated with the mood board. Definitely, like, closer than, like, if we hadn't used it. But I still think the SREF is gonna, like, be supreme here. Yeah. So what I'll do is I'll just take that, this mood board here, and I'll click that, run the same prompt again, but using the SREF and using the mood ward, and we'll see what it gives us. Yeah, so same for this, like, closer to what we want, but, like, not as, like, stylistically consistent as these, which sometimes, depending on what you're making, like, in my opinion, like, all these images are great, but, like, over time, it might start to look a little bit flat or, like, too much pink. So, like, weaving in a couple images that are, like, not the exact same. Style can also be good, too, as long as you have this kind of, like, through line. So, like, good example here is, like, this turtle, our mood board and our srfs is, like, not super optimized for animals. Yeah, that's what I was saying. Yeah. It's way more optimized for, like, editorial stuff. So what I might do here, I do have, like, a nature mood board that I have adventurecore, and I never delete my mood boards. So I'll just, like, keep them around and we'll just. In case. Yeah, we'll see what this gives us. Using the srefs. Hard to See her getting a sexy lady, naturally, that is one of the
B
downsides of mid journey is you're going to get a lot of sexy ladies.
A
Yeah. So again, cool. Not too realistic. So this might be a part of the process where we kind of go back to square one in a sense and think, like, okay, maybe we need to find reference photos of vintage National Geographic that kind of have this, like, print aesthetic, but it's of animals to give midjourney a little bit better information to work with.
B
What I, I want to say is I love this process of you're just like, finding the right two or three things to mix in mid journey. And sometimes it's a style reference and a prompt. Sometimes it's a prompt and a mood board. Sometimes it's a mood board and image reference, you know, and so you can just combine all these things and ultimately iterate to a package that you want to. Want to send to clients.
A
Yeah, and then I'll just deliver this. Usually in Figma, the only thing with midjourney, they don't have any sort of like, sharing. I think I've literally seen designers, like, if they're on a big enough project, they will like, literally create a midjourney account just for that. And then what I usually do, I'll literally just paste in, in Figma what that, like, final prompt is. So the most important stuff is like, the profiles that you're using. For this one, I happen to go like, very crazy with the profiles, stylization if necessary. And then I just say, like, these are the reference photos. So for like all of these images that you see here, I think like a hundred percent of them, if not like 90% of them, were generated with like this exact setup right here. So I just give the clients this. I'll give them a set of images also. These images are going to be like, in context, but yeah, it's still kind of the wild west.
B
What I think is really cool about this and the reason why you're leaning into it, I really appreciate is, you know, in the past, and we've talked about this a little bit in our episode with Zach at Gamma, who got a very similar package like this from their. Their brand team is in the past, brand and creative directors or agencies would like, give you these photos and be like, cool, call us and re up when you want more photos. And what I love is that you're like, look, I put in all this. You're gonna, you're gonna value me for all this upfront work that I'm gonna do to Define the space, give you these codes, like, really give you reference images. And then now you can go do this for yourself. And if you wanna evolve the brand or you're not quite getting what you want, great, come back to me and I can give you another package and we can go forward. But it's just like a very different model of providing service and I think it creates a really positive collaboration between the client and. And the creative director.
A
I know sometimes I'm kicking myself, I'm like, gosh, I probably should just not get some and just continue charging them. But I'm kind of allergic to retainers. I like. I like doing this beginning process so much that I don't want my old clients to bother me. I just want to keep making new stuff.
B
Well, I. I love it and I'm sure people are going to see this and you might have a few more for new, new clients. Okay, so you showed us kind of end to end how we get through these packages. What are just a couple other workflows that you find yourself using, I guess,
A
to continue on with this one? I can show some that I already have done. So for some of the images in midjourney. I'm sure you have all seen the terrible hands oftentimes too, when you're doing kind of like more vintage aesthetics. If it ever can give you an Apple logo, it might give you like some weird old computer. So something I do a lot of the time, even just for my personal stuff, I post on X. I'll take my midjourney images and I'll take them into Flora or Hicksfield sometimes and I'll just use Nano Banana as Photoshop. Nano Banana literally is just Photoshop. That's exactly how you should think of it. You're just able to speak to Photoshop essentially for most people. So what I have here is this image that I really like that we generated it, but I want to upscale it so I get more texture in her shirt and stuff. And then I want this to be like a real computer. Nano Banana does require a little bit more prompting than Midjeri, but it's much more, I don't want to say forgiving. It's much less complicated, like if you're a beginner in some ways, depending on what side of the line you stand on, whether you're more, like, technical or more artistic. But anyway, I just said replace the computer. She's typing on on a 2026 midnight block MacBook Pro. So Nano Banana is like a reasoning model, so it, like actually knows what things are so you don't have to give it a reference photo all the time, especially for stuff that's in like the public mind sphere. I also usually mention like, don't change anything else. I say keep the position and the size of the computer exactly the same. And then just because I've done this so many times, I know sometimes the end of animal might, you know, change the angle slightly. So I just say exactly, I say exactly what it's seeing. So like only the left side and the keyboard is visible. And yeah. And then if I were to download this photo, it's going to be like 4000 by 4000 versus like 800 by 800. Kept the style pretty much exactly the same. We just slotted in like a real computer so it would be relevant to like use now on social media.
B
I'll just gonna behind the scenes at how I AI we use a very similar process to upscale screen caps from the podcast for our YouTube thumbnails. And so you know, I like make make these faces and then we want to clip them for the podcast but just screen capping from the video is really low resolution. And so we use a very similar prompt to upscale and improve the lighting on, on our photos and then we drop those into our thumbnails.
A
Oh, I got to show you, you don't actually have to take thumbnails anymore.
B
Oh, show me live. Live demo.
A
Have you been seeing my like articles? No.
B
Show me detour.
A
So this is a really like kind of for us, all these like nodes and stuff. So my profile photo is AI I think I got quite a bit of attention for doing that. I think I ended up making it Higsville at some point. But basically what I do is I'll take a bunch of selfies of me that are like more realistic looking because sometimes like, you know, you'll take a best selfie and like doesn't look like you at all. I'll also like show my teeth. I don't have like perfectly straight teeth. I'll take a couple reference photos and then I can just make me do whatever I want. This was another one. I had a mid journey photo that was this and I wanted her to be annoyed. I kind of already had this process in my head. I knew I wanted a photo of me in this vibe, but angry. So I took this photo, made her angry and then I replaced my face which like me on my best day. This 100% looks like me. I have like multiple reference photos to make sure it's like actually getting my vibe. And then from there I'M able to. You know, I was thinking about, like an anti agency post. I think I just ended up using this photo. This was one. Took me a couple tries. This is where again, like reference photos kind of really come in. And sometimes, especially with Nana Banana, sometimes you just gotta, like write that prompt, you know, like with the actual camera angles with the background. But I ended up just finding these on Pinterest, I believe, of like the aesthetic reference and the actual pose reference. And then this was for my article on AI legal stuff. What else have I done? This is one recently. Funny enough, I could not get it to give me extra fingers, which is hilarious because there were years of my life where I was trying to do the opposite. So 15 minutes I was able to get it to give me, like, AI fingers. That was a recent article. I did too. Again, we're like using SREFs. This one didn't actually kind of like carry through, but in earlier references it did.
B
So I just want to call it for people who are listening, not watching, because I'm making a face, which is Jimmy basically dragged in a bunch of realistic but still my face, which is really hard as somebody who's trying to generate images of themselves realistic but still my face selfies, and then has used Flora to generate a bunch of mix and match remixed versions for articles and thumbnails. Y', all my YouTube thumbnails are about to get real.
A
Yeah, real good.
B
I'm so excited. This is very, very helpful. And if I could throw back to probably three, two or three years ago. So it had to be three years ago. One of the first things that I did was back in the day when you actually didn't have these beautiful UIs is I fine tune a model directly on my face that I could. It was like a light. I forget what it was called.
A
Yeah, it's a Laura, I think.
B
Yeah.
A
Yeah.
B
So I. I fine tuned it on my face and then I generated a bunch, a bunch of images. And it was so useful to have this specific one. And now we're all spoiled. We can just do this in this ui and I never want to see this Chad Steve Job ever again. But I love the idea of dragging these two nodes together into. Into an image.
A
Yeah, People. People struggle out with it. It's hard to. To get to nail it. I think it's. I have like a really kind of like, strange face. I have like, big eyes and stuff that, like, just latches onto me really well. Like I have an Asian boyfriend and it just like, refuses. Like every generation I make of him is just like generic Asian guy.
B
We could, we can do the aside of like the like the bias of what goes into these training, training sets. I mean we can get a lot of conventional, as you said, conventionally attractive blonde women.
A
Yeah.
B
We can generate all day, all night because they're all over the Internet. Fingers crossed.
A
Every, every athlete, if you, if you want an athlete in mid journey, it's going to be a very large black man. It's going to be a very strong, beautiful black man.
B
Oh my God, get a white.
A
You can, you can not get a white athlete in printer.
B
I mean this is what's really interesting is as you start to play with these models, you really start to understand they are trained on the Internet. And I was mentioning recently in some like chit chat about malt book, which is where all the open claw lobsters are together and people are like, why are they weird? And I was like, they're weird because it's a fake Reddit trained on Reddit data and people on Reddit are weird. And so I think one of the things you have to know about using these models is like where does the training data come from and therefore what's it going to generate for you and what isn't? And I, and I. The more you explore it, the more you, more you hit those edges. Jamie, I love all this. This is all super useful. I'm going to be able to go put this into practice literally today because we're working on a, on a thumbnail.
A
Yeah. I'll give you prompts.
B
So let's, let's skip over to lightning round and then I will get you out to generating another thousand images. So my first question is, where do you go for inspiration? I think one of the things that you sort of like pass through is you're like, oh, if you know this magazine or that editorial style, that camera. And I feel like people aren't cultivating their visual taste and language enough. So what are some of your, you know, other than Pinterest, what are your, some of your sources where your continually improving your language and your aesthetic taste through exposure to other things?
A
Yeah, One thing that I recommend to everyone is to start a list on Twitter. There's tons of like basically Tumblr accounts still to this day that post like really aesthetic images. So like some of these are like this accounts literally called pink glitter. So she posts a lot of like 2,000 stuff. So that's really great to use for midjourney fashion accounts. A lot of fashion accounts, who I follow. So I have Like, a Twitter list of, like, really aesthetic photos. Like, this is obviously great if you, again, want more sexy ladies. So that's what I do. Also, Cosmos, of course, is superb. So let's do, like, ethereal model here. One of your favorite images. Yeah. So obviously this is just, like, a treasure trove of Astros, especially for, like, specific moods and vibes. I feel like, as a designer, I still, as much as I love Cosmos for, like, branding stuff, like logos and, like, website inspiration, I still use Pinterest a ton as, like, my home base, but I will still use Cosmos for, like, art direction things. The great thing about both of these is, like, Pinterest has a plugin, so you can literally save from, like, any page. Especially, like, again, if you're on, like, X, you can go save to Pinterest. Cosmos, I believe, has the same thing, but I don't have it turned on right now, so that's what I do. And then anytime I'm, like, shopping online again, X Instagram, something I teach in my course as well, is a daily taste practice. So just, like, get really on top of saving and archiving stuff. My mood boards on Pinterest are actually shockingly simple because I realized if I had too many, I was like, getting kind of, like, decision fatigue, which is why my design inspiration board is 7,000 pins. It's probably time to, like, switch that up, but I have, like. I don't know, what is this? Like, 20 mood boards. So like, instead of having, like, ethereal model mood board with, like, 10 images, I just have, like, one that's like, SREF. And anytime I see anything on Pinterest that I would, like, love to bring in to midjourney one day, I just save it. And then when I need inspiration for, like, any project, even, like, client projects, like, I'm doing a sock brand soon, so I have a lot of running stuff. I just save it all here, and then it's really easy for me to go through and just, like, copy and paste into mood boards. As you can see here, like, these are images that I've saved. I don't know what the hell I was looking at.
B
You were looking at that. It's like shells, and it seems like very.
A
It was like a website for, like, technical 3D scans, but this is, like a grain of rice or something. But it was sick. So I, like, saved this, and then I really need someone to do this for me. But I get, like, millions of views on Pinterest a month now, because every time I see something and I save it, like, images I was already going to save anyway. Like, this is just from like high fashion Twitter. I just saved it and it has like 200,000 impressions.
B
So not only this, this is a kind of like source of inspiration for you. It could actually be a source of business for you because you're sharing out, making these resources useful to people. Jamie, my last question I have to ask you. Ask everybody. Which is, you know, you. You have seemed to. Your prompting technique is keep it dead simple. We also talked a little bit before the show about how you don't love to prompt a lot and you're not like, you know, you don't want to spend a bunch of time in Claude. But yeah, if AI is not giving what we want, if we're getting the ugly stuff here, what's your prom? What's your prompting or personal technique to just get it going in the right direction, whether it's images or text.
A
Are you mean firstly take a break always? Like, you're never going to be able to, like, see properly if you're kind of like in it. So, like sleeping on it or just like walking away. And then sometimes I come back and I just like, new sref different prompt, like, grab this reference, use this camera, and then it immediately works. So I would say walking away. And then part of my process too is just like, during that walk away, when you come back, be like, okay, what is actually the problem? Like, for some of these generations, I'm like, it's just too busy. It's too midjourney. I'm using like 18srefs. Like, this is just not going to work. So I'm going to, like, take a step back. I'm going to redo the mood board and I'm going to figure out what exactly from these images is important. And it's like, all right, I keep getting this, like, stupid red color. I'm going to take out this red color. Like, even though I love this image and I would love for it to, like, come through, it's not giving me what I want. Midjourney's not listening to me. So again, like, I think I talked about this in the beginning. Like, brutal honesty on, like, what's going on and trying to, like, not, I don't know, make AI work the way you want it to work and actually understanding how it works in general. But I think time away is probably, I love it.
B
Time away and seeing things from the AI's point of view.
A
Yeah.
B
Well, Jamie, this was great. How can we find you and where can we be helpful?
A
Yeah. So I spend all my time on X. So X amiegannon or Tech Bimbo is my name. And yeah, I have an AI course coming out. It's called the AI Creative Director. It's on Maven. You can find that on my X or my website as well. So if you are interested in like really deep diving into this and getting live coaching from me and hearing more of my by yapping on AI, I would highly recommend you do it. It's meant to be able to make you create consistent client level work like I showed today. So join the course. Awesome.
B
We'll link to that in the show notes. Well Jamie, thank you so much for sharing this and I'm going to go dive into midjourney.
A
Awesome. Cool. Thanks Claire.
B
Thanks so much for watching. If you enjoyed this show, please like and subscribe here on YouTube or even better, leave us a comment with your thoughts. You can also find this podcast on Apple Podcasts, Spotify or your favorite podcast app. Please consider leaving us a rating and review which will help others find the show. You can see all our episodes and learn more about the show@howiaipod.com See you next time.
Mastering Midjourney: How to create consistent, beautiful brand imagery without complex prompts
Guest: Jamey Gannon (AI Creative Director)
Host: Claire Vo
Date: March 9, 2026
This episode dives deep into practical, repeatable workflows for building consistently beautiful and unique brand imagery using AI generative tools—primarily Midjourney and Nano Banana. Host Claire Vo and guest Jamey Gannon break down the exact process Jamey uses to create standout visual assets, emphasizing simple prompts, leveraging visual references, and strategies for maintaining stylistic cohesion across large image sets. The episode is packed with insights for designers, founders, and anyone wanting to make the most of modern image-generating AI.
Jamey starts every project by crafting a mood board on Pinterest or Cosmos to establish the “vibe.” This is imported to Midjourney as a visual guide for the AI.
Style References (SREFs): Instead of general mood boards, using select images as “style refs” gives Midjourney stronger, more specific direction on look and feel.
Pro Tip: If your generated images don’t match your mood board, use GPT or Claude to help analyze why—this builds your “creative vocabulary.”
Start simple: Early prompts are broad, e.g., “beautiful female model” or “astronaut,” just to see what comes out. Don’t be too precious—generate fast to get baseline outputs.
Iterate: Study results, tweak or swap out SREFs. If a style pulls too much of one color (e.g., too green), remove that reference. Intuition develops as you use the tool.
Test Prompts: Jamey uses recurring subjects that work well (cats, astronauts, runners) because of their rich training data and variety.
Name-Dropping as a Shortcut: Referencing magazines (“Dazed editorial photo shoot,” “Vogue”), artists, or high-fashion terms compresses complex visual guidance into a word or two.
Image References: Use these to control composition. Tweaking by cropping can suppress unwanted elements (e.g., cropping out a prominent bubblegum in a photo).
Camera Settings as Style Codes: Pasting in actual camera names/models (“Sony RX100”) instantly shifts aesthetics to more realistic or era-accurate looks.
Literal Prompts: Don’t be afraid to just say what you want—“New York skyline can be seen in the window” or “on a matte black leather couch.”
Once you have a tight prompt and SREF combination, you can create large, consistent image libraries. Jamey adds the final “successful” images back into new mood boards for further generations.
Organizes and delivers image packages using Figma, including prompt/code documentation so clients can reproduce or iterate themselves.
This model is more collaborative and less dependent on expensive re-hires for every asset.
For detailed edits (fixing “AI hands,” adding/removing objects, upscaling), Jamey uses Nano Banana and Flora as “Photoshop you can talk to.”
Nano Banana is especially powerful for upscaling and minor replacements; it understands reasoned, high-level commands (“replace computer with 2026 MacBook Pro,” “keep position/scale the same”).
Facial/Brand Consistency: Jamey uses personal reference selfies to generate realistic, brand-cohesive headshots and thumbnails.
Gather Inspiration (03:41)
Set Initial SREFs/References (06:13, 10:34)
Create & Iterate (12:40)
Fine-Tune with Personalization Codes (13:22)
Prompt Strategically (16:48, 21:30)
Scale & Organize (29:01, 33:41)
Edit & Upscale as Needed (35:56, 37:55)
Jamey’s approach demonstrates that consistent, on-brand, “beautiful and unique” AI imagery is the result of clever visual groundwork, smart short-cutting, and honest iteration—not convoluted or lengthy prompts. Whether you’re a creative director, marketer, or founder, these workflows make AI both more powerful and more approachable.