$140M Powers Fal's 10X Image Performance Peak - The AI Podcast

The AI Podcast — Episode Summary

Episode Title: $140M Powers Fal's 10X Image Performance Peak
Date: December 30, 2025
Host: Jaden Schaefer

Overview

In this episode, Jaden Schaefer explores major news from FAL AI, which has just completed a $140 million Series D raise and unveiled a significant upgrade to image generation technology. Built atop Black Forest Labs’ open-source model (Flux 2), FAL’s new “Flex 2 Dev Turbo” introduces a leap in speed and cost efficiency, promising to reshape how generative imagery is accessed and deployed by developers and enterprises. Jaden explains the technical innovations, licensing nuances, and the broader implications for the AI ecosystem, particularly how open model optimization may set the future direction for the whole industry.

Key Discussion Points & Insights

1. FAL AI’s $140M Funding and Breakthrough Model Launch

FAL AI has just raised $140M in a Series D round.
The big news: Launch of Flex 2 Dev Turbo, a model promising images “10 times cheaper and 6 times more efficient” than existing offerings.
The host frames this as “the direction that all of the AI companies, all the large AI hyperscalers... are going to be moving in” [02:00].

2. Technical Foundations and Innovation

Built on Flux 2: Original, open-source model from Black Forest Labs—also the base for Grok’s image capability on X (formerly Twitter).
Lora Adapter: Flex 2 Dev Turbo is not a standalone model but a “Lora adapter”—“a lightweight optimization layer that essentially attaches to the original Flex 2 base model” [04:15].
- Result: “dramatically improves the performance” of Flux 2 without requiring full model retraining or grossly increased hardware.
- Delivers “high quality image generations in a fraction of the time and at significantly lower cost” [04:40].

3. Licensing & Accessibility

Available on Hugging Face—but with a significant caveat:
- Distributed under Black Forest Labs’ custom non-commercial license.
- Jaden: “It is not licensed for commercial deploy... allows personal use, research, internal evaluation, but doesn’t let you use this for any revenue generated applications without a separate agreement” [17:40].

4. FAL’s Business Model and “Infrastructure over Models” Philosophy

FAL is positioning itself as “an AI media infrastructure,” not just a model vendor [07:30].
They act as a “centralized hub for real time generative media,” providing APIs for open and proprietary models spanning image, video, audio, and 3D generation.
Notable claim: “More than 2 million developers are now using their platform... they serve billions of assets each month” [09:15].
Usage-based pricing: “charging per token or per asset... just like you would pay OpenAI for using their model” [10:10].

5. Technical Metrics & Benchmarks

Flex 2 Dev Turbo’s improvements:
- Step Reduction: From 50 inference steps (original) to just 8 for comparably high-quality images [13:00].
- Enabled By: “Customized DMD2 distillation techniques” [13:15].
- Evaluation: Achieves the highest ELO score among open-weight image models, 1166 (beats Alibaba and others on the YUP benchmark).
- Speed & Cost: “It can create an image for $0.008 in just 0.66 seconds” [14:40].
  - “Basically the lowest price that is currently on the leaderboards” [14:50].

6. Impact and Use Cases

Jaden highlights how cheaper, faster image generation will “make a huge difference” for companies with large-scale image generation needs, such as Suno AI (music generator that dynamically creates millions of album covers with each song) [15:30].
Even companies traditionally seen as doing something else (music, text) are significant image generators behind the scenes.
“If you could make this thing six times faster and ten times cheaper... that would be an incredible innovation.” [05:30].

Notable Quotes & Memorable Moments

"What’s incredible here is [FAL has] been able to build something on top of that model to create images that are 10 times cheaper and six times more efficient." — Jaden Schaefer [01:15]

"Flex 2 Turbo is not a full standalone image model... Instead it’s what's called a Lora adapter. This is a lightweight optimization layer that essentially attaches to the original Flex 2 base model and... dramatically improves the performance." — Jaden [04:12]

"When you start building software on top that optimizes the models that are there, that nobody else has, like this improved version of Flux, I think that’s when these companies can become really, really valuable." — Jaden [11:35]

"When the original Flex 2 required roughly 50 inference steps to produce a really high quality image, Turbo achieves a really comparable output in just eight steps. So going from 50 steps to eight steps, this is a massive improvement." — Jaden [13:03]

"It can create an image for $0.008 per image. It’s basically the lowest price that is currently on the leaderboards." — Jaden [14:42]

"Despite you being able to access this license, it’s not licensed for commercial deploy... it allows personal use, research, internal evaluation, but doesn’t let you use this for any revenue generated applications without a separate agreement." — Jaden [17:40]

Key Timestamps

| Timestamp | Segment | Details | |---------------|-----------------------------------------------|-----------------------------------------------------------------------------------| | 01:15 | FAL’s breakthrough and cost/speed claims | 10x cheaper, 6x more efficient image generation | | 04:12 | Technical explanation: Lora adapter | Overview of how Flex 2 Dev Turbo functions on top of Flux 2 | | 07:30 | FAL’s business positioning | “Infrastructure over models” approach, serving devs and enterprises | | 09:15 | Adoption figures | Over 2M developers using FAL; billions of assets served monthly | | 10:10 | Pricing and usage | Usage-based model (per token/asset) | | 11:35 | Value of model optimization | Differentiating by building optimization layers atop open models | | 13:03 | Technical leap—step reduction | 50→8 inference steps, enabled by new distillation techniques | | 14:42 | Benchmark results | ELO scores, pricing ($0.008/image), speed (0.66s/image) | | 15:30 | Suno AI as a use case | Music generator’s need for high-volume, low-cost image gen | | 17:40 | Licensing clarification | Non-commercial license; commercial requires separate agreements |

Tone and Concluding Thoughts

Jaden is enthusiastic about the broader industry significance—emphasizing how such advancements will not only benefit AI insiders but also improve user experience for anyone waiting on slow, expensive image results.
There’s a clear hope that giant platforms (OpenAI, Anthropic, etc.) will “take some of this technology as well” to accelerate improvements industry-wide [05:45].
He’s realistic about commercial limitations (non-commercial license) but optimistic about future releases and licensing flexibility.

"I'm really excited to see if we see similar technology rolled out in other big players and we see a speed up in the overall image generation space." — Jaden [18:00]

Summary Table: FAL Flex 2 Dev Turbo at a Glance

| Feature | Stat/Claim | |-----------------------|-----------------------------------------------------------------| | Funding | $140 million Series D | | Improvement | 10x cheaper, 6x more efficient image generation | | Model Type | Lora adapter on open-source Flux 2 | | Step Reduction | 50 → 8 inference steps | | Price per Image | $0.008 | | Speed | 0.66 seconds per image | | ELO Score | 1166 (highest on open-weight leaderboard) | | License | Personal/research/internal only; commercial by separate agreement|

For listeners and developers alike, this episode provides a compelling breakdown of next-level open model optimization—and why FAL AI’s new approach could spark a wave of cost and speed improvements in generative media across the industry.

Transcript

A (0:00)

Welcome to the podcast. I'm your host, Jaden Schaefer. Today on the show I want to talk about foul AI. They've just recently raised $140 million in a series D and they've just come out with a brand new model. This was kind of their year end surprise. What's incredible here is it's built on top of Flux 2, which is an open source model from Black Forest Labs. I've talked about them a lot. They were kind of the original image model that powered Grok over on X. What FAL has done that is making, you know, it's surprising a lot of people is that they've been able to build something on top of that model to create Images that are 10 times cheaper and 6 times more efficient. I want to cover this today because I think this is the direction that all of the AI companies, all the large AI hyperscalers, everyone is going to be moving in the direction of. And so I want to break down what they, what they're doing, how they're doing and what some of the innovation is. Before we get into that, I wanted to say a big thank you to today's sponsor, which is delve.com if compliance is something that's slowing down your deals, Delve is an incredible resource. They help with SOC2, HIPAA, GDPR compliance. Busywork can definitely kill momentum inside of your organization. Delve uses AI agents to automate compliance. They do this end to end. They collect evidence, they fill out security questionnaires and they customize controls to your actual business so you can get compliant in days, not months. Something else that I think is awesome is that you get one on one slack support from real security experts who respond quickly. There's over a thousand fast growing companies right now that are using Delve to help them close deals faster and stay compliant as they scale. If this is something that would be interesting to you, go check out delve.com I'll leave a link in the description to go check out Delve. All right, let's talk about what's going on with fal. So I think the, you know, obviously they've just raised a whole bunch of money. $140 million. What's interesting here, they've just unveiled a flex to deb dev turbo. This is a faster, cheaper and more efficient version of the open weight model which was originally released by Black Forest Labs. So this new model, it's already outperforming a lot of large competitors on public benchmarks. It's available on hugging faces today, although there is a really important caveat. It is distributed under a custom non commercial license which was originally created by Black forest labs. So Flex 2 Turbo is not a full standalone image model. I'll just definitely put that out there in a traditional sense. Instead it's what's called a Lora adapter. This is a lightweight optimization layer that essentially attaches to the original Flex 2 base model and then when they attach it, it dramatically improves the performance. So the result is that you get these really high quality image generations you and they're all delivered in a fraction of the time and they're at significantly lower cost. So this is an incredible innovation if they can apply this to Flux 2, which is kind of this open source project. So that's why, you know, they're able to even work on this. But if they can apply it there, there's so many companies that could take the same technology. OpenAI Claude, who doesn't really have an image generation model right now, there's a lot of other players that could also take the same strategy and we could see much cheaper, much faster images, which I actually think will make a huge difference. I don't know if any of you struggle with this. I definitely, definitely do. Basically every time I use ChatGPT to create an image, like, yes, I know it's a magical AI machine that can make incredible any image I want. And so like I should just be grateful for what I have. But it really is sort of annoying to sit there and wait for like two minutes for my image to be generated. If you could make this thing six times faster and 10 times cheaper, I think that would be an incredible innovation. So I'm hoping that OpenAI can take some of this technology as well. I think one of the most important things that the model's open weight. So for engineering teams that are trying to, you know, grab a good software solution. Of course there's so many closed APIs right now. And this new Turbo shows how this kind of targeted optimization of open models can actually get some really big gains in speed, efficiency and cost control. I think what's interesting here to me is kind of foul's bigger bet, which is this infrastructure over models approach. They're really positioning themselves not as a model company, but as an AI media infrastructure. So they're kind of serving as this centralized hub for real time generative media. They're offering developers access to both open and proprietary models that they have image, they have video, they have audio, they have 3D generation. According to a recent press release that they did more than 2 million developers are now using their platform. They also operate a usage based pricing on their product. So essentially they're charging per token or per asset. FAL is actually someone that I think we originally looked at. I'm not, I'm not sure if we're using, we might be using one or two things from them. On AI Box, which is essentially my product where you can, you know, you use an AI and you can build tools without knowing any code. You just describe what tool you want it to build and it can chain AI models together. What we were originally using FAL for or what we looked at it for, I think what we're primarily using is together AI. But VAL does something very similar which is essentially have all the open source models, they host them. You can use an API and you pay them just like you would pay OpenAI for using their model, for using a lot of open source models. So foul or together. A lot of these players are doing a really good job in this space. But I think what's even more impressive is beyond just kind of, you know, offering, beyond just running the open source models and offering an API subscription, which is very useful for developers like, don't get me wrong, but when you start building software on top that optimizes the models that are there that nobody else has, like this, you know, improved version of Flux, I think that's when these companies can become really, really valuable. They said apparently on their whole platform in the past year they've become one of the fastest growing backend providers for API generated media. They said they're serving billions of assets each month. A lot of that has drawn from their investments that they've raised. They've had money From Sequoia, Nvidia, NVentures, Kleiner Perkins, Andreessen Horowitz. And their customers are, you know, ranging from solo developers building creative tools to a lot of enterprise teams running large scale personalized media pipelines across retail, entertainment and internal design workflows. So right now Flex 2 Dev Turbo is the latest addition to their stack that they've built. It's one of the most developer friendly image models currently available on the open weight ecosystem, I think. But what does it really do differently? I think it is a distilled version of the original Flex 2 dev model, which was released last month by Black Forest Labs, which is a startup founded by former Stability AI engineers. The base model was positioned as kind of this open alternative to offerings like Google's Nano Banana Pro, which is part of Gemini's image lineup, and OpenAI's GPT image 1.5. So when the original Flex 2 required roughly 50 inference steps to produce a high, you know, a really high quality image, Turbo achieves a really comparable output in just eight steps. So going from 50 steps to eight steps, this is a massive improvement. That acceleration I think is enabled by customized DMD2 distillation techniques. So I think the speed has not come at the expense of quality in evaluations which are conducted by artificial analysis. My favorite, you know, one of my favorite tools for determining the best AI model for something. Turbo now holds the highest ELO score among all of the open weight image models. It has a rating of 1,166 which is passing the competitor, some other competitors like Alibaba, a bunch of other players on the YUP benchmark which is factored in the latency price and the user ratings. It has Elo score of 1024. In you know, 66 seconds it can create an image for.008 dollars per image. It's basically the lowest price that is currently on the leaderboards. And I think that is a really like, that makes a big difference because if you're looking at like what AI model to power image generation, like I think a lot of interesting tools and use cases that we don't think about a lot. For example, one of them is Suno AI, which is a music generator. Every single time it creates a song for you, it also creates like an album cover for you. They're all the same kind of like style of Suno. But you think like you, you think of Suno as like this music generation company, but they're also an image generation company because they are making just as many images as they're making songs and that is millions and millions and millions, perhaps a day. And so if you can get you know, one of these models that can drastically, you know, make it six times cheaper or 10 times faster to create an image, then I think a company like Suno is very interested in that technology and running with it as well. So and Suno's like one example, there's probably like a hundred of these where they, they have to generate a ton of AI generated images and being able to make this a lot cheaper does make a huge impact on their company. One thing that I'll say is despite you being you being able to access this license, it's not licensed for commercial deploy. So this model is governed by the Flux non Commercial License V2O which essentially allows personal use research, internal evaluation, but doesn't let you use this for any revenue generated applications. Without a separate agreement. So you can make your separate agreement and then you'll be able to use this for inside of your organization. Essentially. I'm really excited to see if we see similar technology rolled out in other big players and we see a speed up in the overall image generation space. I would be a massive fan. Thank you so much for tuning into the podcast today. If you enjoyed the episode, make sure to leave a rating and review wherever you get your podcasts. As always, make sure to go check out Delve.com the sponsor of today's episode, and go check out AI Box AI, my own startup that lets you build AI tools without knowing how to code. Just describe what you want to build and we'll build it for you. Thanks so much and we'll catch you in the next episode.