
Simon and Jillian catch you up on the highlights from today's keynote PLUS all the "pre:Invent" anno
Loading summary
A
This is episode 749 of the AWS podcast released on December 4th, 2025.
B
Hello everyone and welcome back to the AWS Podcast. Simlish here with you. Great to have you back. And of course joined by Gillian Ford. G', day, Gillian.
A
Day two. Day two.
B
Well, it's never day two. It's always day one.
A
Oh, well, you all know the word references.
B
We know we mean. We know what we mean. Day one, reinvent 2025. Day two, here's what's going on. There's been lots happening and in this epis we're going to cover two things. We're going to cover some of the announcements from the keynote that Swami did, but also we'll be going through the pre invent stuff. So all the things that happened before re invent that we didn't get to cover, there's about 60 of them. So it's going to be a fair bit. But let's get into. Absolutely, let's get into what's the latest and greatest. So we're happy to introduce AWS AI factories. These provide rapidly deployable high performance AWS AI infrastructure in your own data centers. So by combining the latest Trainium accelerators and Nvidia GPUs, specialized low latency networking, high performance storage and AWS AI services, this accelerates your AI build outs by months or years compared to building independently. So you get immediate access to leading foundation models without having to negotiate separate contracts with individual model providers. You get to use Bedrock, you get to use SageMaker. And these AI factories operate as dedicated environments built exclusively for you or your dedicated trusted community. Which means you have complete separation and operating independence while integrating the broader set of services. So this is really useful for a variety of use cases across industries or just places where you just couldn't get the cloud in there. Now you get a factory in there now. Speaking of hardware, we're announcing Amazon EC2 Trainium 3 Ultra Servers for faster, lower cost generative AI training. This is our fourth generation chip called the Trainium 3 and it's our first 3 nanometer AWS AI power chip that's purpose built to deliver the best token economics for all the work that we're doing these days. So each AWS Trainium 3 chip provides 2.52 petaflops of FP8 compute. It increases the memory capacity by 1 1/2 times and the bandwidth by 1.7 times over Trainium 2. And it gets to 144 gig of HBM3E memory and 4.9 terabytes per second of memory bandwidth. So Trainium 3 is designed for both dense and expert parallel workloads with advanced data types. So you've got things like MX FP8 and MXFP4 and improved memory to compute balance for real time multimodal and reasoning tasks. Now, the Trainium 3 Ultra service can scale up to 133 Trainium 3 chips. So that's 362 FP8 petaflops in total. And you can get them in ultra clusters that scale to hundreds of thousands of chips. So A fully configured Trainium 3 Ultra server will give you up to 20.7 terabytes of HBM3E and 706 terabytes per second of aggregate memory bandwidth. Now this next generation Trainer 3 Ultra server also has the Neuron Switch v1, which is an all to all fabric that doubles interchip connectivity bandwidth over the previous generation. So lots of big stuff. Basically 4.4 times higher performance, 3.9 times higher memory bandwidth, and 4 times better performance per watt compared to the Trainium 2. So lots to look at there. Gillian.
A
Speaking of more things to look at, this one's like a two for one kind of. So we've announced within Amazon Bedrock and within Amazon SageMaker AI capabilities that make advanced model customization accessible to developers at any organization. Reinforcement fine tuning in Amazon Bedrock and serverless model customization in Amazon SageMaker AI with reinforcement learning, this is going to make it really simple to make those create to be able to create these efficient models that you can be able to fine tune. It's going to be a lot faster, more cost effective and more accurate compared to just using base models. So these techniques, now that they're available in Bedrock and SageMaker, just makes it a lot faster to be able to get started. I'm personally excited about this episode because I got to interview one of the folks that has been working behind the scenes on this and we've got an episode coming out on that. Yeah, it'll be on fine tuning should you fine tune. What are the different techniques? And of course this is going to be mentioned as well, so stay tuned for that. And We've also announced two new AI model training features within Amazon SageMaker HyperPop checkpointless training, which is an approach that mitigates the need for traditional checkpoint based recovery by enabling peer to peer state recovery and elastic training. And another one is elastic training. Elastic training maximizes cluster utilization as training workloads automatically expand to use idle capacity as it becomes available. This is going to save you a lot of time. If you're been used to spending a lot of time having stop your training.
B
Job, restart loading configuration. It takes like an hour or two depending on the size of the cluster. So this is now minutes, which is cool.
A
Really cool.
B
Something else that's cool is a new model, Amazon Nova 2 Omni, which is an all in one model for multimodal reasoning and image generation. Now this is the industry's first reasoning model that supports text, images, video and speech inputs while generating both text and image outputs. So you get multimodal understanding, image generation and editing using natural language and speech transcription. So you don't have to stitch lots of things together, you can just get up and running. The model supports a 1 million token context window, 200 plus languages languages for text processing and 10 languages for speech input as well. So lots of cool capabilities there. And speaking of cool capabilities, something that Gillian I have been discussing together for a while now because we knew we had to get this done for all of our amazing listeners is all the announcements that happened just before re invent and it's kind of a weird time because it's Thanksgiving, people are away, stuff's going on, lots of lead up to reinvent sort of focus, but lots of cool stuff came out. So we're going to take you through it. So let's start with the AWS Marketplace. So the AWS Marketplace now has agent mode and an AI enhanced search to let you discover solutions better. There are 30,000 listings. More than so, having an agent to help you figure out how to do things properly and quickly will make your life a lot easier. There's also an AWS Marketplace MCP server as well, so you can sit in your interface of choice and get access to what you want. AWS Marketplace has also introduced Express private offers for fast personalized pricing. So rather than having to talk to a sales team, et cetera, you can get it done automatically using guess what AI. And there is now a new automated integration for CrowdStrike Falcon next gen SIEM in AWS Marketplace as well. So this unifies your threat detection, et cetera and it correlates lots of stuff. And it means you don't have to do any manual setup when you're creating it as well. We're also now happy to introduce multi product solutions in the marketplace. So this is a combination of products and services from one or more AWS partners tailored to meet a specific use case or industry component. Each component has its distinct pricing and terms, but it means you can get it all in one sort of one up solution and the AWS Marketplace now also supports variable payments for professional services. So this is a new billing option that lets professional services sellers bill customers as work is delivered.
A
Next up we've got Analytics AWS announces the Apache Spark Upgrade Agent. This is a new capability that accelerates Apache Spark version upgrades for Amazon EMR on EC2 and EMR serverless. The agent converts complex upgrade processes that typically take months into projects spanning weeks through automated code analysis and transformation. AWS Glue now supports Materialized Views, a new capability that makes it easier for data teams to transform data and accelerate query performance. Amazon EMR Serverless now offers Serverless storage that eliminates local storage provisioning for Apache Spark workloads. This is going to reduce data processing costs by up to 20% and prevent job failures from disk capacity constraints.
Amazon Kinesis Video Streams now supports a new cost effective warm storage tier. The warm storage tier enables developers of home security and enterprise video monitoring solutions to cost effectively stream data from devices, cameras and mobile phones while maintaining extended retention periods for video analytics and regulatory compliance.
B
Now let's move on to artificial intelligence. Yes, more updates in that because we are deep in that wonderful virtuous cycle where customers get their hands on new services and capabilities and they say if only it did X, Y and Z and the team works towards that. That's why there's lots of updates. And spoiler alert, wait till we get to the Connect section. Oh my goodness, that seems to be busy. But let's talk artificial intelligence first. Amazon Bedrock Agent Core Runtime now supports bi directional streaming so now you can have real time conversations where agents listen and respond simultaneously while handling interruptions and context changing mid conversation. So this gets rid of that friction of the sort of, you know, like talking on a handheld radio. It's much more natural. Amazon SageMaker catalog now exports asset metadata as a queryable data set so you can get it through an Apache iceberg table through Amazon S3 Tables and Amazon SageMaker Hyperpod now supports programmatic Node reboot and replacement so you can do it through an API. And we're also announcing a preview of the AWS MCP server. So this helps AI agents and AI native IDEs perform real world multi step tasks across one or more AWS services. And so this is consolidating the capabilities of the existing AWS API, MCP and the Knowledge Servers into one unified server so you can get access to documentation, generate calls to over 15,000 APIs and also use pre built workflows called Agent Standard operating procedures or SOPs that can guide you through normal tasks. We're also happy to announce the aws AI League 2026 championship. So this expands the flagship AI tournament with new challenges and doubles the prize pool to $50,000 for builders to compete and innovate. So this allows you to have a quick orientation and then focuses on tournaments with two challenge tracks, the Model Customization Challenge using SageMaker AI and the Agentic AI Challenge using Amazon Bedrock Core to build intelligent agents so you can participate at AWS summits and I think a lot of folks will have a lot of fun with this.
Another quick update, Multimodal retrieval for Bedrock knowledge bases is now generally available. Previously you could only search through text, documents and images. Now you can get all different formats through one interface. So terabytes of meeting recordings, training videos, visual documentation, a whole bunch of stuff. This can handle it all now. SageMaker HyperPod now supports managed tiered KV cache and intelligent routing for large LLM inferences, so this means you can optimize your inference performance for long context prompts and multi turn conversations. Managed tiered KV Cache addresses the challenge by intelligently caching and reusing computed values, whilst intelligent routing directs requests to optimal instances. With this we're getting up to 40% latency reduction, 25% throughput improvement and 25% cost savings compared to baseline configurations.
Amazon SageMaker Catalog now provides automatic data classification using AI agents and Amazon SageMaker HyperPod now supports custom kubernetes, labels and taints.
Also announcing a major expansion of the AI Competency, formerly the Generative AI Competency, in the largest specialization launch to date, which includes 60 validated partners across three new AgentIQ AI categories, AgentIQ AI tools, AgentIQ AI applications and AgentI AI consulting services. This lets you identify and work with the AWS partners who specialize in developing and implementing autonomous AI systems that can perceive reason and act with minimal human oversight.
A
Now onto business applications which I think think at this point we should just call it the Connect Show. Amazon Connect now allows you to test and simulate Contact center experiences in just a few clicks, making it easy to validate workflows, self service, voice interactions and their outcomes. Amazon Connect now gives business users greater control over daily Contact center operations without requiring technical resources. With new capabilities to create customer UIs that adjust queues, routing behavior and customer experience settings in real time, business users can respond to changing conditions immediately while maintaining enterprise grade governance and security. Amazon Connect launches Real time AI Agent assistance and contact summarization for Salesforce Contact center with Amazon Connect. Amazon Connect now allows you to bring your own Amazon Bedrock knowledge bases and supports multiple knowledge bases per AI agent giving you greater flexibility in how you organize and and access knowledge content for your AI agents. Amazon Connect now supports third party speech providers for end customer self service giving you greater flexibility in how you deliver voice experiences. Amazon Connect is introducing agentix self service capabilities that enable AI agents to understand, reason and take actions across voice and messaging channels to automate routine and complex customer service tasks. Amazon Connect launches MCP support and enabling AI agents for end customer self service and employee assistance to use standardized tools for retrieving information and completing actions. You can now use Flow modules as M C B tools to reuse the same business logic across both deterministic and generative AI workflows. And of course you can integrate this with Amazon Bedrock Agent core gateway for all of the fun MCP ification of things. Amazon Connect now provides analytics and monitoring capabilities with for AI agents across self service and agent assistance experiences. Amazon Connect now provides improved analytics and monitoring for AI agents. Amazon Connect now supports creation of custom metrics for use in dashboards and APIs. Amazon Connect now provides businesses with the ability to automatically evaluate the quality of self service interactions and get aggregated insights to improve customer experience. Amazon Connect now allows you to automate email responses and agent routing logic using keyword and phrase conditions helping organizations increase self service, reduce manual handle time and improve routing accuracy. Amazon Connect is launching an AI powered predictive insights that transforms how businesses understand and serve their customers. So an example is maybe you'll have like a recommendation algorithm such as recommended for you or similar items. So this is currently in preview but with this Amazon Connect customer profile you'll be able to get these different types of individual user recommendations. Amazon Connect now streams messages for AI powered interactions. This new capability shows connect AI agent responses as they're being generated which reduces perceived wait times and improves the customer experience. Amazon Connect now makes it easier to link related contacts such as email replies, call transfers, persistent chats and queued callbacks to the same case so agents can view the complete customer journey and resolve issues faster. Amazon Connect now provides AI powered case summaries.
Amazon Connect now provides granular access controls for performance evaluations. Amazon Connect provides managers with new criteria while setting up automated evaluations, making it easier to identify relevant contacts for evaluation and providing additional insights to automatically populate evaluation forms. Amazon Connect Chat now supports in flight data redaction and message processing. Amazon Connect Chat now supports agent initiated workflows. Amazon Connect Outbound campaigns now supports multi step multi channel customer engagement journey builder. Amazon Connect now allows you to customize and visualize the appearance of the agent workspace. Amazon Connect enhances its Agent Assistant capabilities. These AI agents analyze conversation, contacts and customer sentiment in real time and actively completing tasks such as preparing documentation and handling routine processes.
That might have been the most Amazon Connect updates that we have.
B
Crazy.
They're really, really busy.
A
They are.
B
Now let's talk about compute. We have new instances. We have the new compute optimized Amazon EC2 C8A instances. These are powered by 5th generation EPYC processors. These are the Turin processors and they deliver up to 30% high performance and 19% better price performance than C7A instances. As an example, they're up to 57% faster for Groovy JVM, allowing better response times for Java based applications. So that's kind of an example where you can just do an instance update and get half off your performance, which is pretty amazing. We're also announcing the Amazon EC2 general purpose M8AZN instances in preview. These are also fifth generation EPYC processors. These have twice the compute performance than the previous generation M5ZN instances as well. We're also announcing Amazon EC2 M4Macs Mac instances in preview. So this is the latest Mac Studio hardware. If you're building for Apple you can get access to those.
And we also have got the new EC2 P6E GB300 Ultra servers which are accelerated by Nvidia GB300NV L72. These are now generally available. These are honking great powerful servers that you can get access to straight away. We're also announcing the Amazon EC2 memory optimized X8i instances. These are powered by custom Intel Xeon 6 processors and these give you 1 1/2 times more capacity, up to 6 TB and 3.4 times more memory bandwidth than the previous generation X2i instances as well. These will be SAP certified and give you 46% higher SAPs compared to the X2i instances. So if you're looking for lots of SAPs, that's the place to get it from. And aws previews. The EC2 C8ine instances, these are the 6th generation intel scalable processors. These are the Granite Rapids ones and the latest AWS Nitro version 6 cards. So you get up to two and a half times higher packet performance per VCPU than the previous generation C6 in instances you get twice the higher network bandwidth through Internet gateways and up to three times more elastic network interface compared to the existing C6 in. So if you've got packet processing workloads, telco type stuff, virtual security appliances, DDoS protection. They're the ones you want to have a look at in preview.
A
One update in databases Amazon Aurora now supports Postgres 17.6, 16.10, 15.14, 14.19 and 13.22. Next up management and governance Amazon CloudWatch now offers configuring deletion protection on your CloudWatch log groups, helping customers safeguard their critical logging data from accidental or unintended deletion. Very important. Amazon CloudWatch now enables automated quality assessment of AI agents through agent core evaluations. This new capability helps developers continuously monitor and improve agent performance based on real world interactions. Amazon CloudWatch launched incident report generation capabilities with an AI powered root cause workflow that guides customers through the Five Whys analysis technique. The feature is modeled on the correction of errors process used by both teams within Amazon and our customers to improve their operations.
B
The five Whys process is an excellent dive deep process. It really helps you understand the root cause of problems by really helping you ask the right questions in the right area. So definitely a big plus one on using that. Now let's talk networking. We have probably one of the biggest announcements we've had for a long time in networking or we've had some big ones. We are announcing a preview of AWS Interconnect Multi Cloud. This provides simple, resilient high speed private connections to other cloud service providers and it's starting in preview with Google Cloud as the first launch partner and then with Microsoft azure later in 2026. So customers have been adopting multi cloud strategies while migrating more applications to the cloud and they do have many reasons for interoperability requirements. They may want choice, they may want speed, they may want ease, they may be integrating previously existing systems, et cetera. AWS Interconnect Multi Cloud is the first purpose built product of its kind and a new way of how clouds connect and talk to each other. It enables customers to quickly establish a private secure high speed network, connections with dedicated bandwidth and built in resiliency between your Amazon VPCs and other cloud environments so you can get up and running really quickly instead of weeks or months. Now Interconnect multicloud is available in preview in 5 AWS regions and you can enable this capability in the console and CSPs can also adopt via a published open API package on GitHub. So lots of information on this one. Another version of Interconnect is now a gated preview of AWS Interconnect Last Mile this is a fully managed connectivity offering that allows customers to connect their branch offices, data centers and remote locations to AWS with just a few clicks. So again, no friction of discovering partners and complexity, et cetera. This is a collaboration between AWS and Lumen and this really takes advantage of Lumen's extensive network footprint. So it's currently a gated preview. If you're interested in this, you need to reach out to your account team and this is for our customers in the us. And one more networking update is the Amazon API Gateway now supports MCP proxy, which allows you to transform your existing REST APIs into MCP compatible endpoints. Now that's interesting. So this new capability enables organizations to make their APIs accessible to AI agents and MCP clients. And through integration with Amazon Bedrock's Agent Core gateway service, you can securely convert your REST APIs into agent compatible tools whilst enabling intelligent tool discovery through semantic search. So this gives you three key things. It lets rest APIs communicate with AI agents and MCP clients through a protocol translation so you don't have to modify your application. Second link is your comprehensive security because you've got dual authentication, you're verifying agent identities for inbound requests while managing secure requests to rest APIs for outbound calls. And it also enables your AI agents to search and select the most relevant REST APIs to best match the prompt context.
And now to finish off, let's talk a little bit about storage, and there's a good one in here that we'll get to in a moment because it changes the answer to a lot of things. But firstly, Amazon S3 batch operations introduces performance improvements. In fact, it now completes jobs up to 10 times faster at a scale of up to 20 billion objects in a job. That's a lot of objects. That's a big job. And it now pre processes objects, execute jobs, and generates completion reports up to 10 times faster with no additional configuration or cost. So if you're a batch Operations user, this is great. Again, you're old hey boss, I just sped everything up by 10 times if batch wasn't doing it for you. From a performance perspective, you now can Speaking of improvements, this is a big one. Amazon S3 has increased the maximum object size to 50 terabytes, so that's a 10 times increase from the previous 5 terabyte limit. Now this is fun because one of the questions we'd often ask SAS and perspective essays about S3 when we're interviewing or getting them trained up was hey, you can store as many objects as you want in an S3 bucket. But what's the maximum object size? And the answer, of course, was five terabytes. Well, now, that is not the answer. The answer is now 50 terabytes. So you can store up to 50 terabyte objects in all S3 storage classes and use them with all S3 features. Now you want to optimize upload and download performance for those really large objects by using the latest AWS common runtime and S3 transfer manager in the AWS SDK. And so this is really important to make sure you're using the latest SDK to get access to this. But 50 terabytes, that's a big object.
A
That's a lot.
B
Speaking of a lot, there was a lot today. So we've had reinvent stuff, we've had pre invent stuff. We've kind of got folks up to date. And tomorrow's episode is going to be a little more infrastructure focused. And we'll catch you up on any other announcements that have been happening. But Killian, it's keeping us busy.
A
I think this is keeping a lot of people busy for sure.
B
There's lots to absorb. Thank you everyone for listening. We'd love to get your feedback. AWSpartmazon.com is the place to do it. And until next time, keep on building.
Date: December 4, 2025
Hosts: Simon Elisha and Gillian Ford
This episode provides an in-depth recap of the major product announcements and innovations unveiled during Swami Sivasubramanian’s keynote at AWS re:Invent 2025, as well as a comprehensive rundown of the 60+ "pre:Invent" releases that dropped ahead of the main event. The hosts break down advancements in AI, EC2 hardware, managed services, analytics, networking, and especially, business applications like Amazon Connect. Throughout, they contextualize these launches with clear, practical implications for developers and IT professionals.
“You get immediate access to leading foundation models without having to negotiate separate contracts with individual model providers.” (01:16)
"A fully configured Trainium 3 Ultra server will give you up to 20.7 terabytes of HBM3E and 706 terabytes per second of aggregate memory bandwidth." (02:53)
"This is going to save you a lot of time... it’s now minutes, which is cool." (05:36)
"You don’t have to stitch lots of things together, you can just get up and running." (05:47)
Bedrock Agent Core Runtime: Bi-directional streaming enables natural, real-time conversational AI (handles interruptions, context shifts).
AWS MCP Server Preview: Empowers AI agents to automate complex multi-step tasks across AWS.
AI League 2026: Expanded flagship contest, $50,000 in prizes, challenges around model customization and agentic AI.
Multimodal Retrieval for Bedrock Knowledge Bases:
Now allows cross-format search (video, document, image) in one interface (11:38).
SageMaker HyperPod Upgrades: Faster, cheaper inference performance via managed KV cache and intelligent routing.
"That might have been the most Amazon Connect updates that we have." – Gillian Ford (17:57)
"The Five Whys process is an excellent dive deep process…" (21:47)
"Well, now, that is not the answer. The answer is now 50 terabytes." (25:34)
Simon on AI Factories:
"You get immediate access to leading foundation models without having to negotiate separate contracts with individual model providers." (01:16)
Gillian on Checkpointless Training:
“It’s now minutes, which is cool.” (05:44)
Simon on Nova 2 Omni:
“You don’t have to stitch lots of things together, you can just get up and running.” (05:47)
Gillian on Amazon Connect:
“That might have been the most Amazon Connect updates that we have.” (17:57)
Simon on CloudWatch's Five Whys:
“The Five Whys process is an excellent dive deep process… it really helps you understand the root cause of problems…” (21:47)
On S3 object size:
“Well, now, that is not the answer. The answer is now 50 terabytes.” (25:34)
This episode is packed with re:Invent 2025’s biggest AWS news, offering practical, nuanced perspectives from Simon and Gillian. The highlights—AI Factories, Trainium 3, Nova 2 Omni, the flood of Amazon Connect enhancements, S3’s new 50TB object limit, and the opening of AWS to true private multi-cloud networking—signal AWS’s push for comprehensive, integrated AI and cloud-native experiences.
For listeners in IT, development, data science, or cloud architecture, this episode is a goldmine of must-watch trends and technical insights.