
New game-changing AI developments are here, from SageMaker Unified Studio to Bedrock's new multi-age
A
This is episode 713 of the AWS Podcast, released on March 24, 2025. Hello, everyone. Welcome back to the AWS Podcast. I'm Simon Elisha. Great to have you back. I'm joined by one of my two co-hosts, Gillian Ford. G'day, Gillian.
B
G'day, Simon. It's always exciting to be here on the AWS podcast.
A
It's always fun. And Shruti is off gallivanting at some fancy schmancy conference. So we're hoping she hears us say that so she'll be embarrassed that she didn't join us for the podcast. But she's doing some interesting research for the podcast while she's away, so she'll share that with us. Now, a few things before we get crackalacking. Firstly, again, we want to do a mailbag episode. We've got a few really interesting questions that have come in, but we want more. awspodcast@amazon.com is the place to send them. If you've got a question about AWS, about Amazon, about the services, about the podcast, it's up to you. Throw them in there. And we look forward to answering those questions. But let's have a look at some of the key things that have happened in the last couple of weeks. Firstly, SageMaker Unified Studio has gone GA. Now, this was announced a little while back, but it's basically a unified studio for data and AI development together. And I guess, Gillian, I'm not a data guy per se, I don't have that deep data engineering expertise, but people that do tell me that this is a big deal. Like to have it all in the one place. Saves a lot of time. What are you hearing?
B
Yeah, I think so too. And I think now the delineation between data engineering and AI is really just starting to become just one unified all on the same team here. And I think that's what you're starting to see with SageMaker Unified Studio.
A
Yeah, because you've got it all together. You've got Athena, EMR, Glue, Redshift, and you've got Bedrock in there as well, so you can build and customize your apps. Q Developer is in there as well, so that's for the workflow. There's a SQL editor right in there. You know my fanboyness of SQL. So lots of stuff happening in there. Plus, S3 Tables is also continuing to be integrated with things like SageMaker Lakehouse, which you can query straight from there as well. So there's a bunch of stuff that I think is going to make things easier. And it'll be interesting to, I guess, see what people do, because you can also publish your data and your AI models to the catalog, so you get secure governance. You can use Amazon DataZone capabilities so you can make sure that the right people get the right access to the data. It's not just about putting it together, it's who has the need to know as well.
B
Yeah, I'm really excited to see how customers start to use SageMaker Unified Studio.
A
Interesting times. And speaking of interesting times, Amazon Bedrock now supports multi-agent collaboration in GA. Now I think this is a big deal, and I say that because I've been doing some work lately with an application that relies upon multiple agents. And basically what this is is networks of agents, with supervisors and specialized sub-agents that can each do their own thing and do them well. And I think this is interesting, Gillian, because what we're learning is that if we try and make a single AI call, even a single LLM, do a whole bunch of stuff at once, they struggle. But they're really good at doing chunked-up jobs. And this makes the chunking even easier.
B
It does. And I think it's cool that it now allows you to think about picking the right model based on what the task is and being able to cost optimize, because maybe you want a smaller model that will cost less and maybe be faster, and then where there's something that requires more thinking, more intelligence, you can use a more capable model in that sense.
A
And there's a number of ways that these interactions can happen. So firstly there's supervisor mode and supervisor with routing mode. With routing mode, the supervisor agent will just route simple requests directly to the sub-agent, so it won't do the full orchestration dance. But in supervisor mode it can do amazing things automatically across them all. The other thing I like is that you have inline agents, which means you can dynamically adjust the roles at runtime, which I find fascinating. I think this is gonna be really interesting. As the supervisor determines emergent requirements of the workflow, it can then change what the roles are doing to adjust. And there's also the fact that you can reference payloads outside, so external data sources, without having them embedded in the request. That's another big thing. I mean, there's so much here it's almost difficult to unpack in one go.
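To make the difference between the two modes concrete, here is a minimal toy sketch in Python. It is not the Bedrock API: the `Supervisor` class, the two sub-agent functions and the keyword-based intent matching are all hypothetical stand-ins, chosen only to show how a routing supervisor might hand a simple request straight to one sub-agent while falling back to full orchestration otherwise.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List

# Hypothetical sub-agents. In a real system each would be a model-backed agent.
SubAgent = Callable[[str], str]


def billing_agent(request: str) -> str:
    return f"[billing] handled: {request}"


def scheduling_agent(request: str) -> str:
    return f"[scheduling] handled: {request}"


@dataclass
class Supervisor:
    sub_agents: Dict[str, SubAgent]
    routing: bool = False  # True mimics "supervisor with routing" (assumption)
    trace: List[str] = field(default_factory=list)

    def matching_agents(self, request: str) -> List[str]:
        # Toy intent detection: an agent matches if its name appears in the text.
        text = request.lower()
        return [name for name in self.sub_agents if name in text]

    def handle(self, request: str) -> str:
        matches = self.matching_agents(request)
        if self.routing and len(matches) == 1:
            # Simple request: hand it directly to the single matching sub-agent.
            self.trace.append(f"routed:{matches[0]}")
            return self.sub_agents[matches[0]](request)
        # Otherwise orchestrate: consult every matching agent (or all of them
        # if nothing matched) and combine the answers.
        targets = matches or list(self.sub_agents)
        self.trace.append("orchestrated:" + ",".join(targets))
        return " | ".join(self.sub_agents[name](request) for name in targets)
```

With `routing=True`, a request such as "billing question" goes straight to the billing sub-agent, while "billing and scheduling issue" still goes through the combined path. The same structure leaves room for giving each sub-agent a different model, which is the cost optimization point made above.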
B
It is, yeah. I mean, there's just so many different types of use cases that I feel like this can really unlock in any kind of industry. I definitely see it a lot in healthcare, for example. A lot of these complex workflows, like being able to triage patient requests, are coming up. Anything that comes to mind that you're seeing with this?
A
Yeah, look, I think I'm really interested in the concept of engineering and design, and the ability for multiple agents to perform a complicated team-based task in a synthesized way that then has human oversight, which kind of means that you have access to a team. So for example, it could be a testing team, or it could be a design team, it could be an ideas team, what have you. And each of those team members will have different capabilities, and this lets you sort of bring them all together in a really specialized way, which is just very interesting. And I think there's been a lot of talk about agentic AI. I think it's multi-agentic AI that's actually the interesting part. But time will tell. Certainly in the work that I'm doing, I'm seeing it be really, really useful, which is fun.
B
Yep, I totally agree.
A
Let's get on to some of the other updates that have happened. Firstly, analytics. We're happy to announce support for AWS Glue Data Catalog views with AWS Glue 5.0. So this allows you to create views from your Spark jobs that you can query from multiple engines without requiring access to the referenced tables. Again, security is important. Amazon Kinesis Data Streams now supports IPv6, so if you need support for that, you've got it. And Amazon EMR on EKS now supports Amazon EKS Pod Identity, so this reduces complexity and the possibility of errors. Some great updates for application integration. AppSync Events adds publishing over WebSocket for real-time pub/sub. Where was this a month ago when I was doing pub/sub stuff? I could have used it then. But starting now, you can use your AppSync Events APIs to publish events directly over WebSocket connections, which complements the existing HTTP API publishing capability. So this allows applications to both publish and subscribe to events using a single WebSocket connection, streamlining the implementation of real-time features. This is a win. The other thing that's really interesting is the announcement of AWS Step Functions Workflow Studio for the VS Code IDE. Put my hand up, I'm a VS Code kind of guy. I use it all the time. And now you can visually create, edit and debug your state machine workflows directly in your local development environment, which is cool.
B
Amazon Bedrock now supports fine-tuning for Meta's Llama 3.2 models. These are the 1B, 3B, 11B and 90B models, enabling businesses to customize these generative AI models with their own data. Fine-tuning allows you to adapt Llama 3.2 models for domain-specific tasks, which allows you to enhance performance for specialized use cases. Amazon Bedrock Knowledge Bases now supports GraphRAG. Well, it's actually now generally available. GraphRAG delivers more comprehensive, relevant and explainable responses by leveraging relationships within your data, improving how generative AI applications retrieve and synthesize information. I'm really excited about this because I'm starting to see a lot of customers that are really interested in utilizing graphs, especially those who haven't used graphs before but see the applications within generative AI to support a variety of use cases. So if you're new to graphs, I definitely suggest going back in the time machine to 2024. We did an episode on graphs, and we talked about Amazon Neptune as well. So you'll definitely want to scroll back through the AWS Podcast feed to find that episode.
A
And this has got Amazon Neptune built in, doesn't it? So you don't have to do anything. There's no barrier. Just get into it.
B
And what's cool is that GraphRAG is automatically going to generate and store vector embeddings in Amazon Neptune Analytics, along with a graph representation of entities and their relationships. GraphRAG combines vector similarity search with graph traversal, enabling higher accuracy when retrieving information from disparate yet interconnected data sources. Also check out the AWS blog on this one, because we did a really interesting blog post that showed you the difference between using GraphRAG versus not using GraphRAG, and it's just a great way of being able to see the subtle difference in what the graph algorithms are able to provide. The Amazon Nova Pro foundation model now supports latency-optimized inference in preview on Amazon Bedrock, enabling faster response times and improved responsiveness for generative AI applications. Latency-optimized inference speeds up response times for latency-sensitive applications, improving the end user experience and giving developers more flexibility to optimize performance for their use case. Amazon S3 Tables now seamlessly integrates with Amazon SageMaker Lakehouse, making it easy to query and join S3 Tables with data in S3 data lakes, Amazon Redshift data warehouses and third-party data sources. Amazon Bedrock's capabilities are now generally available within Amazon SageMaker Unified Studio, offering a governed collaborative environment. DeepSeek R1 is now available as a fully managed serverless model in Amazon Bedrock. AWS is the first cloud service provider to deliver DeepSeek R1 as a fully managed, generally available model. You can power your applications with DeepSeek R1's capabilities through Amazon Bedrock's fully managed service via a single API, along with Bedrock's tools, allowing your team to focus on building differentiated generative AI applications right away.
DeepSeek R1 combines powerful natural language processing with advanced reasoning capabilities, demonstrated by leading performance in mathematics, where it achieved 79.8% on AIME 2024 (I guess that's "aim", maybe that's how it's pronounced), and in software engineering, where it achieved a 49.2% score on SWE-bench Verified. DeepSeek R1 provides a cost-effective option for unlocking advanced AI reasoning capabilities, efficiently maximizing your technology investment. Amazon SageMaker Inference now supports rolling updates for inference component endpoints. Amazon Q Developer announces a new CLI agent within the command line. You can now ask Q Developer to write code, test it and help debug issues, and Q Developer will iteratively make adjustments based on your feedback and approval. Amazon Q Business now supports insights from audio and video data. SageMaker HyperPod flexible training plans now support instant start times and multiple offers. Amazon Lex now supports confirmation and alphanumeric slot types in the Korean locale. AWS HealthOmics now supports the latest NVIDIA L4 and L40S graphical processing units and larger compute options of up to 192 vCPUs for workflows. Next topic is business applications. AWS Wickr is excited to announce a redesigned admin experience that's now fully integrated with the AWS Management Console.
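Going back to the GraphRAG item at the start of this segment, the idea of combining vector similarity search with graph traversal can be sketched in a few lines of plain Python. This is a toy illustration only, not how Bedrock Knowledge Bases or Neptune Analytics implement it: the function `graph_rag_retrieve`, the two-dimensional embeddings and the `edges` adjacency map are all assumptions made up for the example.

```python
import math
from typing import Dict, List


def cosine(a: List[float], b: List[float]) -> float:
    # Cosine similarity between two equal-length vectors.
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def graph_rag_retrieve(
    query_vec: List[float],
    embeddings: Dict[str, List[float]],  # chunk id -> embedding (hypothetical)
    edges: Dict[str, List[str]],         # chunk id -> related chunk ids (hypothetical)
    top_k: int = 1,
    hops: int = 1,
) -> List[str]:
    # Step 1: vector similarity search picks the seed chunks.
    ranked = sorted(
        embeddings,
        key=lambda cid: cosine(query_vec, embeddings[cid]),
        reverse=True,
    )
    found = ranked[:top_k]

    # Step 2: graph traversal pulls in chunks connected to the seeds,
    # even if their embeddings are not close to the query.
    frontier = list(found)
    for _ in range(hops):
        next_frontier = []
        for node in frontier:
            for neighbour in edges.get(node, []):
                if neighbour not in found:
                    found.append(neighbour)
                    next_frontier.append(neighbour)
        frontier = next_frontier
    return found
```

In this sketch a chunk that scores poorly on similarity alone can still be returned because it is linked to a seed chunk, which is the kind of difference between using and not using the graph that the blog post mentioned above is said to illustrate.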
A
Let's talk about compute. Amazon ECR has announced ECR to ECR pull through cache, so this is a capability that allows customers to automatically sync container images between two ECR private registries existing across AWS regions and/or accounts. So this means that you can get the reduced latency of pulling those cached images in region, and this makes it more cost effective, faster and more reliable as well. Amazon ECS now offers a GPU-optimized AMI for Amazon Linux 2023, and Amazon EKS now envelope encrypts all Kubernetes API data by default. So this is an additional layer of security, with an AWS owned KMS encryption key or the option of bringing your own key. Bottlerocket now supports NVIDIA Multi-Instance GPU, or MIG, for Kubernetes workloads, so this allows you to maximize your GPU resource utilization by running multiple workloads simultaneously on a single GPU whilst maintaining hardware-level isolation between those workloads. Bottlerocket also now supports AWS Neuron accelerated instance types. This now allows you to deploy and manage machine learning inference and training workloads on a bunch of new instance types like Inf1, Inf2, Trn1 and Trn2, so you can move faster. And Bottlerocket also simplifies system setup with a default bootstrap container image, so you have something to start from rather than having to maintain your own container image for that init configuration. Now let's talk about customer engagement. Amazon Connect has broadened global telephony coverage. There is now industry-leading inbound number availability across 158 countries and national outbound numbers in 72 countries, so this expansion increases coverage by an average of 125% across regions. Telephony is hard, we're trying to make it easier. Amazon Connect Contact Lens can now dynamically update the questions on an evaluation form, which is pretty nifty.
And Amazon Connect, Amazon WorkSpaces and Amazon AppStream 2.0 are now Chrome Enterprise Recommended, so this ensures seamless integration and performance for businesses using Chrome devices. And Amazon Connect can now also target multiple agent proficiencies in a single routing step. So you can have up to three OR conditions, which means you can match a contact with four different types of agents and increase the possibility of finding a suitable match. So for example, the backup for a niche banking skill might consist of agents trained on account management, registration and tax. What this means is that you can match the right agent to the right customer at the right time with a higher probability of success. Let's talk about databases. Amazon Aurora PostgreSQL zero-ETL integration with Amazon Redshift now supports multiple integrations. In fact, you can have up to five from the same Aurora cluster. So this makes it even easier for you to move data across without having to do a lot of work. Amazon RDS for SQL Server supports a new minor version in February 2025, and Amazon RDS for MySQL has got extended support for minor versions as well. My regular reminder for you to patch your stuff.
B
Now let's talk about developer tools. AWS CodeConnections now allows you to securely share your connection resource across individual AWS accounts or within your AWS organization. Previously, to create a connection you had to install the AWS connector app for GitHub, GitLab or Bitbucket for each AWS account from which source access was required. But now you can use AWS Resource Access Manager to securely share a connection to your third-party source provider across AWS accounts, so you no longer need to create a connection in each AWS account. AWS CodeConnections now also provides greater control over the creation of hosts, with a new IAM condition key for self-managed GitLab and GitHub Enterprise Server hosts. AWS CodePipeline introduces CodeBuild and Commands rules for stage-level conditions. AWS CodePipeline supports invoking pipeline execution with a new action type. AWS CodeBuild now supports registering self-hosted runners at the GitHub organization or enterprise level. Additionally, you can assign your self-hosted runners to specific runner groups for enhanced security and access control. AWS CodeBuild is a fully managed continuous integration service that compiles source code, runs tests, and produces software packages ready for deployment. AWS makes it easier for developers to build serverless applications by bringing Serverless Land's extensive application pattern library directly into the Visual Studio Code IDE. This integration eliminates the need to switch between your development environment and external resources when building serverless architectures by enabling you to browse, search and implement pre-built serverless patterns directly in the Visual Studio Code IDE. Yay.
A
IDEs. We love them. Let's do an update, a quick but important one, for end user computing. Amazon WorkSpaces Pools now supports FIPS 140-2 validated endpoints. This is a US government standard that specifies some security requirements for sensitive information, so this one's important. And let's move on to the topic of front-end web and mobile. AWS Amplify Hosting announces deployment skew protection support. Now this feature ensures front-end requests are always routed to the correct server backend version, which eliminates what's called version skew and makes your deployments more reliable. You can enable this at the branch level and there's no additional cost. My favorite price. And it's available to all customers who already use this particular service. It's a great way to manage version changes through your user base.
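As a rough illustration of what avoiding version skew means, here is a small Python sketch. It does not reflect how Amplify Hosting implements the feature: the `SkewAwareRouter` class, the deployment ids and the backend names are hypothetical, and the only point is that a client that loaded an older front end keeps talking to the backend version it was built against.

```python
from typing import Dict, Optional


class SkewAwareRouter:
    """Toy router that keeps each client on the backend matching its front end."""

    def __init__(self) -> None:
        self.backends: Dict[str, str] = {}  # deployment id -> backend version
        self.latest: Optional[str] = None

    def deploy(self, deployment_id: str, backend_name: str) -> None:
        # Older deployments stay registered so in-flight clients keep working.
        self.backends[deployment_id] = backend_name
        self.latest = deployment_id

    def route(self, pinned_deployment_id: Optional[str] = None) -> str:
        # A request pinned to a known deployment goes to that deployment's backend.
        if pinned_deployment_id in self.backends:
            return self.backends[pinned_deployment_id]
        # Unpinned or unknown ids fall back to the latest deployment.
        if self.latest is None:
            raise LookupError("no deployment available")
        return self.backends[self.latest]
```

A browser tab opened before a release would send its pinned id and reach the old backend, while a freshly loaded page reaches the new one.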
B
This one's exciting because I don't think we've had a game tech announcement on the news update show in a while.
A
It's been a while. It's been a while. Yeah.
B
Well, for those who have been waiting anxiously, Amazon GameLift Servers launches a game server wrapper for rapid onboarding. Developers told us they face significant overhead integrating the server SDK, including dependency management and custom code implementation. The game server wrapper solves these challenges by eliminating the need for server SDK integration, making it easy to deploy game servers on Amazon GameLift Servers with zero code changes.
A
I'm invoking my right to make dad jokes on the podcast. So this is not MC GameLift, rapper. This is a code wrapper.
B
So I didn't know what that meant. So I guess that was...
A
I was making a very bad rapper star joke. MC GameLift.
B
Maybe that's why I don't get it. And we've got another one, from Amazon GameLift Streams. So Amazon GameLift Streams is a new managed capability that allows developers to stream games at up to 1080p resolution and 60 frames per second to any device with a WebRTC-enabled browser. In just a few clicks, you can upload games built with a variety of 3D engines with little to no modification, provision streaming capacity in specific AWS regions and immediately start test streaming. Yeah, this is really cool. Players can start playing AAA and indie games over the Internet in just a few seconds on their PCs, phones, tablets and smart TVs without waiting hours for a download.
A
Let's talk about the Internet of Things. Announcing MQTT-enabled SiteWise Edge gateways for AWS IoT SiteWise. So of course, AWS IoT SiteWise is a managed service that makes it easy for you to collect, store, organize and analyze your data from industrial equipment at scale. With this launch, newly created gateways now include an MQTT v5 broker component that centralizes connectivity and makes it easier for you to get up and going. One quick update for management and governance. Amazon EC2 Allowed AMIs now integrates with AWS Config, so you can now use your AWS Config rules to automatically monitor, detect and report instances launched using AMIs that have not been allowed by the Allowed AMIs rule. So this means that you can make sure everyone's doing what they should be doing.
B
Now let's talk about networking and content delivery. Amazon Route 53 Traffic Flow introduces a new visual editor to improve DNS policy editing. AWS Application Load Balancer now allows customers to provide a pool of public IPv4 addresses for IP address assignment to load balancer nodes. Customers can configure a public IP Address Manager pool that can consist of either bring-your-own IP addresses, that is customer owned, or a contiguous IPv4 address block provided by Amazon.
A
Let's talk about security, identity and compliance. Amazon S3 Access Grants has simplified authentication when using both IAM and identity provider permissions, so it just makes life a little bit easier. You don't have to choose between them, it'll figure it out. Amazon Inspector has expanded ECR support for minimal container base images and enhanced detection, so this means that scanning goes quicker and you can maintain better security with very robust environments. Amazon EventBridge has expanded IAM execution role support to all targets, so this means you can have consistent permission policies and dedicated invocation throttling limits. AWS WAF now supports PCI DSS 4.0 compliance protection with partner solutions, and AWS WAF adds JA4 fingerprinting and aggregation on JA3 and JA4 fingerprints for rate-based rules. So this is really important for you folks who work in that difficult world of figuring out who's who and what's going on, and who's good and who's not so good, and how to protect yourself against that. Amazon Redshift Data API now supports single sign-on with AWS IAM Identity Center. IAM Access Analyzer now supports IPv6. You can pick the trend. We've been talking about IPv6 for a while. I'm pretty sure pretty much everything will be IPv6 this year. AWS Service Reference Information now supports resources and condition keys. And onto the topic of storage. Amazon Data Firehose now delivers real-time streaming data into Amazon S3 Tables. So if you're a Firehose user, you can fire hose it right into the tables. And Amazon S3 has reduced pricing for S3 object tagging by 35%. There you go, another cost reduction you didn't have to do anything for, but you get it. And Amazon S3 Tables adds Apache Iceberg REST Catalog APIs, so any Iceberg-compatible application can easily create, update, list and delete tables in an S3 table bucket. And Amazon S3 Tables has added create and query table support in the S3 console.
So if you don't have a tool, you can just jump in there and get it done. AWS Backup has added logically air-gapped vault support for Amazon FSx. So a logically air-gapped vault is a type of AWS Backup vault that allows secure sharing of backups across accounts and organizations, supporting direct restore to reduce recovery time from a data loss event. A logically air-gapped vault stores immutable backup copies that are locked by default and isolated with encryption using AWS owned keys. And finally, Amazon FSx for NetApp ONTAP no longer charges for SnapLock licensing, so it's even more cost effective for you to protect your business applications. There's a bunch there today, Gillian. It was a really interesting mix of infrastructure, a lot of storage stuff, and there was some really interesting developer stuff. Something for everyone, really.
B
Yeah. And even the price reduction. I mean, it's been a couple of episodes since we had one, and especially, I think, one of 35%. So that's really exciting.
A
Yeah, it's meaningful. How do folks reach out to you?
B
Jillian Ford on LinkedIn.
A
Excellent. And awspodcast@amazon.com is the way also. And of course, until next time, keep on building.
AWS Podcast Episode #713 Summary: AWS News – Meet the Next Generation of Amazon SageMaker, Multi-Agent Collaboration on Bedrock
Release Date: March 24, 2025
Hosts: Simon Elisha and Gillian Ford
In Episode #713 of the AWS Podcast, hosts Simon Elisha and Gillian Ford delve into the latest updates and innovations from Amazon Web Services. Skipping over the usual advertisements and introductions, this episode focuses on significant advancements in AI/ML services, analytics, compute, customer engagement, databases, developer tools, and more. The discussion is rich with insights, expert opinions, and detailed explanations of new features and their implications for developers and IT professionals.
Simon Elisha (Simon) and Gillian Ford (Gillian) kick off the episode by celebrating the general availability (GA) of SageMaker Unified Studio, a comprehensive environment that unifies data and AI development.
Simon [00:20]: "SageMaker Unified Studio has gone GA... people that do tell me that this is a big deal. Like to have it all in the one place. Saves a lot of time."
Gillian echoes the significance of this release, emphasizing the seamless integration of data engineering and AI.
Gillian [01:27]: "The delineation between data engineering and AI is really just starting to become just one unified all on the same team here."
This unified approach promises to enhance productivity and collaboration among data scientists and engineers by providing a centralized platform for all AI and data-related tasks.
The conversation transitions to Amazon Bedrock, highlighting its new support for multi-agent collaboration, now available in GA.
Simon [02:37]: "Amazon Bedrock now supports multi-agent collaboration in GA... agents with supervisors and specialized sub-agents that can each do their own thing and do them well."
Gillian points out the benefits of selecting appropriate models based on tasks, optimizing both cost and performance.
Gillian [03:18]: "Picking the right model based on what the task is and being able to cost optimize because maybe you want a smaller model that will cost less, maybe be faster."
Applications:
This multi-agent system is poised to revolutionize industries like healthcare by enabling complex workflows, such as patient request triage, through specialized agent collaboration.
Simon [05:42]: "The ability for multiple agents to perform a complicated team-based task in a synthesized way that then has human oversight... it's really interesting."
The hosts provide a rundown of numerous AWS service updates across various domains:
Gillian [17:16]: "AWS CodeBuild is a fully managed continuous integration service that compiles source code, runs tests, and produces software packages ready for deployment."
Gillian [15:17]: "AWS CodeConnections now allows you to securely share your connection resource across individual AWS accounts or within your AWS organization."
Gillian [08:27]: "GraphRAG is automatically going to generate and store vector embeddings in Amazon Neptune Analytics, along with a graph representation of entities and their relationships."
Simon [17:16]: "AWS Amplify Hosting announces deployment skew protection support... eliminates what's called version skew and makes your deployments more reliable."
Gillian shares updates on Amazon GameLift, particularly the Game Server Wrapper and GameLift Streams, facilitating easier game server deployment and high-quality game streaming.
Gillian [18:12]: "Amazon GameLift Streams allows developers to stream games at up to 1080p resolution and 60 frames per second to any device with a WebRTC enabled browser."
Closing the episode, Simon and Gillian emphasize the breadth of AWS's latest offerings, highlighting the mix of infrastructure, storage, and developer tools tailored for diverse needs.
Simon [24:16]: "It's meaningful. How do folks reach out to you?"
Gillian [24:18]: "Gillian Ford on LinkedIn."
They encourage listeners to submit more questions for future mailbag episodes via awspodcast@amazon.com and sign off with a motivational message to keep building.
Simon [24:19]: "And awspodcast@amazon.com is the way also. And of course, until next time, keep on building."
This episode of the AWS Podcast serves as a comprehensive update on the latest AWS innovations, providing valuable insights for developers, IT professionals, and businesses looking to leverage AWS's expanding suite of services. From advanced AI capabilities to streamlined developer tools and enhanced security features, AWS continues to empower its users with robust and scalable solutions.