Loading summary
Brian McCullough
Welcome to the Tech Meme Ride home for Thursday, December 12, 2024. I'm Brian McCullough. Today, more fallout from the whole Cruise wind down what it's like to use some of the new Gemini 2.0 features. Has Apple quite belatedly finally done a feature update that provides the Vision Pro with a killer app, an Instagram like app from China I'd never heard of, and one singular eye popping data point about the chips act? Here's what you missed today. In the world of tech, today is a day of follow up stories, kind of, and also maybe stories that have bounced out of other stories we've been talking about recently. Honda plans to dissolve a self driving vehicle partnership with GM after GM announced plans to exit the robo taxi business completely. Honda had invested $852 million in cruise for its part, while Microsoft expects an around $800 million charge in Q2 of 2025 because they had acquired a minority stake in Cruise back in January 2021. Now, as I said when all this news broke, I wouldn't be surprised to learn eventually that there were simple dollars and cents decisions that led to all of this happening. The economics inside of gm, I mean. But Andrew J. Hawkins at the Verge takes a stab at a more macro theory. In short, he posits that maybe Robo taxis will never provide the scale of profits to make it worthwhile for automobile producers to actually be in this business. So why not go back to what they do best, which is just selling cars to individuals, Just cars that are maybe mostly autonomous. Quote Cruise was continuing to rack up huge losses. The Robo taxi subsidiary lost a staggering $3.48 billion in 2023. Kyle Vogt, Cruise's co founder and Anam's successor as CEO, was under mounting pressure to expand the service and bring in more money to help cover the losses. Plus, he was directly competing with Alphabet's Way, which had more vehicles and seemingly better technology. And Google's parent company was more willing to spend billions of dollars without any near term profits to win the robo taxi business. With the screws tightening, Vote publicly drew a line in the sand. Cruise would bring in over $1 billion in revenue by 2025. Instead, Cruise itself never made it to the end of 2024. With cruise out of the picture, Waymo is one of the only players left aiming to prove that robotaxis can work in the real world. Amazon's Zoox and Hyundai's Motional are still in the game, albeit far behind. Tesla is also pursuing its own Robo taxi project, which it claims will launch in 2026. Meanwhile, GM will tackle a new risky experiment personally owned autonomous vehicles. GM knows how to sell cars to people, and the company already has a hands free highway driving feature called Super Cruise. Why not just leverage Cruise's fully autonomous technology to make SuperCruise even better? GM may have scrapped its Ultra Cruise branding to develop a partially autonomous system that covers 95% of driving scenarios, but it still thinks that people want a fully autom autonomous car of their own on their own terms. I think the application of what the customer wants in a privately owned vehicle is very different, CEO Barra said on Tuesday. But I also think there's a lot of commonality with Cruise's technology. How it seamlessly moves back and forth I think is something different in a personal autonomous vehicle. Driver assistance technologies, especially so called Level 3 systems, carry their own risks. However, there have been studies that show that the handoff between a partially automated system and a human driver can be especially fraught when people have been disconnected from driving for a longer period of time. They may overreact when suddenly taking control in an emergency situation. They may overcorrect steering, brake too hard, or be unable to respond correctly because they haven't been paying attention. And those actions can create a domino effect that has the potential to be dangerous, perhaps even fatal. The safety implications are enormous, as are the liability concerns. GM may eventually decide that robotaxis aren't such a bad bet after all, turning once again to Simon Willison to give us early impressions of Gemini 2.0 flash. Simon says the spatial reasoning performance is impressive and its new Streaming API is maybe the killer app. The really cool thing about Gemini 2.0 is the brand new streaming API. This lets you open up a two way stream to the model, sending audio and video to it and getting text and audio back in real time. I urge you to try this out right now. It works for me in Chrome on my laptop and mobile Safari on my iPhone. It didn't quite work in Firefox. It's pretty similar to the previous live demo, but has additional tools so you can tell it to render a chart or run python code and it will show you the output. This stuff is straight out of science fiction. Being able to have an audio conversation with a capable LLM about things that it can see through your camera is one of those we live in the future moments worth noting that OpenAI released their own websocket streaming API at dev day a few months ago, but that one only handles audio and is currently very expensive to use. Google hasn't announced the pricing for Gemini 2.0 flash yet. It's still a free preview, but if the Gemini 1.5 series is anything to go by, it's likely to be shockingly inexpensive. I usually don't get too excited about not yet released features, but this thing from the native image output video also caught my eye. And in the video he shows an edit executed by a prompt which said turn this car into a convertible and it produces a perfect edit with no need to fiddle with anything. Quoting Again, the dream of multimodal image output is that models can do much more finely grained image editing than has been possible using previous generations of diffusion based image models. OpenAI and Amazon have both promised models with these capabilities in the near future, so it looks like we're going to have a lot of fun with this stuff in 2025. Also, the native audio output demo video shows how good Gemini 2.0 flash will be at audio output with different voices, intonations, languages and ACC. This looks similar to what's possible with OpenAI's advanced voice mode today. End quote with all my speculation about AI maybe empowering the rise of a new compute product category in the form of smart glasses, methinks this might enable that The FCC has opened all 1200 megahertz of the 6 GHz band for unlicensed use by very low power devices, citing growth in wearables, ar, VR and other cutting edge tech. Quoting TV tech Despite opposition from the NAB and other parties who have argued that opening up the spectrum would create problems for fixed microwave links, satellite uplinks and broadcast auxiliary services that use this spectrum. The FCC has in recent years been opening up parts of the 6 GHz band prior to the December 11 vote to open up all 1200 MHz for very low power devices, the FCC expanded unlicensed use between 5.925 and 7.0.125 GHz, helping to usher in Wi Fi 6e and set the stage for Wi Fi 7 and support the growth of the Internet of Things. In October of 2024, FCC Chair Jessica Rosenworsel called for further expansion, which the NAB opposed as broadcasters extraordinary efforts to help the many communities impacted by Hurricane Helene demonstrate. It is critical for the Commission to ensure that broadcasters have access to spectrum that will allow them to provide these essential services in times of crisis and without interference, the NAB said in October. In adopting the new rules, the F the FCC noted that they will bolster cutting edge applications like wearable technologies and augmented and virtual reality, which will enhance learning opportunities, improve healthcare outcomes and bring new entertainment experiences Remember the Vision Pro? I mean, seriously, remember all the hype from almost exactly a year ago? Well, among all the OS updates yesterday, Wes Davis at the Verge says that the Vision Pro got an update that is the closest thing to a killer app we've seen for the device, and maybe something they should have launched with. Quote the Vision Pro has been able to mirror the screen of a Mac since day one, but I found the original Mac virtual display feature limiting. Text was sharp at low resolutions, but the screen was cramped. I could get more space at higher resolutions, but the text was too small and blurry to read. Yes, I can blow it up to the size of a bus to make things readable, except then I'm craning my head around too much to see everything. My normal normal 3 monitor setup lets me see the most important stuff with slight movements, but that just hasn't been possible before now in VisionOS 2.2. The standard Mac display is now curved and it seems sharper. It's not retina sharp at the highest resolutions, but I no longer have to make it gigantic to get legible text. The default virtual display becomes one of three options, standard wide and ultrawide once your Mac is updated to macOS 15.2, which lets it take over foveated rendering from the Vision Pro. Those two extra modes instantly made the virtual display viable for me, giving me the space I'm accustomed to in my three monitor life. You can crank the resolution in ultrawide all the way up to 10240x2880 if you'd like, but the sweet spot for me has been the wide displays maximum 6720 x 2880 resolution, which lets me see everything I need without constantly rotating my Vision Pro laden head. It ends up feeling more like a real monitor and not some fantasy display that evokes Weird al Yankovic song Frank's 2000 TV. This has made it easier for me to relocate to another room in my house or even outside. If I wanted, I wouldn't take it to a coffee shop for a number of reasons. Do I leave it behind when I go to the restroom or wear the Vision Pro in there like a maniac, but I'd absolutely bring it on a work trip. Apple has also made it so that the audio is sent through the headset instead of your computer speakers, as it did before. The widescreen options came in handy recently when I strained my back in a way that made it painful to sit upright. I hate doing work on a laptop, but reclining in bed with the Vision Pro on was suddenly a real option for me. There are quirks though. Switching between the display modes can be sluggish, and your Mac doesn't always remember what resolution you set, so if you switch from wide to ultrawide and back, you might find all your Windows piled up on top of each other. And the keyboard awareness feature, which shows your keyboard even if you have one of Apple's immersive environments fully turned on, works great with my magic keyboard, but doesn't reliably show the mechanical one I prefer. Still, those are minor issues. The expanded virtual display is a critical upgrade, and if it's not in killer app territory, it's at least right next door to it. It still doesn't help the Vision Pro with its biggest issues like that our bodies are all different and not everyone will find it comfortable to use for long stretches of time. And it doesn't make Apple's headset any less expensive. But it does help that my Vision Pro is now more than a personal movie theater. Now it's a gigantic high res curved display with perfect viewing angles too. That makes the price feel a little closer to right end quot Lumen is the world's first handheld metabolic coach. It's a device that measures your metabolism through your breath, and on the app it lets you know if you're burning fat or carbs and gives you tailored guidance to improve your nutrition, workouts, sleep, even stress management. Because your metabolism is at the center of everything your body does, optimal metabolic health translates to a bunch of benefits, including easier weight management, improved energy levels, better fitness results, better sleep, etc. All you have to do is breathe into your Lumen first thing in the morning and you'll know what's going on with your metabolism. Then Lumen gives you a personalized nutrition plan for that day based on your measurements. You can also breathe into it before and after workouts and meals so you know exactly what's going on in your body in real time. And Lumen will give you tips to keep you on top of your health game. So if you want to stay on track with your health this holiday season, go to Lumen Me Ride and get 15% off your lumen. That's L u m e n me ride for 15% off your purchase. Lumen makes a great gift too. Thank you thank you Lumen for sponsoring this episode. If you listen to this podcast, you care about what's going on in the world. Listening to us is great, but if you want even more great coverage of our crazy world, you should subscribe to the Washington Post. You don't really need me to tell you about the Washington Post when it comes to their tech coverage because, you know, I quote from them on the show all the time. But it's not just tech that they're good at, they're one of my personal go to's for all the things beyond tech too. I even signed up to get the Post's for your newsletter, which sends me my very own personalized roundup of stories every evening based on my interest and my reading history. Their app makes it easy for me to stay up to date on the latest news, save and share stories, and follow my favorite authors. The Post even offers a cool feature for audio lovers like you. You can actually listen to articles in addition to reading them so you can tackle your to do list and catch up on the news at the same time. Now more than ever, it's important to stay up to date on the world. So go to washingtonpost.com ride to subscribe for just 50 cents per week for your first year. That's 80% off their typical offer, so this is truly a steal. Once again, that's washingtonpost.com ride to subscribe for just 50 cents per week for your first year. Wanted to put this on your radar because it was news to me. Xiaohongshu is apparently an Instagram like app that is extremely popular with younger Chinese users. So much so that it has reportedly told investors it will double its profits to more than $1 billion in 2024, ahead of a potential IPO. So the rise of Chinese apps continue. Quote the strong results are likely to revive speculation around the market debut of a startup valued at $20 billion in its last funding round in 2021. Zhou Hangshu, which literally means Little Red Book, started out like its US Cousin as a repository of personal travel and dining photos before branching out into reviews and live shopping. Started in 2013 by Charwen Mao Wenchao and Miranda Kufang as a shopping guide for Chinese tourists, Zhao Hongshu now counts some 300 million monthly active users. Its rapid growth comes in part at the expense of incumbent e commerce leaders Alibaba and JD.com like ByteDance's Douyin, the startup enjoyed explosive growth through getting influencers to sell products to millions of users. When shoppers scroll through videos and photos, they can buy tagged products with just a few clicks. Still, online commerce is slowing more broadly as Chinese consumers tighten their belts during a severe economic downturn. Apps in general have also experienced a decline in growth from COVID era peaks when millions under lockdown turned to their smartphones for entertainment and necessities. Zhao Hongshu, which is backed by Hongshan and Alibaba among other well known names, is one of just a few Chinese Internet leaders to remain privately held. End quote Harvard has released a high quality DataSet of nearly 1 million public domain books created with funding from Microsoft and OpenAI that anyone can use to train AI tools. Quoting Wired around five times the size of the notorious Books3 dataset that was used to train AI models. Like Meta's Llama, the institutional Data Initiatives database spans genres, decades and languages, with classics from Shakespeare, Charles Dickens and Dante included alongside obscure Czech math textbooks and Welsh Pocket dictionaries. Greg Leppert, executive director of the Institutional Data Initiative, says the project is an attempt to level the playing field by giving the general public, including small players in the AI industry and individual researchers, access to the sort of highly refined and curated content repositories that normally only established tech giants have the resources to assemble. It's gone through rigorous review, he says. Leopard believes the new public domain database could be used in conjunction with other licensed materials to build artificial intelligence models. I think about it a bit like the way Linux has become a foundational operating system for so much of the world, he says, noting that companies would still need to use additional training data to differentiate their models from those of their competitors. In addition to the trove of books, the Institutional Data Initiative is also working with the Boston Public Library to scan millions of articles from different newspapers now in the public domain, and it says it's open to forming similar collaborations down the line. The exact way the book's data set will be released is not settled. The Institutional Data Initiative has asked Google to work together on public distribution, and the company has pledged its support. And finally, today the whole chips act, as we've discussed many times, is part of a geopolitical movement to have access to silicon and compute supplies inside your own borders. But it was also an effort to reverse decades of offshoring of high tech manufacturing. Take this single data point as an example of what that meant, quoting the FT A Peterson Institute study found that spending on the construction of US computer and electronics manufacturing facilities had skyrocketed in the last two years, with more electronics construction proceeding in 2024 than in the two previous decades. This has coincided with a wider stock market frenzy around companies that design chips powering artificial intelligence, such as U.S. listed Nvidia and ARM. But rebuilding manufacturing in the U.S. requires participation from companies such as Taiwan Semiconductor Manufacturing Co. The world's biggest chipmaker, and its South Korean rival, Samsung. You could say that the chip boom hasn't even started yet, one analyst said, adding the real benefits in growth that could come from it are not likely to pay off until a few years from now. End quote. Nothing more for you today. Talk to you tomorrow.
Title: Thu. 12/12 – The “Killer App” For Vision Pro?
Host: Brian McCullough
Release Date: December 12, 2024
Brian McCullough opens the episode by discussing significant developments in the autonomous vehicle sector. Honda has announced the dissolution of its self-driving vehicle partnership with General Motors (GM) following GM's decision to exit the robo-taxi business entirely. Honda had invested $852 million in Cruise, while Microsoft anticipates an $800 million charge in Q2 2025 due to its minority stake in Cruise since January 2021.
Key Insights:
Notable Quote:
“Robo taxis will never provide the scale of profits to make it worthwhile for automobile producers to actually be in this business.”
— Andrew J. Hawkins, The Verge [03:45]
Conclusion: With Cruise ceasing operations by the end of 2024, Waymo emerges as a primary player in the robo-taxi arena, while others like Amazon’s Zoox, Hyundai’s Motional, and Tesla are trailing behind with their respective projects.
The discussion shifts to Apple's Vision Pro headset, which has been under scrutiny for its high price and limited applications. However, recent updates seem to bring a highly anticipated feature that could serve as the device’s “killer app.”
Key Features Introduced:
User Experience Highlight: Wes Davis from The Verge shared his firsthand experience, noting that the updated virtual display modes make Vision Pro more practical for everyday use, akin to a real monitor rather than just a personal movie theater.
Notable Quote:
“The expanded virtual display is a critical upgrade, and if it’s not in killer app territory, it is at least right next door to it.”
— Wes Davis, The Verge [15:30]
Challenges Remain: Despite these improvements, Vision Pro still faces issues like comfort for prolonged use and its high price point. However, the enhanced display functionality brings the device closer to widespread adoption.
Simon Willison provides an early impression of Gemini 2.0 Flash, highlighting its impressive spatial reasoning and the introduction of a new Streaming API.
Key Features:
Comparative Advantage: Gemini 2.0 Flash stands out against competitors like OpenAI, offering more refined and potentially cost-effective streaming options.
Notable Quote:
“Being able to have an audio conversation with a capable LLM about things that it can see through your camera is one of those ‘we live in the future’ moments.”
— Simon Willison [27:50]
Future Implications: This advancement could pave the way for new categories of smart devices, such as smart glasses, by leveraging enhanced AI capabilities.
The episode highlights the Federal Communications Commission’s (FCC) recent decision to open 1,200 MHz of the 6 GHz band for unlicensed use by very low power devices. This move is aimed at supporting the proliferation of wearable technologies, augmented reality (AR), and virtual reality (VR).
Impact:
Notable Quote:
“They will bolster cutting-edge applications like wearable technologies and augmented and virtual reality, which will enhance learning opportunities, improve healthcare outcomes and bring new entertainment experiences.”
— FCC Representative [35:10]
Conclusion: The spectrum expansion is expected to accelerate innovation in the IoT space, underpinning the growth of next-generation devices and applications.
Xiaohongshu, also known as Little Red Book, is spotlighted as a burgeoning Chinese app poised for significant growth. Comparable to Instagram, Xiaohongshu has amassed 300 million monthly active users and is reportedly on track to double its profits to over $1 billion in 2024, ahead of a potential Initial Public Offering (IPO).
Key Drivers:
Notable Quote:
“Zhao Hongshu, which started out like its US cousin as a repository of personal travel and dining photos before branching out into reviews and live shopping, now counts some 300 million monthly active users.”
— Brian McCullough [42:20]
Market Position: Despite challenges like a broader slowdown in Chinese e-commerce, Xiaohongshu remains one of the few privately held Chinese internet leaders with significant investor backing from entities like Hongshan and Alibaba.
Harvard University has unveiled a comprehensive public domain dataset comprising nearly 1 million books, funded by Microsoft and OpenAI. This dataset is designed to democratize AI training by providing accessible, high-quality resources to researchers and small AI developers.
Key Highlights:
Notable Quote:
“It is an attempt to level the playing field by giving the general public, including small players in the AI industry and individual researchers, access to the sort of highly refined and curated content repositories that normally only established tech giants have the resources to assemble.”
— Greg Leppert, Institutional Data Initiative [50:40]
Future Prospects: The dataset is envisioned to complement licensed materials, enabling the creation of diverse and differentiated AI models, much like Linux serves as a foundational operating system for numerous applications.
Addressing the geopolitical imperative, the Chips Act aims to bolster domestic semiconductor production, reducing reliance on foreign supply chains. A Peterson Institute study cited by McCullough reveals a surge in US investment in computer and electronics manufacturing facilities, with 2024 seeing more construction in this sector than the previous two decades combined.
Key Considerations:
Notable Quote:
“Rebuilding manufacturing in the U.S. requires participation from companies such as Taiwan Semiconductor Manufacturing Co. and its South Korean rival, Samsung. You could say that the chip boom hasn’t even started yet.”
— Brian McCullough [58:15]
Conclusion: The Chips Act represents a strategic move to reclaim a significant position in the global semiconductor landscape, fostering long-term growth and technological independence.
This episode of Techmeme Ride Home provides a comprehensive overview of pivotal developments across the tech landscape, from the challenges in autonomous vehicle ventures and breakthroughs in augmented reality to significant strides in AI accessibility and semiconductor manufacturing. Notably, Apple’s Vision Pro updates and Google’s Gemini 2.0 Flash showcase the rapid evolution of consumer and enterprise technologies, while strategic moves like the FCC’s spectrum expansion and Harvard’s data initiatives underscore the ongoing efforts to support innovation and competitiveness in the tech industry.
Note: Advertisements and sponsored segments were excluded from this summary to focus solely on the episode's core content.