Practical AI Podcast Episode Summary

Episode Title: Open source AI to tackle your backlog
Date: April 17, 2025
Host: Daniel Whitenack (PredictionGuard) & Chris Benson (Lockheed Martin)
Guests: Robert Brennan (Co-founder & CEO, All Hands AI), Graham Neubig (Co-founder & Chief Scientist, All Hands AI; Associate Professor at Carnegie Mellon)

Overview

This episode delves into the evolving ecosystem of AI-powered developer tools, focusing on the emergence of "agentic" coding assistants—particularly the open source All Hands AI project. Daniel and Chris talk with Robert and Graham about the real-world impact of these tools on developer workflows, challenges faced building robust AI agents, the philosophy underpinning open source in this space, and the future of software development with AI.

Key Discussion Points & Insights

1. The Spectrum of Code Generation Tools (02:49)

Tooling Landscape:
- At one end, visual, rapid prototyping tools (e.g., bolt.dev, v0.dev) lower the bar for non-coders or people with limited coding experience.
- At the other, tools for senior developers focus on production-grade code and robust workflows.
Tactical vs. Agentic Tools:
- "Tactical" tools like Copilot provide inline code suggestions within an IDE, helping with immediate tasks.
- "Agentic" tools can autonomously tackle higher-level problems over longer time spans—taking a description of a task, iterating without direct supervision, and delivering results later.
All Hands Positioning:
- "We are really oriented towards senior engineers ... and those more agentic workflows where you are giving an agent something to work on and it can iterate forward on its own without you having to babysit it." (Robert, 04:18)

2. Building Effective Agentic Frameworks (05:32)

Original Expectation: The All Hands team tried building a multi-agent system where different agents would own specific aspects (debugging, software architecture, browsing, etc.).
Surprise in Practice:
- A single, well-resourced agent, able to access broad context and use different "microagents" (tools or prompts for specific subtasks), performed better and was easier to maintain.
- "It has to have the ability to pull in ... whatever information it needs." (Graham, 06:24)

3. Transitioning from Traditional to Agentic Workflows (07:38)

Copilot as Step 1: Drastically improved autocomplete for code.
The Pain Point: Many devs found themselves copying errors back and forth from ChatGPT—a tedious manual mediation.
Agents Automate the Loop:
- Agents now handle entire feedback cycles: proposing code, executing it, reacting to test output/errors, iterating, and producing ready-to-review pull requests.
- "Today's agents are able to just continue executing ... until it comes to a full PR where all the tests are passing and it's just kind of packaged up and ready to go." (Robert, 10:10)

4. Who Benefits from Agentic Coding Tools? (11:30)

Risks for Junior Developers:
- Agents excel at greenfield MVPs, but over time can create messy, unmaintainable code unless an experienced developer reviews and guides them.
- "If you're not looking over its shoulder and critiquing its work, the code base will just grow into this monster and you'll have to throw it all away." (Robert, 12:33)

5. The Importance of Open Source (13:43)

Democratizing Development:
- "We believe that if this is going to have a big effect on software development, software developers should be able to participate in that." (Graham, 14:22)
Research & Community:
- Open agentic frameworks won't fall behind closed-source because of deep engagement from academia.
- Openness enables collaborative improvement, experimentation, and customizability.

6. Community and Contribution Models (15:37)

Onboarding Contributors:
- Complex projects can make onboarding intimidating, but issues for new contributors are sometimes solved by the agent itself!
- Successful contribution areas: front-end work and collaborative research projects.

7. How All Hands Works: Open Source vs. Hosted (17:39)

Open Source:
- Runs in Docker; can be self-hosted; supports GitHub integration; sandboxed agent operation via per-session Docker containers.
Hosted Cloud Version:
- "App.AllHands.dev" offers seamless infrastructure, rapid sandbox startup, OAuth GitHub integration, and hands-off operation from pull request comments or (soon) Slack.
- "Those are actually the interactions I love the most where ... I just, you know, summon the agent and it just does the work for me and I get to, you know, reap the fruits at the end there." (Robert, 19:16)
Mobile Accessibility:
- "My favorite is programming from my phone ... By the time I get to work I have a pull request to review." (Graham, 19:46)

8. Mindset Shifts & Productivity Hacks (21:36)

From Hands-On to Delegation:
- Learning to trust the agent and let go of micromanagement—comparable to transitioning from coding to managing a team.
- "You have to kind of have that switch flip ... oh no, this is actually a good thing for the agent to work on." (Robert, 21:44)
- "Trust but verify" is key—review agent output and provide feedback.

9. Agents in Large, Real-World Codebases (23:56)

Technical Challenges:
- Editing files reliably (generating diffs) is crucial and was a bottleneck until recently.
- Locating the correct files to edit: models use traditional tools (find, grep) and benefit from code search.
- Avoiding repetitive failure loops: advanced models like Claude handle this better.
In Practice:
- "The Open Hands agent is the largest committer to our code base... it had 209 commits over the past three months ..." (Graham, 23:59)

10. Why Build Custom Open Source Models? (26:54)

Strategic Reasons:
- Openness and customizability.
- "If you're relying on a closed API based model entirely, then you can never fully achieve that goal." (Graham, 27:49)
- Cost control for large, iterative agentic tasks.
- Ability to fine-tune for customer needs (e.g., languages, workflows).
Model Launches:
- All Hands recently released their first open model (v0.1) and plan continued development.
Product Integration:
- Exploring routing certain tasks to cheaper or custom models without sacrificing accuracy.
- "We built this model specifically based on the data set that we've gathered. And that's a really cool product feature ..." (Robert, 30:13)

11. Openness and Enterprise Strategy (31:41)

Model and Framework Licensing:
- Core research and agent logic will remain open source; advanced team features (multi-tenancy, compliance, auditing) will be proprietary.
Enterprise-Grade Security:
- Three deployment options:
  - Fully open source/self-hosted (integrate with approved local models or Bedrock)
  - Managed cloud (Anthropic, via All Hands infrastructure)
  - Private cloud deployment for security-conscious clients ("all configured to stay within your walls." —Robert, 33:26)

12. Benchmarking and Performance (34:54)

Evaluation Approach:
- Swebench and 20+ other benchmarks via a large evaluation harness, maintained with academic partners.
- Focused on real-world use cases (web navigation, fixing broken commits, test generation, version updates).
- Collaborating with academic teams to broaden the scope and relevance of benchmarks.

Notable Quotes & Memorable Moments

On the Power of Agentic Tools:
- "I feel like a wizard again. I can get so much done using large language models and using agents." (Robert, 38:19)
On Democratizing Software Creation:
- "[If] this is going to have a big effect on software development, software developers should be able to participate..." (Graham, 14:22)
On AI’s Transformative Potential:
- "I'm building agents to create software ... not because software is the end, it's because software is a means to an end ... If we can make it easy ... we'll be able to make great strides forward." (Graham, 39:28)

Timestamps for Important Segments

02:49 — Overview of coding tool landscape; All Hands’ position
05:32 — Surprises building agentic frameworks
07:38 — What makes agentic workflows different for users
11:30 — Who is (and isn't) successful with agentic coding tools
13:43 — Why the open-source approach matters
15:37 — Developer onboarding and community contributions
17:39 — How to use All Hands: local open source vs. hosted
19:46 — Programming on the go: agents and mobile/devices
21:36 — Mindset shifts: from hand-coding to AI delegation
23:56 — Technical challenges for agentic tools in large code bases
26:54 — Motivation and approach for building open source models
31:41 — Licensing strategy: open vs. proprietary features
33:10 — Security and private deployments
34:54 — Benchmarking AI agents against real-world tasks
37:59 — Future outlook: expanding access & impact of coding agents
39:20 — Software as a means to larger ends

Conclusion

This episode provides a comprehensive, grounded look at the promise and practical realities of agentic AI tools in development workflows. The guests offer both technical depth and philosophical context, with actionable insights on open source community-building, product strategy, deployment/usage scenarios, and the mindset shifts required to truly leverage AI as a developer or a team.
Highly recommended for technical leaders, hands-on developers, and anyone interested in the future of coding with AI.

Links Mentioned

All Hands AI (try it out; open source and hosted versions)
All Hands Models on Hugging Face
Swebench Benchmark
Practical AI Podcast

Practical AI Podcast Episode Summary

Overview

Key Discussion Points & Insights

1. The Spectrum of Code Generation Tools (02:49)

Tooling Landscape:
- At one end, visual, rapid prototyping tools (e.g., bolt.dev, v0.dev) lower the bar for non-coders or people with limited coding experience.
- At the other, tools for senior developers focus on production-grade code and robust workflows.
Tactical vs. Agentic Tools:
- "Tactical" tools like Copilot provide inline code suggestions within an IDE, helping with immediate tasks.
- "Agentic" tools can autonomously tackle higher-level problems over longer time spans—taking a description of a task, iterating without direct supervision, and delivering results later.
All Hands Positioning:
- "We are really oriented towards senior engineers ... and those more agentic workflows where you are giving an agent something to work on and it can iterate forward on its own without you having to babysit it." (Robert, 04:18)

2. Building Effective Agentic Frameworks (05:32)

Original Expectation: The All Hands team tried building a multi-agent system where different agents would own specific aspects (debugging, software architecture, browsing, etc.).
Surprise in Practice:
- A single, well-resourced agent, able to access broad context and use different "microagents" (tools or prompts for specific subtasks), performed better and was easier to maintain.
- "It has to have the ability to pull in ... whatever information it needs." (Graham, 06:24)

3. Transitioning from Traditional to Agentic Workflows (07:38)

Copilot as Step 1: Drastically improved autocomplete for code.
The Pain Point: Many devs found themselves copying errors back and forth from ChatGPT—a tedious manual mediation.
Agents Automate the Loop:
- Agents now handle entire feedback cycles: proposing code, executing it, reacting to test output/errors, iterating, and producing ready-to-review pull requests.
- "Today's agents are able to just continue executing ... until it comes to a full PR where all the tests are passing and it's just kind of packaged up and ready to go." (Robert, 10:10)

4. Who Benefits from Agentic Coding Tools? (11:30)

Risks for Junior Developers:
- Agents excel at greenfield MVPs, but over time can create messy, unmaintainable code unless an experienced developer reviews and guides them.
- "If you're not looking over its shoulder and critiquing its work, the code base will just grow into this monster and you'll have to throw it all away." (Robert, 12:33)

5. The Importance of Open Source (13:43)

Democratizing Development:
- "We believe that if this is going to have a big effect on software development, software developers should be able to participate in that." (Graham, 14:22)
Research & Community:
- Open agentic frameworks won't fall behind closed-source because of deep engagement from academia.
- Openness enables collaborative improvement, experimentation, and customizability.

6. Community and Contribution Models (15:37)

Onboarding Contributors:
- Complex projects can make onboarding intimidating, but issues for new contributors are sometimes solved by the agent itself!
- Successful contribution areas: front-end work and collaborative research projects.

7. How All Hands Works: Open Source vs. Hosted (17:39)

Open Source:
- Runs in Docker; can be self-hosted; supports GitHub integration; sandboxed agent operation via per-session Docker containers.
Hosted Cloud Version:
- "App.AllHands.dev" offers seamless infrastructure, rapid sandbox startup, OAuth GitHub integration, and hands-off operation from pull request comments or (soon) Slack.
- "Those are actually the interactions I love the most where ... I just, you know, summon the agent and it just does the work for me and I get to, you know, reap the fruits at the end there." (Robert, 19:16)
Mobile Accessibility:
- "My favorite is programming from my phone ... By the time I get to work I have a pull request to review." (Graham, 19:46)

8. Mindset Shifts & Productivity Hacks (21:36)

From Hands-On to Delegation:
- Learning to trust the agent and let go of micromanagement—comparable to transitioning from coding to managing a team.
- "You have to kind of have that switch flip ... oh no, this is actually a good thing for the agent to work on." (Robert, 21:44)
- "Trust but verify" is key—review agent output and provide feedback.

9. Agents in Large, Real-World Codebases (23:56)

Technical Challenges:
- Editing files reliably (generating diffs) is crucial and was a bottleneck until recently.
- Locating the correct files to edit: models use traditional tools (find, grep) and benefit from code search.
- Avoiding repetitive failure loops: advanced models like Claude handle this better.
In Practice:
- "The Open Hands agent is the largest committer to our code base... it had 209 commits over the past three months ..." (Graham, 23:59)

10. Why Build Custom Open Source Models? (26:54)

Strategic Reasons:
- Openness and customizability.
- "If you're relying on a closed API based model entirely, then you can never fully achieve that goal." (Graham, 27:49)
- Cost control for large, iterative agentic tasks.
- Ability to fine-tune for customer needs (e.g., languages, workflows).
Model Launches:
- All Hands recently released their first open model (v0.1) and plan continued development.
Product Integration:
- Exploring routing certain tasks to cheaper or custom models without sacrificing accuracy.
- "We built this model specifically based on the data set that we've gathered. And that's a really cool product feature ..." (Robert, 30:13)

11. Openness and Enterprise Strategy (31:41)

Model and Framework Licensing:
- Core research and agent logic will remain open source; advanced team features (multi-tenancy, compliance, auditing) will be proprietary.
Enterprise-Grade Security:
- Three deployment options:
  - Fully open source/self-hosted (integrate with approved local models or Bedrock)
  - Managed cloud (Anthropic, via All Hands infrastructure)
  - Private cloud deployment for security-conscious clients ("all configured to stay within your walls." —Robert, 33:26)

12. Benchmarking and Performance (34:54)

Evaluation Approach:
- Swebench and 20+ other benchmarks via a large evaluation harness, maintained with academic partners.
- Focused on real-world use cases (web navigation, fixing broken commits, test generation, version updates).
- Collaborating with academic teams to broaden the scope and relevance of benchmarks.

Notable Quotes & Memorable Moments

On the Power of Agentic Tools:
- "I feel like a wizard again. I can get so much done using large language models and using agents." (Robert, 38:19)
On Democratizing Software Creation:
- "[If] this is going to have a big effect on software development, software developers should be able to participate..." (Graham, 14:22)
On AI’s Transformative Potential:
- "I'm building agents to create software ... not because software is the end, it's because software is a means to an end ... If we can make it easy ... we'll be able to make great strides forward." (Graham, 39:28)

Timestamps for Important Segments

02:49 — Overview of coding tool landscape; All Hands’ position
05:32 — Surprises building agentic frameworks
07:38 — What makes agentic workflows different for users
11:30 — Who is (and isn't) successful with agentic coding tools
13:43 — Why the open-source approach matters
15:37 — Developer onboarding and community contributions
17:39 — How to use All Hands: local open source vs. hosted
19:46 — Programming on the go: agents and mobile/devices
21:36 — Mindset shifts: from hand-coding to AI delegation
23:56 — Technical challenges for agentic tools in large code bases
26:54 — Motivation and approach for building open source models
31:41 — Licensing strategy: open vs. proprietary features
33:10 — Security and private deployments
34:54 — Benchmarking AI agents against real-world tasks
37:59 — Future outlook: expanding access & impact of coding agents
39:20 — Software as a means to larger ends

Conclusion

Links Mentioned

All Hands AI (try it out; open source and hosted versions)
All Hands Models on Hugging Face
Swebench Benchmark
Practical AI Podcast

Open source AI to tackle your backlog

Powered by Wave AI

Summary

Practical AI Podcast Episode Summary

Overview

Key Discussion Points & Insights

1. The Spectrum of Code Generation Tools (02:49)

2. Building Effective Agentic Frameworks (05:32)

3. Transitioning from Traditional to Agentic Workflows (07:38)

4. Who Benefits from Agentic Coding Tools? (11:30)

5. The Importance of Open Source (13:43)

6. Community and Contribution Models (15:37)

7. How All Hands Works: Open Source vs. Hosted (17:39)

8. Mindset Shifts & Productivity Hacks (21:36)

9. Agents in Large, Real-World Codebases (23:56)

10. Why Build Custom Open Source Models? (26:54)

11. Openness and Enterprise Strategy (31:41)

12. Benchmarking and Performance (34:54)

Notable Quotes & Memorable Moments

Timestamps for Important Segments

Conclusion

Links Mentioned

Summary

Practical AI Podcast Episode Summary

Overview

Key Discussion Points & Insights

1. The Spectrum of Code Generation Tools (02:49)

2. Building Effective Agentic Frameworks (05:32)

3. Transitioning from Traditional to Agentic Workflows (07:38)

4. Who Benefits from Agentic Coding Tools? (11:30)

5. The Importance of Open Source (13:43)

6. Community and Contribution Models (15:37)

7. How All Hands Works: Open Source vs. Hosted (17:39)

8. Mindset Shifts & Productivity Hacks (21:36)

9. Agents in Large, Real-World Codebases (23:56)

10. Why Build Custom Open Source Models? (26:54)

11. Openness and Enterprise Strategy (31:41)

12. Benchmarking and Performance (34:54)

Notable Quotes & Memorable Moments

Timestamps for Important Segments

Conclusion

Links Mentioned