Agentic AI Weekly | Berkeley RDI | May 13, 2026
Agentic AI Summit Early Bird Tickets and Expanded Speaker List, AgentX–AgentBeats Phase 2, Sprint 2 + OpenEnv Custom Track Winners Announced
Agentic AI Summit 2026 (More Featured Speakers Announced, Early-Bird Pricing Ending Soon, and Limited Number of Tickets Left!)
Save the date! The Agentic AI Summit returns to Berkeley on August 1–2, 2026, welcoming 5,000+ expected in-person attendees for two days of insights and innovation. Building on last year’s sold-out success—with 2,000+ in‑person attendees and 40,000+ global livestream participants—the summit will bring together researchers, builders, industry leaders, and the global agentic AI community for keynotes, technical talks and panels, hands-on workshops, live demos, and more!
In addition, we are excited to showcase our expanded list of speakers for the Summit! We are honored to have such a great group of academics, founders, executives, and investors participate in this year’s event, and more will be announced soon!
🎟️ Early‑Bird Pricing (Limited Capacity)
A limited number of early‑bird tickets are still available:
Student Early-Bird: $199 (very few student tier tickets left; ensure you buy your tickets as soon as possible before they sell out!)
Standard Early-Bird: $399
If you’re looking to secure the best ticket price and be part of the conversation shaping the future of Agentic AI, we encourage you to register early. We look forward to welcoming you to Berkeley this August.
Sponsorship Opportunities
Partner with us to shape the future of Agentic AI. If you’re interested in sponsoring the summit, please complete the sponsorship application form. Sponsorship opportunities are limited and reviewed/allocated on a rolling basis, so we encourage you to apply early.
AgentX–AgentBeats Highlights: Phase 2, Sprint 4 is Underway!
Phase 2, Sprint 4 of the AgentX–AgentBeats competition is now underway! For this final sprint, all submissions are due by Sunday, May 24th, and we can’t wait to see all of your great projects! We’ve opened the submission form for Sprint 4, which you can access by clicking the button below!
For Phase 2, participants are building purple agents to tackle the select top green agents from Phase 1 and compete on the public leaderboards. Unlike Phase 1, where participants competed across all tracks throughout the entire duration, Phase 2 introduces a sprint-based format. The competition is organized into four rotating sprints.
Sprint 4 Details:
Deadline: May 24, 2026
The 4th Sprint is the grand finale of AgentX-AgentBeats, focused on general-purpose agents. While earlier sprints emphasized depth within specific tracks, Sprint 4 emphasizes breadth: strong, consistent performance across many green agents, benchmarks, and evaluation categories. Sprint 4 includes all green agents from the first three Phase 2 sprints, plus additional selected benchmarks introduced for this final sprint:
Game Agent: Build What I Mean; Minecraft Benchmark
Finance Agent: OfficeQA
Business Process Agent: DeoGaze / Entropic CRMArena
Research Agent: FieldWorkArena; MLE-Bench; Mind2Web 2; BrowseComp+
Multi-agent Evaluation: MAizeBargAIn
τ²-Bench: τ²-Bench
Computer Use & Web Agent: CAR-bench; OSWorld-Verified
Agent Safety: Pi-Bench
Coding Agent: SWE-bench Pro; Terminal Bench 2.0; NetArena
Cybersecurity Agent: CyberGym
To be eligible for Sprint 4 judging, a team must evaluate its purple agent on at least 5 green agents spanning at least 3 distinct categories. Teams are strongly encouraged to go beyond this minimum; broader coverage across more green agents and more categories will be viewed favorably.
Judging will reward purple agents that demonstrate strong cross-benchmark performance, category diversity, generality, cost efficiency, and technical quality. A strong Sprint 4 submission should show that the same purple-agent architecture can adapt across substantially different task types without benchmark-specific hardcoding or special-case lookup tables.
Participants are encouraged to compete in multiple tracks across multiple sprints during Phase 2. Teams and team members who submit purple agents in any sprint will also be eligible to enter a raffle for free tickets to the Agentic AI Summit later this year.
For more details on each sprint and how to compete in Phase 2, please refer to the AgentX–AgentBeats website!
Phase 2, Sprint 2 Winners Announced!
We’re excited to recognize the winning teams of Phase 2, Sprint 2. We saw so many strong submissions this sprint across all five tracks, so thank you to all who submitted their agents!
Winning Teams:
Research Agent Track
🥇 1st Place (Tie): MIDS4LIFE (FieldWorkArena)
🥇 1st Place (Tie): MLE-Squad (MLE-Bench)
τ²-Bench Track
🥇 1st Place: AgentSWE (τ²-Bench)
Computer-Use and Web Agent Track
🥇 1st Place (Tie): CAReful (CAR-bench)
🥇 1st Place (Tie): Entouch (OSWorld-Verified)
OpenEnv Custom Track Winners
After a highly competitive competition, we are happy to reveal the winners of the OpenEnv Custom Track! Participants built production-ready environments that span coding tasks, interactive simulations, robotics-inspired control problems, and entirely new categories of agentic challenges over the course of the challenge. Our judges were very impressed with the work you all submitted; congratulations to all of the winning teams!
🥇 1st Place: Team GPU Scheduler
🥈 2nd Place: Team MateFin
🥉 3rd Place: Team SylloGym
We want to thank the PyTorch team at Meta, Hugging Face, and Unsloth for helping us put on such an amazing competition, and thanks to all who submitted their projects!
Trends This Week
OpenAI introduced three new real-time voice models this past week: GPT-Realtime-2, GPT-Realtime-Translate, and GPT-Realtime-Whisper, expanding its push into live conversational AI and voice-based agents. GPT-Realtime-2 brings GPT-5-class reasoning to real-time speech interactions, including parallel tool use, longer context windows, and more controllable conversational behavior. OpenAI reported that GPT-Realtime-2 achieved 96.6% accuracy on Big Bench Audio evaluations, up from 81.4% for GPT-Realtime-1.5, and stated that companies including Zillow, Priceline, and Deutsche Telekom are already building customer support, travel, and real estate voice systems on top of the models. All three models are available in the OpenAI API with multiple layers of existing safeguards and mitigations to ensure model safety.
Anthropic published the first formal research agenda for its new internal research group, The Anthropic Institute, outlining how the company plans to study the societal, economic, and security implications of advanced AI systems from inside a frontier lab environment. The agenda focuses on four core areas — economic diffusion, threats and resilience, AI systems in the wild, and AI-driven R&D — and includes research into AI-enabled surveillance, labor market disruption, and “fire drill” scenarios for potential emergencies. Anthropic wrote that “AI-driven AI R&D may be a ‘natural dividend’ of making smarter and more capable systems,” while also warning that “AI-driven AI R&D holds within itself the potential for significant danger.” In addition, the agenda notes that the Institute will publish higher-frequency Economic Index data, worker surveys, threat research, and analyses of how AI is accelerating Anthropic’s own internal R&D processes.
Google DeepMind released a new report this past week detailing how AlphaEvolve — a Gemini-powered coding agent it introduced last year — is now being deployed across Google’s infrastructure, scientific research, and commercial applications. DeepMind said the system combines LLM-generated code, automated evaluation systems, evolutionary search, and large-scale experimentation loops to iteratively improve algorithms, and reported that AlphaEvolve recovered roughly 0.7% of Google’s worldwide compute resources through data-center scheduling optimization. The company also highlighted deployments with Klarna for transformer model training optimization, PacBio for DNA sequencing accuracy improvements, and Schrödinger for accelerating molecular simulations, while continuing to frame AlphaEvolve as a general-purpose “algorithm discovery engine.”
Don’t miss the developments shaping Agentic AI. Subscribe for weekly coverage of groundbreaking research, emerging trends, and critical insights across Agentic AI and the broader AI landscape.










