Loading…
June 23 - 25, 2025
Denver, Colorado
View More Details & Registration
Note: The schedule is subject to change.

The Sched app allows you to build your schedule but is not a substitute for your event registration. You must be registered for Open Source Summit North America 2025 to participate in the sessions. If you have not registered but would like to join us, please go to the event registration page to purchase a registration.

This schedule is automatically displayed in Mountain Daylight Time (UTC/GMT -6). To see the schedule in your preferred timezone, please select from the drop-down menu to the right, above "Filter by Date."

IMPORTANT NOTE: Timing of sessions and room locations are subject to change.

Venue: Bluebird Ballroom 3F clear filter
arrow_back View All Dates
Wednesday, June 25
 

11:00am MDT

Triton Inference Server: Supporting Next-Generation AI Workloads - Olga Andreeva & Ryan McCormick, NVIDIA
Wednesday June 25, 2025 11:00am - 11:40am MDT
Triton Inference Server has long been a reliable tool for AI model deployment. As Generative AI unfolds its transformative potential, Triton continues to evolve, offering both time-tested features and new capabilities tailored for large language models and more complex agentic workflows.

This session explores how Triton’s core strengths continue to play a crucial role in optimizing generative AI deployments. These include its robust multi-framework support, dynamic batching, concurrent model execution, and the capability to deploy complex inference pipelines through model ensembling and business logic scripting.

We’ll also cover recent enhancements such as OpenAI compatible frontend, allowing easy integration with existing OpenAI-based applications; Python-based backends to standardize the deployment of Python models without writing a custom C++ backend; Triton CLI to simplify model deployment and management; distributed inference enhancements for Data Center scale.

Throughout the presentation, we’ll share practical examples and best practices, equipping our listeners with the knowledge to effectively use Triton Inference Server to optimize AI workloads’ performance and efficiency.
Speakers
avatar for Olga Andreeva

Olga Andreeva

Senior Software Engineer, NVIDIA
Olga Andreeva is a senior software engineer, specializing in machine learning inferencing. With a PhD in Computer Science from the University of Massachusetts Boston and experience in both academia and industry, Olga specializes in translating cutting-edge ML research into robust... Read More →
avatar for Ryan McCormick

Ryan McCormick

Senior Software Engineer, NVIDIA
Ryan McCormick is a senior software engineer working at the intersection of machine learning, systems software and distributed systems at NVIDIA. He is responsible for developing scalable and performant inference solutions, with a current focus on the Triton Inference Server and Triton... Read More →
Wednesday June 25, 2025 11:00am - 11:40am MDT
Bluebird Ballroom 3F
  Open AI + Data
  • Audience Experience Level Any

11:55am MDT

Harnessing Event-Driven and Multi-Agent Architectures for Complex Workflows in Generative AI System - Mary Grygleski, Callibrity
Wednesday June 25, 2025 11:55am - 12:35pm MDT
Generative AI applications, in general, excel in zero-shot and one-shot types of specific tasks. However, we live in a complicated world and we are beginning to see that today’s generative AI systems are simply not well equipped to handle the increased complexity that is found especially in business workflows and transactions. Traditional architectures often fall short in handling the dynamic nature and real-time requirements of these systems. We will also need a way to coordinate multiple components to generate coherent and contextually relevant outputs. Event-driven architectures and multi-agent systems offer a promising solution by enabling real-time processing, decentralized decision-making, and enhanced adaptability.

This presentation proposes an in-depth exploration of how event-driven architectures and multi-agent systems can be leveraged to design and implement complex workflows in generative AI. By combining the real-time responsiveness of event-driven systems with the collaborative intelligence of multi-agent architectures, we can create highly adaptive, efficient, and scalable AI systems. This presentation will delve into the theoretical and practical sides.
Speakers
avatar for Mary Grygleski

Mary Grygleski

Director, Emerging Technologies, Callibrity
Mary is a Technical Advocate, Java Champion, and the Director of Emerging Technologies at Callibrity. She started as an engineer in Unix/C, then transitioned to Java around 2000 and has never looked back since then. After 20+ years of being a software engineer and technical architect... Read More →
Wednesday June 25, 2025 11:55am - 12:35pm MDT
Bluebird Ballroom 3F
  Open AI + Data

2:10pm MDT

Tutorial: Understanding the Carbon Impact of Your Machine Learning Applications - Neeraj Pandey, Vivid Climate & Priyanshi Arora
Wednesday June 25, 2025 2:10pm - 3:45pm MDT
This session will guide attendees through the process of understanding and mitigating the carbon emissions of machine learning models and AI systems. We'll delve into methods for measuring the environmental impact of these technologies and discuss the pivotal role developers play in pioneering eco-conscious computing. Participants will gain insights into optimizing algorithms, adopting sustainable coding practices, and choosing energy-efficient tools to minimize the carbon footprint of their machine learning projects.

Additionally, we'll examine the environmental considerations of deploying AI systems in the cloud. As cloud computing becomes integral to deploying AI solutions, understanding its ecological impacts is crucial. We'll cover strategies for making environmentally responsible decisions when selecting and utilizing cloud services, aiming to maintain the eco-friendliness of AI applications.

Together, we'll explore how to balance the demands of advanced computational technologies with the urgent need for sustainability.
Speakers
avatar for Neeraj Pandey

Neeraj Pandey

Co-Founder, Vivid Climate
Neeraj is the co-founder of Vivid Climate, a climate management and accounting platform. Neeraj is a polyglot. Over the years, he has worked on a variety of full-stack software and data-science applications, as well as computational arts, and likes the challenge of creating new tools... Read More →
avatar for Priyanshi Arora

Priyanshi Arora

Brand Data Analyst
Priyanshi is a brand data analyst and creative artist.
Wednesday June 25, 2025 2:10pm - 3:45pm MDT
Bluebird Ballroom 3F
  Open AI + Data

4:20pm MDT

Accelerating GenAI Innovation: Lessons From Intuit's Agents and Tools Framework - Shradha Ambekar & Conrad De Peuter, Intuit
Wednesday June 25, 2025 4:20pm - 5:00pm MDT
Join us to discover how Intuit's GenAI framework is reshaping AI development, enabling swift integration of AI functions across varied business units. We'll focus on a robust framework of reusable agents and tools derived from open-source technologies like LangChain/LangGraph, facilitating diverse functionalities from simple data retrieval to complex processes such as query generation, optimization, pipeline creation and debugging. This framework dramatically reduces the time required for data workers to operationalize data pipelines and supports diverse customer interactions through notebooks, no-code approaches, REST integrations, and Python libraries, catering to a wide range of needs including agent developers and teams in pre-production settings. Our meticulous evaluation process ensures that each tool and agent is rigorously tested against high-performance benchmarks to guarantee reliability and consistency before deployment. By centralizing these AI components, Intuit has not only accelerated development timelines but also upheld a high standard of quality, establishing a benchmark for crafting scalable, effective AI solutions in the dynamically evolving tech landscape.
Speakers
CD

Conrad De Peuter

Senior Staff AI Scientist, Intuit
Conrad De Peuter is a Senior Staff AI Scientist and Manager at Intuit. He has worked on deep learning models in the document understanding space, delivering reusable AI services from a central platform, and most recently as the lead for a portfolio of product-focused R&D projects... Read More →
avatar for Shradha Ambekar

Shradha Ambekar

Senior Staff Software Engineer, Intuit
Shradha Ambekar is a senior staff software engineer with the Data Platform Group at Intuit. She is an experienced technologist and has led projects working with GENAI, Spark, Kafka, Presto, Athena, Cassandra and Vertica. She has made numerous open-source contributions to presto, calcite... Read More →
Wednesday June 25, 2025 4:20pm - 5:00pm MDT
Bluebird Ballroom 3F
  Open AI + Data
 
  • Filter By Date
  • Filter By Venue
  • Filter By Type
  • Audience Experience Level
  • Timezone

Share Modal

Share this link via

Or copy link

Filter sessions
Apply filters to sessions.
Filtered by Date -