The Sched app allows you to build your schedule but is not a substitute for your event registration. You must be registered for Open Source Summit North America 2025 to participate in the sessions. If you have not registered but would like to join us, please go to the event registration page to purchase a registration.
This schedule is automatically displayed in Mountain Daylight Time (UTC/GMT -6). To see the schedule in your preferred timezone, please select from the drop-down menu to the right, above "Filter by Date."
IMPORTANT NOTE: Timing of sessions and room locations are subject to change.
Sign up or log in to add sessions to your schedule and sync them to your phone or calendar.
Imagine you’re responding to a production incident, and you’re trying to answer simple questions about it. How many systems do you need to consult when assessing the impact of events at your company? Do you manage different technology stacks for observability, security, and business intelligence?
What if we told you, you could create a unified stack capable of serving all stakeholders simultaneously? In this talk, Mya and Josh explore how open source technologies like ClickHouse, OpenTelemetry, and Grafana enable complex business use cases using modern tooling and practices.
Regardless of your function, you will leave with a deeper understanding of how consolidating these concerns into a unified stack reduces technical complexity and provides a common language for everyone to use - from engineers building new features and product managers evaluating their success, to operators keeping the lights on and C suite’s birds-eye view of the company.
Whether you’re working with a data lake, or more of a data pond, we offer practical architectures and solutions to streamline your operations and bring your stakeholders together, all while using fewer resources.
A brilliant, talented, self-taught, ambivert who loves attending and speaking at conferences. I love tinkering with small board computers like raspberry pis.☕ If you see me around, don't hesitate to come say hi!🏒 Hockey player since I was 7💻 Programming since I was 14... Read More →
Whether it’s operators or observability, agile or accessibility, my expertise shines because I’m passionate about all of it. I’ve been building software for more than a decade and I love sharing experiences via public speaking. I’m currently a Developer Advocate for Altinity... Read More →
If you've been around the container and cloud native ecosystem for any length of time, you know most major components are written in Go: from Docker to runc and from Kubernetes to etcd! This means that many of the common constructs, for example the OCI specs, or Kubernetes API resources, are easy to use from other Go programs, but not quite as easy when you step outside of the Go ecosystem.
In this talk we'll dive into the experience of trying to use containers from a Rust-written client and delve into existing work from early adopters of Rust. There are quite a few crates that help us along the way, providing some level of parity for Rust developers in the cloud native ecosystem. There are still complexities and hurdles as well, and we'll share our experience navigating this as a long-time Go programmer and Rust newbie.
Attendees will take away some quick tips as well as gotchas for working in the container and cloud native ecosystem as a Rust developer and, who knows, maybe soon the Gopher and the Crab will be the best of friends.
Principal Engineer, Core Container Technology, AWS
Phil is a Principal Engineer for Amazon Web Services (AWS), focused on core container technologies that power AWS container offerings like Fargate, EKS, and ECS. Phil is an active contributor and maintainer for the CNCF containerd runtime project, and participates in the Open Container... Read More →
As data volumes grow and real-time processing becomes essential, traditional cloud architectures face limitations in cost, latency, and security. The traditional approach moves all edge data to where the queries are executed—in the cloud—leading to inefficiencies and high costs. EdgeLake (https://lfedge.org/projects/edgelake/), an LF Edge project, takes the opposite approach by bringing queries to the source data at the edge, enabling decentralized data management and local AI/ML processing.
In this talk, we’ll explore how EdgeLake eliminates cloud dependencies, optimizes data infrastructure, and reduces operational costs while ensuring real-time decision-making at the edge. We’ll discuss key use cases (and show a live demo) across industrial automation, smart cities, energy, and telecom, demonstrating how organizations can leverage EdgeLake to unlock the full potential of edge computing.
Join us to learn how EdgeLake is reshaping the future of distributed data architectures and making edge intelligence more accessible.
Moshe Shadmon, CEO at Anylog. AnyLog’s Virtual Edge Data Network is a Plug & Play software, deployed at the edge, allowing real-time insight without centralizing the data. AnyLog enables deployment of applications and AI at the distributed edge. Prior to AnyLog, Moshe was the CEO... Read More →
Managing a few Kubernetes clusters may be feasible, but scaling up to hundreds or thousands introduces unique challenges. At a 100:1 cluster to engineer ratio, standardization, observability, security, and access control become pressing issues. This is when DevOps must shift from "infrastructure engineers" to "platform engineering," where infrastructure needs are fully automated and self-service.
As K8s adoption grows in large organizations, demand for "massive multi-cluster fleet management" support has intensified. This talk examines essential features for Kubernetes fleet controllers, offering a fast-paced review of five open-source tools: Clusternet, Karmada, Crossplane, ClusterAPI, and Rancher. Each tool's unique strengths in provisioning, management, and application support will be covered, showing how each addresses multi-cluster management challenges.
This approach will provide a replicable framework to evaluate & choose the right tools based on specific organizational needs.
Mickael is a self-taught developer turned DevOps, passionate about automation, innovation, and creative problem-solving. Mickael enjoys challenging himself and experimenting with new technologies and methodologies. Currently, he is working on developing the next-gen K8s troubleshooting... Read More →
This session explores the integration of eBPF and OpenTelemetry (OTel) for achieving unparalleled observability and performance in 5G networks. By leveraging the K8s Operator framework, we demonstrate the Kubernetes-native deployment of advanced observability tools, including the bpfman stack for managing eBPF programs and the OpenTelemetry Operator for scalable telemetry pipelines. Participants will gain actionable insights into optimizing 5G Cloud Native Network Functions (CNFs) through precise observability, robust performance metrics, and real-time diagnostics, while ensuring security and multi-tenancy.
Fatih E. NAR brings extensive experience and influence to Linux, OpenStack, and Kubernetes ecosystems. His contributions drive progressive development and foster a robust TME community. With a background at Google, Verizon Wireless, Canonical Ubuntu, and Ericsson, Fatih's diverse... Read More →
Jamie Parker is a Product Manager at Red Hat who specializes in Observability, particularly in the Logging and OpenStack areas. At Red Hat, Jamie works with organizations and customers to learn about their needs within the ever changing Observability landscape, and based on their... Read More →
Travel retail, shopping and booking stays, activities, or trips, is ripe for disruption using open source. The travel industry was a pioneer in worldwide interconnectivity between suppliers and sellers starting in the 1970s. Fifty years on the process of shopping and purchasing travel products has been computerized exposed online but works largely the same as it always did. Even as some legacy components have been replaced, their limitations are still present embodied in workflows and policy. Stateful, transactional, processing is still the order of the day. It all needs to be overhauled to move into a stateless, cloud based, digital world. Unaffordable in the current bespoke, siloed world of reservation systems, distributors and channels. Open Source is the obvious answer. As a community make one investment to create the needed foundational capabilities such as offer/order management, security, identity management, event management, rules processing and much more. Noncompetitive functions everyone needs to create traveler solutions such as end to end trip solions and management via an AI powered app. This session will explain how it works.
44 years in travel IT. From Mainframes in the 70s to microservices and cloud exploitation. Recently a VP of architecture for a major travel IT provider.
Confidential AI leveraging GPUs can bring AI to the masses without sacrificing the privacy of end users. Individual open source technologies already exist to configure, deploy, and manage confidential TEEs. However, clobbering a multitude of components into a coherent, secure, and efficient solution is challenging with many pitfalls. For example, depending on use cases and involved parties (cloud/model/service owners), attestation and key management methodology can vary drastically. In addition, for TEEs with confidential GPUs, complexity extends to increased load times, affecting services that serve multiple models.
This talk will go through key components and design decisions needed to enable confidential AI. Specifically: i) implications of different trust models on the solution and (ii) performance tradeoff considerations. To concretize the discussion, we will present a detailed end-to-end 'how to', for deploying an inference service on Nvidia H100 GPUs and AMD-based TEE with a focus on protecting the model and the user input. The audience will be able to appreciate why there can be no one size fit all confidential AI solution and understand what design works for them.
Julian Stephen is a research scientist in the security group at IBM T. J. Watson Research Center, NY. He is interested in building systems and models that solve real world problems without compromising security and privacy of data. He received his Ph.D. in Computer Science from... Read More →
Michael is currently a research staff member at the IBM T. J. Watson Research Center. His general research interest is in systems security with a focus on containers, virtualization, operating systems, and confidential computing. He enjoys long hacks in the kernel.
Take one single application compiled to WebAssembly and split it into pieces at deployment time. Run these pieces in different Kubernetes deployments, different clouds, or even split across edge and cloud.
This code-forward talk will show how to write an application using Wasm components and a combination of Rust and TypeScript. We'll show how to use the CNCF project Spin for developing apps, and then use Kubernetes, Helm, SpinKube, and other open source tools to deploy this application in multiple locations.
Conceptually, we'll tie this new development pattern to microservice architecture and distributed systems to show how WebAssembly's Component Model is paving the way for a new class of application.
Matt Butcher (CEO) is a founder of Fermyon. He is one of the original creators of Helm, Brigade, CNAB, OAM, Glide, and Krustlet. He has written or co-written many books, including "Learning Helm" and "Go in Practice." He is a co-creator of the "Illustrated Children’s Guide to Kubernetes... Read More →
As the fast-paced AI-driven landscape of computing continues to diversify, the importance of multi-arch container images cannot be overstated. Applications are no longer confined to data centers but extend across multiple platforms, devices, and appliances.
Wouldn’t it be great if we could build images for every architecture from just one machine? It would be even more amazing if we could do that without the slowness of emulation! This is where Podman farm comes in. Podman farm is a new feature that allows you to 'farm' out builds to groups of machines you have access to, enabling you to easily build multi-architecture images with a single command. In this talk, we will highlight the challenges of multi-architecture builds and demonstrate how Podman farm addresses them, keeping performance and usability in mind.
Container images that run seamlessly across different architectures ensure consistency, reduce complexity, and accelerate the development cycle. This session will empower attendees to develop on one architecture and deploy confidently on another.
Urvashi Mohnani is a Principal Software Engineer on the OpenShift Container Tools team at Red Hat. She has spent the last few years contributing to and maintainer open source container tools projects including podman, buidlah, cri-o, and skopeo. She is a co-organizer of DevConf.US... Read More →
Why do some requests take so much longer than others? A major contributor, memory-related contention between containers, was shown to increase latency by 4-13x. It can be triggered by garbage collection, and existing observability cannot even detect it! Current collectors just show high CPU utilization, and the standard mitigation is to scale out and run at low utilization: expensive, and does not solve the response time problem.
We set out to build a new detector, but found that measuring every few seconds (current practice for collectors) is inadequate. Servers quickly jump between intense resource competition and under-utilization, so averaging over seconds does not show any contention. We needed measurements at millisecond frequency.
This session first examines real-world patterns that trigger interference and surveys methods for detecting memory interference, including findings from Google, Alibaba, and Meta's production environments. We'll then discuss the design of the OSS collector, and how it combines CPU performance counters, eBPF and high-resolution timers to identify noisy neighbors. We close with future directions and opportunities to get involved.
I am a maintainer of the OpenTelemetry eBPF network collector, and working on developing tools to detect and mitigate noisy neighbors. I got my PhD in noisy neighbor mitigation (focusing on networking) from MIT, then founded an eBPF-based network observability company, Flowmill, which... Read More →
As confidential virtual machines become mainstream in confidential computing, the Arm Confidential Computing Architecture (CCA) was introduced as a key innovation of Arm v9 in 2021. Linaro has been deeply involved in integrating CCA into open-source projects over the past years. In this presentation, we'll share the progress of our open-source enablement efforts. This includes the current status of fundamental software support and the next-stage plan for projects such as TF - A, Kernel, and Qemu. We'll also talk about container runtime adoption in Kata containers and Confidential containers. For instance, we'll detail the work on supporting CCA in Kata container runtimes with Qemu backend, like in kata-deploy. The support for guest-components and Trustee in Confidential containers will be covered too. Remote attestation is another crucial aspect that can't be overlooked. To reduce solution fragmentation in open-source projects for production, Arm and Linaro are collaborating on an end-to-end experimental attestation platform using Veraison project components. We'll present a case study from the Confidential Containers project to show the practical adoption of these technologies.
Kevin Zhao is currently the tech lead at Linaro Data Center Group. He has been working on Arm server ecosystem for more than 8 years, including the open source IAAS solutions, distribute storage and confidential computing. Now, he is actively working on Arm Confidential Computing... Read More →
Our cloud native world has become more than just tooling, it's an entire ecosystem with many add-ons, complementary tools, when it comes to K8s CRDs, and services that provide its powerful capabilities and infinite scale...but at what cost?
In this talk, we'll share first of its kind research that will highlight the 5 most common OSS cloud native tools killing your observability costs. We'll start by exploring how different observability tools structure pricing, the complexities that compound cost calculation, and especially which OSS tools in your stack are the most resource-intensive services.
You'll discover how you can know whether it's KEDA or Karpenter, ArgoCD or Kyverno ballooning budgets. But don't panic! We'll wrap up with good practices for configuring popular tools to be more economical, so you can leverage the powerful K8s ecosystem without breaking the bank.
Amir Jakoby is a seasoned technology executive with over 18 years of experience in software engineering, leadership, & product innovation. He currently serves as Co-Founder and CTO of Sawmills.ai. Previously, as VP of Engineering at New Relic, Amir led a global team of 85 engineers... Read More →
Monty Python’s Black Knight is the opponent that couldn’t lose. Even after all of his limbs were cut off, he offered a draw: “it’s just a scratch.”
FoundationDB (FDB) is a distributed transactional key-value store that is very difficult to defeat just like the Black Knight. Open-sourced in 2018 after an acquisition by Apple, FDB was designed to be a common layer: almost all databases have a backing key-value store. Many have built on top of it including Snowflake, Adobe, & Datadog.
FDB got it right: transactions, distributed by default, and extreme reliability. Kyle Kingsbury (aphyr) the author of Jepsen series on distributed systems correctness, said: "haven't tested foundation in part because their testing appears to be waaaay more rigorous than mine."
We demo a live FDB cluster and try to disrupt its operations. Our attempts are informed by real world experience supporting a metadata service for billions of objects globally.
When we finally succeed, we show how backups and disaster recovery resurrect FDB. We’ll learn about highly resilient design patterns and operations. We have battle scars, and want to help others!
Peter is a founding engineer at Tigris Data. He has been using and working with open source software from early 2000s. Peter's first and foremost professional interest is performance tuning and large scale automation. Before rejoining Tigris Data, Peter worked on large scale MySQL... Read More →
Traefik is one of the most popular open-source projects in the world, with over 3 billion downloads and a top 15 spot on DockerHub. As a powerful Ingress and Gateway Controller, Traefik simplifies exposing, securing, and managing services and APIs dynamically and at scale—whether in simple setups or complex cloud-native environments.
Just one year after the release of Traefik v3, we're already taking things to the next level! In this session, Emile Vauge (Traefik Creator) and Nicolas Mengin (Traefik Maintainer) will unveil the exciting new features coming in Traefik v4, including: - A new plugin system for even greater extensibility - Pre-routing operations to optimize traffic handling - Enhanced TLS certificate management for better security and automation - Improved configuration management for a smoother experience - … and much more!
Join us to get a sneak peek at what’s next for Traefik and see how these innovations will make your cloud-native journey even easier.
Developer and DevOps - Maintainer of Traefik. Head of Development at Traefik Labs, the company behind Traefik, the popular cloud-native Gateway Controller, and Traefik Hub, a comprehensive API Management solution for Kubernetes. Responsible for overseeing the implementation of... Read More →
Containers have changed how we build and run services. The days of FTPing a binary up to a server are gone, because our platforms expect to run containers. We build container images at every stage of development, whenever we want to test our services, and when we deploy them.
If container images are what we need, could our development tools help us build them? Yes, they can! In this talk, we'll take a container image apart, see what makes it tick, then put it back together again from first principles - all using Swift!
Swift is a high performance, memory-safe language which is ideal for server-side development. We will:
* download a container image, take it apart by hand and explore what’s inside; * cross-compile a Swift service effortlessly to different Linux distributions, on x86 or ARM, statically or dynamically linked, from development environments on macOS or Linux; * use Swift's pluggable build system to produce container images efficiently and automatically for every build; * test the image.
Containers are a universal building block of modern services. Even if you're not yet using Swift, these ideas and principles also underpin your current build and deployment workflow.
Euan builds cloud services and infrastructure using Swift at Apple. He enjoys working with containers, virtual machines, networks and interesting programming languages. Previously, Euan helped maintain Docker Swarm's overlay networking and HTTP ingress, and contributed to XenServer's... Read More →
Context propagation is the cornerstone of observability in distributed systems, but traditional approaches often falter in non-linear workflows like message queues, event-driven architectures, state transitions, or shared resources. Span links powered by OpenTelemetry, bridge this gap by enabling connections between spans across unrelated execution contexts.
This session explores a practical use case where span links augment context propagation in an event-driven microservices system. We'll demonstrate how to track a single user's transaction across services that communicate asynchronously. Using tools like OpenTelemetry and compatible backends, we'll show how span links resolve visibility challenges, uncover hidden latencies, and maintain trace continuity even when the standard parent-child relationships break.
Ekansh is a Software Development Engineer, with active involvement in various open-source and cloud native communities for upwards two years now. He was previously an SDE Intern at SteamLabs. He is also a speaker for a couple of talks at PyCon, KubeCon and MozFests. Ekansh is a Google... Read More →
Haardik is passionate about building scalable backend systems with real-world impact. With extensive experience in cloud services, Kubernetes, and backend development, he has developed solutions that improve efficiency and reduce costs. Currently pursuing a Master’s in Computer... Read More →