Time to make generative AI a reality for your application. This session is all about building high-performance gen AI applications fast with Cloud SQL for MySQL and PostgreSQL. Learn about Google Cloud’s full-stack solutions that make gen AI app development, deployment, and operations simple – even when you’re shipping high-performance, production-grade applications. We’ll highlight best practices for getting started with Vertex AI, Cloud Run, Google Kubernetes Engine, and Cloud SQL, so you can focus on gen AI application development from the get-go.
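For readers who want a concrete starting point, here is a minimal sketch of the pattern this session describes: generate an embedding with the Vertex AI SDK, then run a pgvector similarity search against Cloud SQL for PostgreSQL over the Cloud SQL Python Connector. The project, instance connection name, credentials, table schema, and embedding model ID are illustrative placeholders, not anything prescribed by the session.

```python
# Sketch: Vertex AI embeddings + pgvector search on Cloud SQL for PostgreSQL.
# Instance name, credentials, table schema, and model ID below are illustrative.
import vertexai
from vertexai.language_models import TextEmbeddingModel
from google.cloud.sql.connector import Connector

vertexai.init(project="my-project", location="us-central1")
model = TextEmbeddingModel.from_pretrained("text-embedding-005")  # assumed model ID

def embed(text: str) -> list[float]:
    # One embedding per input string; .values is the raw float vector.
    return model.get_embeddings([text])[0].values

connector = Connector()
conn = connector.connect(
    "my-project:us-central1:my-instance",  # assumed instance connection name
    "pg8000",
    user="app-user",
    password="change-me",
    db="appdb",
)

query_vec = embed("How do I reset my password?")
vec_literal = "[" + ",".join(str(v) for v in query_vec) + "]"

cur = conn.cursor()
# Assumes a `docs(content text, embedding vector(768))` table created with pgvector.
cur.execute(
    "SELECT content FROM docs ORDER BY embedding <-> %s::vector LIMIT 5",
    (vec_literal,),
)
for (content,) in cur.fetchall():
    print(content)
conn.close()
connector.close()
```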
Topic: Kubernetes, 560 tagged sessions (talk-data.com)
There are cases where you can’t use Google Cloud services but still want all the benefits of AlloyDB’s AI integration, serving a local model directly to the database. In such cases, AlloyDB Omni deployed in a Kubernetes cluster can be a great solution, covering these edge cases and keeping all communication between the database and the AI model local.
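As a rough illustration of the “keep everything local” pattern, the sketch below uses the Kubernetes Python client to run a model server as a ClusterIP-only Deployment and Service in the same namespace as AlloyDB Omni, so database-to-model traffic never leaves the cluster. The namespace, image, and port are assumed placeholders; deploying AlloyDB Omni itself (typically via its operator) is out of scope here.

```python
# Sketch: run a local embedding-model server next to AlloyDB Omni so database-to-model
# traffic stays inside the cluster. Namespace, image, and port are illustrative.
from kubernetes import client, config

config.load_kube_config()  # or config.load_incluster_config() inside the cluster
apps, core = client.AppsV1Api(), client.CoreV1Api()

labels = {"app": "local-embedding-model"}
container = client.V1Container(
    name="model-server",
    image="us-docker.pkg.dev/my-project/models/embedding-server:latest",  # assumed image
    ports=[client.V1ContainerPort(container_port=8080)],
)
deployment = client.V1Deployment(
    metadata=client.V1ObjectMeta(name="local-embedding-model", namespace="alloydb-omni"),
    spec=client.V1DeploymentSpec(
        replicas=1,
        selector=client.V1LabelSelector(match_labels=labels),
        template=client.V1PodTemplateSpec(
            metadata=client.V1ObjectMeta(labels=labels),
            spec=client.V1PodSpec(containers=[container]),
        ),
    ),
)
service = client.V1Service(
    metadata=client.V1ObjectMeta(name="local-embedding-model", namespace="alloydb-omni"),
    spec=client.V1ServiceSpec(
        selector=labels,
        ports=[client.V1ServicePort(port=80, target_port=8080)],
        type="ClusterIP",  # no external exposure; reachable only from inside the cluster
    ),
)
apps.create_namespaced_deployment(namespace="alloydb-omni", body=deployment)
core.create_namespaced_service(namespace="alloydb-omni", body=service)
```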
Maximize your Gen AI inference performance on GKE. This session dives into the latest Kubernetes and GKE advancements, revealing how to achieve significant cost savings, reduced latency, and increased throughput. Discover new inference features on GKE for optimizing load balancing, scaling, accelerator selection, and overall usability. Plus, hear directly from Snap Inc. about their journey re-architecting their inference platform for the demands of Gen AI.
Get the most out of your Google Cloud budget. This session covers cost-optimization strategies for Compute Engine and beyond, including Cloud Run, Vertex AI, and Autopilot in Google Kubernetes Engine. Learn how to effectively manage your capacity reservations and leverage consumption models like Spot VMs, Dynamic Workload Scheduler, and committed use discounts (CUDs) to achieve the optimum levels of capacity availability for your workloads while optimizing your cost.
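To make the trade-off concrete, here is a back-of-the-envelope comparison of consumption models for a steady baseline plus a bursty peak. All prices and discount rates below are made-up placeholders for illustration only; check the Google Cloud pricing pages or calculator for real figures.

```python
# Illustrative cost comparison: a steady 10-vCPU baseline plus a 20-vCPU burst.
# Every number here is an assumed placeholder, not a real Google Cloud price.
ON_DEMAND = 0.040            # $/vCPU-hour (assumed)
CUD_3YR   = ON_DEMAND * 0.45 # committed use discount, assumed ~55% off
SPOT      = ON_DEMAND * 0.25 # Spot VMs, assumed ~75% off (preemptible capacity)

HOURS = 730                  # roughly one month
baseline_vcpu, burst_vcpu, burst_hours = 10, 20, 100

all_on_demand = ON_DEMAND * (baseline_vcpu * HOURS + burst_vcpu * burst_hours)
blended = CUD_3YR * baseline_vcpu * HOURS + SPOT * burst_vcpu * burst_hours

print(f"All on-demand:              ${all_on_demand:,.0f}/mo")
print(f"CUD baseline + Spot bursts: ${blended:,.0f}/mo")
```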
eBPF has revolutionized Kubernetes networking. Cilium, the leading eBPF-based container networking interface (CNI), is now emerging as a standard on major cloud providers like Google Kubernetes Engine (GKE). It provides superior scalability, security, and observability compared to traditional CNIs. eBPF also powers Hubble for network & security insights and Tetragon for runtime security enforcement. Find out how to leverage these tools to get the most out of your GKE cluster.
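As a small taste of what Cilium adds, the sketch below applies a CiliumNetworkPolicy with the Kubernetes Python client, restricting ingress to an api workload to traffic from frontend pods on one port. The labels, namespace, and port are illustrative, and it assumes the cluster’s dataplane exposes the Cilium CRDs (for example, GKE Dataplane V2 or a self-managed Cilium install).

```python
# Sketch: allow only pods labeled app=frontend to reach pods labeled app=api on 8080.
# Labels, namespace, and port are placeholders; assumes Cilium CRDs are available.
from kubernetes import client, config

config.load_kube_config()
policy = {
    "apiVersion": "cilium.io/v2",
    "kind": "CiliumNetworkPolicy",
    "metadata": {"name": "allow-frontend-to-api", "namespace": "default"},
    "spec": {
        "endpointSelector": {"matchLabels": {"app": "api"}},
        "ingress": [
            {
                "fromEndpoints": [{"matchLabels": {"app": "frontend"}}],
                "toPorts": [{"ports": [{"port": "8080", "protocol": "TCP"}]}],
            }
        ],
    },
}
client.CustomObjectsApi().create_namespaced_custom_object(
    group="cilium.io",
    version="v2",
    namespace="default",
    plural="ciliumnetworkpolicies",
    body=policy,
)
```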
Deploy and scale containerized AI models with NVIDIA NIMs on Google Kubernetes Engine (GKE). In this interactive session, you’ll gain hands-on experience deploying pre-built NIMs, managing deployments with kubectl, and autoscaling inference workloads. Ideal for startup developers, technical founders, and tech leads.
**Please bring your laptop to get the most out of this hands-on session**
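For a flavor of the autoscaling step, here is a sketch that attaches a HorizontalPodAutoscaler to a hypothetical NIM Deployment using the Kubernetes Python client. The Deployment name, namespace, and replica bounds are placeholders, and CPU utilization is used only to keep the example short; production inference autoscaling more commonly keys off custom metrics such as queue depth or accelerator utilization.

```python
# Sketch: autoscale a hypothetical "llm-nim" inference Deployment with an HPA.
# Names, namespace, and the CPU target are placeholders for illustration.
from kubernetes import client, config

config.load_kube_config()
hpa = client.V2HorizontalPodAutoscaler(
    metadata=client.V1ObjectMeta(name="llm-nim", namespace="inference"),
    spec=client.V2HorizontalPodAutoscalerSpec(
        scale_target_ref=client.V2CrossVersionObjectReference(
            api_version="apps/v1", kind="Deployment", name="llm-nim"
        ),
        min_replicas=1,
        max_replicas=8,
        metrics=[
            client.V2MetricSpec(
                type="Resource",
                resource=client.V2ResourceMetricSource(
                    name="cpu",
                    target=client.V2MetricTarget(type="Utilization", average_utilization=60),
                ),
            )
        ],
    ),
)
client.AutoscalingV2Api().create_namespaced_horizontal_pod_autoscaler(
    namespace="inference", body=hpa
)
```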
This session provides a look into how Abridge built a secure and scalable inferencing platform on Google Kubernetes Engine (GKE). We’ll demonstrate how they leverage GKE fleets, Teams, Argo CD, and multi-cluster orchestration to manage and deploy inferencing workloads that span multiple clusters. View a live demo of a complete solution featuring a custom Argo CD plugin that simplifies cluster management and streamlines deployments for platform admins and application teams.
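One way to picture the multi-cluster piece is an Argo CD ApplicationSet with a cluster generator, which stamps out one Application per cluster registered with Argo CD. The sketch below is illustrative only: the repository URL, path, and namespaces are placeholders, and it is not Abridge’s actual configuration or custom plugin.

```python
# Sketch: fan an inference workload out to every cluster registered with Argo CD.
# Repo URL, path, and namespaces are placeholders.
from kubernetes import client, config

config.load_kube_config()
appset = {
    "apiVersion": "argoproj.io/v1alpha1",
    "kind": "ApplicationSet",
    "metadata": {"name": "inference-fleet", "namespace": "argocd"},
    "spec": {
        # Cluster generator: one Application per cluster known to Argo CD.
        "generators": [{"clusters": {}}],
        "template": {
            "metadata": {"name": "{{name}}-inference"},
            "spec": {
                "project": "default",
                "source": {
                    "repoURL": "https://example.com/platform/inference-manifests.git",
                    "targetRevision": "main",
                    "path": "base",
                },
                "destination": {"server": "{{server}}", "namespace": "inference"},
                "syncPolicy": {"automated": {"prune": True, "selfHeal": True}},
            },
        },
    },
}
client.CustomObjectsApi().create_namespaced_custom_object(
    group="argoproj.io",
    version="v1alpha1",
    namespace="argocd",
    plural="applicationsets",
    body=appset,
)
```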
In this session, we’ll explore Google’s latest developments in Google Kubernetes Engine (GKE) that enable unprecedented scale and performance for AI workloads. We’ll dive into how Anthropic leverages these capabilities to manage mega-scale Kubernetes clusters, orchestrate diverse workloads, and achieve breakthrough efficiency optimizations.
Is your platform ready for the scale of rapidly evolving models and agents? In this session, we’ll explore strategies for scaling your cloud native AI platform - empowering teams to leverage an increasing variety of AI models and agent frameworks. We’ll dive into tools and practices for maintaining control and cost efficiency while enabling AI engineering teams to quickly iterate on Google Kubernetes Engine (GKE). We’ll explore how NVIDIA NIM microservices deliver optimized inference with minimal tuning.
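Assuming the NIM container exposes an OpenAI-compatible chat completions endpoint, calling it from inside the cluster can look like the sketch below; the Service name, port, and model ID are placeholders.

```python
# Sketch: call a NIM microservice behind a ClusterIP Service named "llm-nim".
# Assumes an OpenAI-compatible /v1/chat/completions endpoint; names are placeholders.
import requests

resp = requests.post(
    "http://llm-nim.inference.svc.cluster.local:8000/v1/chat/completions",
    json={
        "model": "meta/llama-3.1-8b-instruct",  # assumed model ID served by the NIM
        "messages": [{"role": "user", "content": "Summarize our return policy."}],
        "max_tokens": 256,
    },
    timeout=60,
)
resp.raise_for_status()
print(resp.json()["choices"][0]["message"]["content"])
```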
This session is hosted by a Google Cloud Next sponsor.
Unlock the power of your application logs with Google Cloud Logging. This lab provides hands-on experience using Cloud Logging to gain deep insights into your applications, particularly on Google Kubernetes Engine. Learn to build effective queries and proactively address potential issues.
If you register for a Learning Center lab, please ensure that you sign up for a Google Cloud Skills Boost account with both your work and personal email addresses. You will also need to authenticate your account (be sure to check your spam folder!). This will ensure you can access your labs quickly when you arrive onsite. You can follow this link to sign up!
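A minimal sketch of the kind of query you might build in this lab: pulling recent error-severity entries from GKE containers with the Cloud Logging client library. The project and cluster names are placeholders.

```python
# Sketch: list the latest error logs from GKE containers in one cluster.
# Project and cluster names are placeholders.
from datetime import datetime, timedelta, timezone
from google.cloud import logging

client = logging.Client(project="my-project")
cutoff = (datetime.now(timezone.utc) - timedelta(hours=1)).isoformat()
log_filter = (
    'resource.type="k8s_container" '
    'AND resource.labels.cluster_name="my-cluster" '
    "AND severity>=ERROR "
    f'AND timestamp>="{cutoff}"'
)
for entry in client.list_entries(
    filter_=log_filter, order_by=logging.DESCENDING, max_results=20
):
    print(entry.timestamp, entry.resource.labels.get("pod_name"), entry.payload)
```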
Struggling to monitor the performance and health of your large language model (LLM) deployments on Google Kubernetes Engine (GKE)? This session unveils how the Google Cloud Observability suite provides a comprehensive solution for monitoring leading AI model servers like Ray, NVIDIA Triton, vLLM, TGI, and others. Learn how our one-click setup automatically configures dashboards, alerts, and critical metrics – including GPU and TPU utilization, latency, throughput, and error analysis – to enable faster troubleshooting and optimized performance. Discover how to gain complete visibility into your LLM infrastructure.
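To give a sense of the underlying metrics, the sketch below reads recent GPU duty-cycle samples for a cluster from Cloud Monitoring. Treat the exact metric type shown as an assumption about GKE’s GPU utilization metric, and the project and cluster names as placeholders; the one-click setup described above builds dashboards over metrics like these for you.

```python
# Sketch: read per-container GPU duty-cycle samples from Cloud Monitoring.
# Metric type, project, and cluster name are assumptions/placeholders.
import time
from google.cloud import monitoring_v3

client = monitoring_v3.MetricServiceClient()
now = int(time.time())
interval = monitoring_v3.TimeInterval(
    {"end_time": {"seconds": now}, "start_time": {"seconds": now - 3600}}
)
results = client.list_time_series(
    request={
        "name": "projects/my-project",
        "filter": (
            'metric.type="kubernetes.io/container/accelerator/duty_cycle" '
            'AND resource.labels.cluster_name="my-cluster"'
        ),
        "interval": interval,
        "view": monitoring_v3.ListTimeSeriesRequest.TimeSeriesView.FULL,
    }
)
for series in results:
    pod = series.resource.labels.get("pod_name", "?")
    # duty_cycle is assumed here to be an integer percentage (0-100).
    latest = series.points[0].value.int64_value if series.points else None
    print(f"{pod}: duty cycle {latest}%")
```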
Facing challenges with the cost and performance of your AI inference workloads? This talk presents TPUs and Google Kubernetes Engine (GKE) as a solution for achieving both high throughput and low latency while optimizing costs with open source models and libraries. Learn how to leverage TPUs to scale massive inference workloads efficiently.
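As a rough sketch of what requesting TPUs on GKE looks like, the Pod below asks for a TPU slice via node selectors and the google.com/tpu resource. The accelerator type, topology, image, and chip count are illustrative assumptions; consult the GKE TPU documentation for the values that match your node pools.

```python
# Sketch: request TPU accelerators for an inference Pod on a GKE TPU node pool.
# Accelerator type, topology, image, and chip count are assumed placeholders.
from kubernetes import client, config

config.load_kube_config()
pod = client.V1Pod(
    metadata=client.V1ObjectMeta(name="tpu-inference", namespace="inference"),
    spec=client.V1PodSpec(
        node_selector={
            "cloud.google.com/gke-tpu-accelerator": "tpu-v5-lite-podslice",  # assumed type
            "cloud.google.com/gke-tpu-topology": "2x4",                      # assumed topology
        },
        containers=[
            client.V1Container(
                name="model-server",
                image="us-docker.pkg.dev/my-project/inference/vllm-tpu:latest",  # assumed image
                resources=client.V1ResourceRequirements(
                    requests={"google.com/tpu": "8"},
                    limits={"google.com/tpu": "8"},
                ),
            )
        ],
        restart_policy="Never",
    ),
)
client.CoreV1Api().create_namespaced_pod(namespace="inference", body=pod)
```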
Stop struggling to unlock the transformative power of AI. This session flips the script, revealing how your existing Kubernetes expertise is your greatest advantage. We'll demonstrate how Google Kubernetes Engine (GKE) provides the foundation for building scalable, custom AI platforms - empowering you to take control of your AI strategy. Forget starting from scratch; leverage existing skills to architect and deploy AI solutions for your unique needs. Discover how industry leaders like Spotify are harnessing GKE to fuel responsible innovation, and gain the insights to transform your Kubernetes knowledge into your ultimate AI superpower.
Companies in the fiercely competitive gaming landscape face constant pressure to create engaging and ever-evolving player experiences. Generative AI can help game developers craft more dynamic, personalized gameplay while reducing time to market. Major game studios are leveraging Google Cloud’s cutting-edge AI capabilities to create immersive player experiences, personalized chatbots, dynamic character interactions, and user-generated content. We’ll show you how you can use Google Kubernetes Engine to easily integrate gen AI with game servers.
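As a hypothetical example of wiring gen AI into gameplay, the sketch below has a game-server component call Gemini on Vertex AI to generate short NPC dialogue. The project, model ID, and prompt framing are placeholders, not anything a specific studio uses.

```python
# Sketch: generate NPC dialogue from a game server with Gemini on Vertex AI.
# Project, location, model ID, and prompt framing are illustrative placeholders.
import vertexai
from vertexai.generative_models import GenerativeModel

vertexai.init(project="my-project", location="us-central1")
model = GenerativeModel("gemini-1.5-flash")  # assumed model ID

def npc_reply(player_line: str, npc_persona: str) -> str:
    # Keep the prompt short so latency stays compatible with in-game interaction.
    prompt = (
        f"You are {npc_persona}. Stay in character and answer in one or two sentences.\n"
        f"Player says: {player_line}"
    )
    return model.generate_content(prompt).text

print(npc_reply("Where can I find the blacksmith?", "a gruff village guard"))
```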
Build modern applications with the power of Oracle Database 23ai and Google Cloud’s Vertex AI and Gemini foundation models. Learn key strategies to integrate Google Cloud’s native development tools and services, including Kubernetes, Cloud Run, and BigQuery, seamlessly with Oracle Database 23ai and Autonomous Database in modern application architectures. Cloud architects, developers, and database administrators will gain actionable insights, best practices, and real-world examples to enhance performance and accelerate innovation with ODB@GC.
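A small, hypothetical sketch of the integration pattern: embed a query with Vertex AI, then run a vector similarity search in Oracle Database 23ai with python-oracledb. The table, connection details, and model ID are placeholders, and the TO_VECTOR/VECTOR_DISTANCE usage assumes a 23ai schema with a VECTOR column.

```python
# Sketch: Vertex AI embedding + Oracle Database 23ai vector search.
# Connection details, table/columns, and model ID are assumed placeholders.
import oracledb
import vertexai
from vertexai.language_models import TextEmbeddingModel

vertexai.init(project="my-project", location="us-central1")
model = TextEmbeddingModel.from_pretrained("text-embedding-005")  # assumed model ID
vec = model.get_embeddings(["customer churn drivers"])[0].values
vec_literal = "[" + ",".join(str(v) for v in vec) + "]"

conn = oracledb.connect(user="app", password="change-me", dsn="db23ai_high")  # assumed DSN
cur = conn.cursor()
# Assumes a DOCS table with a VECTOR column named EMBEDDING.
cur.execute(
    """
    SELECT content
      FROM docs
     ORDER BY VECTOR_DISTANCE(embedding, TO_VECTOR(:qv), COSINE)
     FETCH FIRST 5 ROWS ONLY
    """,
    qv=vec_literal,
)
for (content,) in cur:
    print(content)
conn.close()
```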
This session is hosted by a Google Cloud Next sponsor.
Demo of Dapr AI Agents in a Kubernetes (AKS) environment.
A 20-minute demo followed by 10 minutes of questions on Dapr AI Agents in a Kubernetes (AKS) environment, focusing on the code and forgetting that the infrastructure exists.
Discover how Renault transformed automotive software development (SDV) with Google Cloud. By replacing physical prototypes with Android-based virtualization, they accelerated their SDV life cycle and moved to a cloud-first, iterative approach. Learn how they leverage Cloud Workstations, Gemini Code Assist, and a continuous integration and continuous testing (CI/CT) pipeline powered by Google Kubernetes Engine and GitLab to boost developer productivity and bring new features to market faster.
Managing massive deployments of accelerators for AI and high performance computing (HPC) workloads can be complex. This talk dives into running AI-optimized Google Kubernetes Engine (GKE) clusters that streamline infrastructure provisioning, workload orchestration, and ongoing operations for tens of thousands of accelerators. Learn how topology-aware scheduling, maintenance controls, and advanced networking capabilities enable ultralow latency and maximum performance by default for demanding workloads like AI pretraining, fine-tuning, inference, and HPC.
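Topology-aware scheduling itself is largely configured at the cluster and node-pool level, but the general idea can be approximated in a workload spec. The sketch below uses preferred pod affinity on the standard topology.kubernetes.io/zone label to pull GPU workers into the same zone; the labels, image, and GPU count are placeholders, and this is a generic approximation rather than GKE’s accelerator-specific placement machinery.

```python
# Sketch: prefer co-locating GPU workers in one zone via pod affinity.
# Labels, image, and GPU count are placeholders; this spec would be embedded in a
# Deployment or Job pod template rather than used on its own.
from kubernetes import client

labels = {"app": "trainer"}
affinity = client.V1Affinity(
    pod_affinity=client.V1PodAffinity(
        preferred_during_scheduling_ignored_during_execution=[
            client.V1WeightedPodAffinityTerm(
                weight=100,
                pod_affinity_term=client.V1PodAffinityTerm(
                    label_selector=client.V1LabelSelector(match_labels=labels),
                    topology_key="topology.kubernetes.io/zone",
                ),
            )
        ]
    )
)
pod_spec = client.V1PodSpec(
    affinity=affinity,
    containers=[
        client.V1Container(
            name="trainer",
            image="us-docker.pkg.dev/my-project/training/trainer:latest",  # assumed image
            resources=client.V1ResourceRequirements(limits={"nvidia.com/gpu": "8"}),
        )
    ],
)
```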
Join this session where Shopify engineers will discuss how they leverage the latest Google Kubernetes Engine (GKE) innovations to build robust, scalable platforms that not only handle everyday traffic with ease but also gracefully absorb unpredictable spikes during peak events like Black Friday and Cyber Monday. Learn key architectural patterns, smart infrastructure choices, and proven best practices. Discover how to optimize resource utilization, control costs, and deliver cost-effective performance every time.