Migrating high performance computing (HPC) workloads to the cloud presents unique challenges, as traditional on-premises infrastructure often clashes with cloud architectures, leading to operational and cost inefficiencies. Embracing core technologies like Google Kubernetes Engine and Google Cloud Storage offers a compelling solution to these hurdles. In this session, we explore PGS the transition of our entire HPC system to Google Cloud. This move allows us to run workloads five times larger than previously possible while reducing turnaround time by a factor of two.
Click the blue “Learn more” button above to tap into special offers designed to help you implement what you are learning at Google Cloud Next 25.
talk-data.com
Topic
Kubernetes
50
tagged
Activity Trend
Top Events
As machine learning (ML) systems continue to evolve, the ability to scale complex ML workloads becomes crucial. Scalability can be considered along two dimensions: expansive training of large language models (LLMs) and intricate distribution of reinforcement learning (RL) systems. Each has its own set of challenges, from computational demands of LLMs to complex synchronization in distributed RL.
This session explores the integration of Ray, Google Kubernetes Engine (GKE) and ML accelerators like tensor processing units (TPUs) as a powerful combination to develop advanced ML systems at scale. We discuss Ray and its scalable APIs, its mature integration with GKE and ML accelerators, and demonstrate how it has been used for LLMs and re-implementing the powerful RL algorithm, Muzero.
Click the blue “Learn more” button above to tap into special offers designed to help you implement what you are learning at Google Cloud Next 25.
This session features panel discussion with Snap Inc., and its journey from being born on Google App Engine to how they’ve been able to grow and serve 400M+ DAU powered by Google Kubernetes Engine. Learn about the business decisions behind this evolution, as we dive into the strategic approach delivered by Snap’s leadership throughout the company’s history as a digital-born customer.
Click the blue “Learn more” button above to tap into special offers designed to help you implement what you are learning at Google Cloud Next 25.
If you are curious how to accelerate developers innovation, inner sourcing and governance by taking Crème de la crème from Google Cloud developer toolset and open source that session is for you.
Leverage best of OSS and GCP to make it easy. During presentation you will learn how to accelerate application and infrastructure delivery from Google Cloud in use of Kubernetes Resource Model, empowered by GKE Enterprise and Cloud Deploy and exposed to developers via OSS Backstage Portal. All ended with practical use case demo.
Click the blue “Learn more” button above to tap into special offers designed to help you implement what you are learning at Google Cloud Next 25.
Cloud-native applications can be complex, but securing them shouldn’t be. Learn how CrowdStrike Falcon Cloud Security enables DevOps and SecOps to discover weak spots in their container images, prevent malicious behavior on Kubernetes clusters, visualize sensitive data flows, and discover misconfigurations across all of their cloud accounts. This session is for anyone responsible for application or cloud security. By attending this session, your contact information may be shared with the sponsor for relevant follow up for this event only.
Click the blue “Learn more” button above to tap into special offers designed to help you implement what you are learning at Google Cloud Next 25.
Deploying AI to production can be bafflingly complex. Learn how Google Cloud is bringing its over two decades of expertise in productionizing planet scale AI to our cloud customers with the AI Hypercomputer architecture. It’s a groundbreaking supercomputing architecture built on performance-optimized hardware (TPUs, GPUs), open software (PyTorch, Jax, Kubernetes), and tailored consumption models that optimize efficiency and productivity across AI training, tuning, and serving. Plus, gain valuable insights from our customers Kakao Brain and Nuro on their journey to deploying large scale AI on Google Cloud.
Click the blue “Learn more” button above to tap into special offers designed to help you implement what you are learning at Google Cloud Next 25.
In this session, you will learn how Rent the Runway (RTR) relies on MongoDB Atlas on Google Cloud to mix their automation hardware with their software, needing a robust, flexible, and intuitive data platform. We’ll dive into some reference architecture, highlighting some key integrations, such as Google Kubernetes Engine. We will then discuss RTR’s AI strategy, discussing how they’re approaching AI tools for their products. Lastly, we’ll discuss RTR and MongoDB’s mission of sustainability. Q&A to follow.
By attending this session, your contact information may be shared with the sponsor for relevant follow up for this event only.
Click the blue “Learn more” button above to tap into special offers designed to help you implement what you are learning at Google Cloud Next 25.
Introducing Google Kubernetes Engine (GKE) Threat Detection powered by Security Command Center (SCC). Event Threat Detection protects your use of Google Cloud from the Identity layer up through Network layer detections. Discover how GKE and SCC deliver a better-together integrated experience to detect threats against the container infrastructure.
Click the blue “Learn more” button above to tap into special offers designed to help you implement what you are learning at Google Cloud Next 25.
Large Language Models (LLMs) have changed the way we interact with information. A base LLM is only aware of the information it was trained on. Retrieval augmented generation (RAG) can address this issue by providing context of additional data sources. In this session, we’ll build a RAG-based LLM application that incorporates external data sources to augment an OSS LLM. We’ll show how to scale the workload with distributed kubernetes compute, and showcase a chatbot agent that gives factual answers.
Click the blue “Learn more” button above to tap into special offers designed to help you implement what you are learning at Google Cloud Next 25.
Worried about compliance for your platform and containers? Google Kubernetes Engine (GKE) has you covered. This session unlocks the power of GKE Compliance Posture, your real-time dashboard for proactive risk detection and continuous compliance. You’ll be able to see your entire GKE compliance landscape at a glance; stay ahead of risks with constant monitoring against industry standards; and get clear guidance to fix gaps and boost security. Plus, learn from SADA customers.
Click the blue “Learn more” button above to tap into special offers designed to help you implement what you are learning at Google Cloud Next 25.