Struggling to monitor the performance and health of your large language model (LLM) deployments on Google Kubernetes Engine (GKE)? This session unveils how the Google Cloud Observability suite provides a comprehensive solution for monitoring leading AI model servers like Ray, NVIDIA Triton, vLLM, TGI, and others. Learn how our one-click setup automatically configures dashboards, alerts, and critical metrics – including GPU and TPU utilization, latency, throughput, and error analysis – to enable faster troubleshooting and optimized performance. Discover how to gain complete visibility into your LLM infrastructure.
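As a rough illustration of the metrics this abstract names (latency, throughput, and error analysis), the sketch below summarizes a batch of request records. It is not the Google Cloud Observability API or any model server's exporter: `RequestRecord`, `percentile`, and `summarize` are hypothetical names, the nearest-rank percentile is just one of several common definitions, and measuring throughput in output tokens per second is an assumption.

```python
import math
from dataclasses import dataclass


@dataclass
class RequestRecord:
    """One inference request (hypothetical record shape)."""
    latency_s: float     # end-to-end latency in seconds
    output_tokens: int   # tokens generated for this request
    ok: bool             # False if the request returned an error


def percentile(values, pct):
    """Nearest-rank percentile for pct in (0, 100]."""
    if not values:
        raise ValueError("percentile() needs at least one value")
    ordered = sorted(values)
    rank = math.ceil(pct / 100 * len(ordered))
    return ordered[max(rank, 1) - 1]


def summarize(records, window_s):
    """Summarize latency, throughput, and error rate over one time window."""
    if not records:
        raise ValueError("summarize() needs at least one record")
    succeeded = [r for r in records if r.ok]
    latencies = [r.latency_s for r in succeeded]
    return {
        # Latency is reported over successful requests only (an assumption).
        "p50_latency_s": percentile(latencies, 50) if latencies else None,
        "p95_latency_s": percentile(latencies, 95) if latencies else None,
        # Throughput: generated tokens per second of wall-clock window.
        "tokens_per_s": sum(r.output_tokens for r in succeeded) / window_s,
        # Error rate: share of requests that failed.
        "error_rate": 1 - len(succeeded) / len(records),
    }
```

In a real deployment these values would normally come from the model server's exported metrics rather than from hand-collected records; the sketch only shows what the headline numbers mean.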
talk-data.com
Topic
AI/ML
Artificial Intelligence/Machine Learning
Discover groundbreaking AI innovations in our breakout session featuring experts from Cognizant, Google, Northumbrian Water, and Stena AB. Learn how AI models are revolutionizing the approach to addressing business challenges and environmental concerns. Examples will cover solutions to detect water health using satellite imagery and AI agents transforming the shipping industry's contract decisions to reduce idle time. Join us for an insightful discussion on how these cutting-edge advancements in AI technology are providing a competitive edge.
This Session is hosted by a Google Cloud Next Sponsor.
Visit your registration profile at g.co/cloudnext to opt out of sharing your contact information with the sponsor hosting this session.
Even if your cloud infrastructure is configured securely, how can you make sure cloud applications do not introduce vulnerabilities? This technical session for cloud security professionals focuses on shift-left security practices, including the latest strategies for building secure applications, achieving code-to-cloud traceability of code-related issues, and getting AI-generated remediations that make it easy for application development teams to fix issues at the source.
Experience a live demo of Google Cloud’s approach to mainframe modernization using generative AI. This demo will showcase the modernization life cycle, from initial assessment to code rewrite to risk mitigation. We’ll illustrate how our agentic approach streamlines the modernization process, reducing time, budget, and resource requirements. And we’ll demonstrate how to minimize the risk of modernizing business-critical applications through testing and by enabling parallel execution of both original and modernized applications with Dual Run.
Facing challenges with the cost and performance of your AI inference workloads? This talk presents TPUs and Google Kubernetes Engine (GKE) as a solution for achieving both high throughput and low latency while optimizing costs with open source models and libraries. Learn how to leverage TPUs to scale massive inference workloads efficiently.
Bigtable has been a core piece of application infrastructure for Google and companies such as Snap, Spotify, and many other massive platforms for over 20 years. In this session, we’ll discuss the fundamental changes to Bigtable processing capabilities made available via SQL that will let you bring more data transformations directly into Bigtable – enabling extract, load, and transform (ELT) capabilities taking advantage of Bigtable’s flexible schema to achieve increased data freshness – and that will reduce the time and costs of running other data processing services to prepare data for your real-time application.
Learn how you can use BigQuery and Earth Engine together for a variety of weather and climate resilience use cases, including risk assessment, response, and recovery. You'll receive a real-world, step-by-step walkthrough highlighting new geospatial features and AI-driven datasets that make it easier for any professional to unlock insights that can contribute to greater climate resiliency.
UKG is revolutionizing workforce management with AI agents and retrieval-augmented generation (RAG) systems. Join this session for a deep dive into how UKG, Google Cloud, and MongoDB collaborated to orchestrate enterprise data and put it to use powering intelligent, context-aware AI solutions that shape the future of work.
This Session is hosted by a Google Cloud Next Sponsor.
Visit your registration profile at g.co/cloudnext to opt out of sharing your contact information with the sponsor hosting this session.
AI is revolutionizing contact centers, but the gap between AI leaders and laggards isn't just about technology—it's about a fundamental shift in customer loyalty. In this Spotlight, Dialpad and Google leadership will discuss the five AI adoption strategies separating industry champions from the rest. Top performers leverage industry-specific models and micro-data intelligence to advance business outcomes. But here's the twist: humans are the multiplier. Discover how leading organizations are rewriting the rules of human-AI collaboration, ensuring AI augments—not replaces—their workforce. Join us for an insightful discussion and live demo of market-shifting AI innovations.
This Session is hosted by a Google Cloud Next Sponsor.
Visit your registration profile at g.co/cloudnext to opt out of sharing your contact information with the sponsor hosting this session.
Cut through the AI hype. This panel of executive thought leaders from some of the world’s most respected healthcare institutions reveals the truth about how AI is really working in healthcare. They’ll share real-world results offering invaluable insights into the practical application of AI in patient care and provider satisfaction. We’ll explore what AI solutions are being used right now, what challenges have been overcome, and how AI is demonstrably improving patient outcomes, operational efficiency, and provider experiences.
This Session is hosted by a Google Cloud Next Sponsor.
Visit your registration profile at g.co/cloudnext to opt out of sharing your contact information with the sponsor hosting this session.
AI is transforming software development, but not always for the better. The 2024 DORA report reveals that AI adoption resulted in a 7.2% regression of delivery stability for software, highlighting the industry's overemphasis on velocity. This session will explain how to flip the script on AI’s role in software development, and drive better development and delivery outcomes with new innovations from Google Cloud and beyond, freeing developers to focus on customer value.
Join us to learn how Avery Dennison and Mercer International are transforming their workflows with Google Workspace with Gemini. They'll share their journeys, including evaluation criteria, roll-out strategies, and the impact of generative AI on productivity and employee satisfaction. Gain valuable insights into successful AI adoption and learn how to leverage these powerful tools within your own organization.
Whilst GenAI has brought conversational experiences to the forefront, the next generation of web interfaces will demand more than just chat interactions. Instead, delivering highly personalized experiences requires a powerful blend of search, dynamic multimodal visual elements, and conversational interactions. In this session, discover how Valtech, the experience innovation company, is leveraging Vertex AI Agent Builder and Gemini with React to redefine experiences on the web.
This Session is hosted by a Google Cloud Next Sponsor.
Visit your registration profile at g.co/cloudnext to opt out of sharing your contact information with the sponsor hosting this session.
Is your team ready for the future of cloud? Discover how Google Cloud is equipping partners with the cutting-edge expertise needed to capitalize on the rapidly evolving generative AI landscape. You’ll learn about these transformative programs and learning paths, as well as targeted journeys for Vertex AI, generative AI Agent Builder, Gemini, and Customer Engagement Suite. Whether you’re just starting your gen AI journey or are ready to tackle advanced implementations, we have just the training for you.
Agentic AI is poised to revolutionize how marketing & data teams unite to fuel revenue growth in 2025 and beyond. Join this session to discover how marketers can leverage AI agents applied directly to Google Cloud BigQuery to suggest, build, and execute campaigns that compound over time. You’ll learn how GrowthLoop’s marketing activation solution, built on Google Cloud BigQuery, leverages Agentic AI to automate campaign launches, accelerate experimentation, and deliver hyper-personalized experiences at scale.
This Session is hosted by a Google Cloud Next Sponsor.
Visit your registration profile at g.co/cloudnext to opt out of sharing your contact information with the sponsor hosting this session.
This talk delves into the evolving data architectures that will drive the next generation of AI applications, exploring how generative AI is transforming our interaction with data. We will examine the challenges and opportunities in building data systems that can support the distinctive requirements of GenAI, and envision the future of data architecture in a world where AI is omnipresent. Working backward from a real use case with the Intesa bank team, we will look at how they are reinventing risk management with data and AI.
Join us to learn how Globe Telecom and Banesco USA are transforming their workflows with Google Workspace with Gemini. They'll share their journeys, including evaluation criteria, roll-out strategies, and the impact of generative AI on productivity and employee satisfaction. Gain valuable insights into successful AI adoption and learn how to leverage these powerful tools within your own organization.
The telecommunications industry is undergoing a significant transformation in customer engagement, driven by the capabilities of Generative AI. This talk explores how Google Cloud's Gen AI tools can be leveraged to develop advanced conversational bots, enabling telcos to reimagine their customer interactions. We will delve into the technical architecture and implementation of these solutions, focusing on how they can significantly improve issue resolution efficiency and increase containment rates for both voice and chat channels. This session will showcase how to build intelligent, scalable, and personalized customer experiences, highlighting the business value of Gen AI in optimizing service and support operations within the telecommunications sector.
This talk presents an architecture for a robust custom AI chatbot backed by Vertex AI with Gemini and fully managed GCP services. Cloud-based: a scalable platform ensures high availability and performance. Reduced development time: the time and effort required to build and deploy custom AI chatbots is significantly reduced. Scalability and performance: the chatbots can handle high volumes of traffic while maintaining optimal performance.
The solution democratizes the development of AI chatbots, making them accessible to a wider range of enterprises and domains.
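The containment rate mentioned in this abstract is not defined there. A common reading, assumed here, is the share of conversations the bot resolves without handing off to a human agent. The sketch below computes it per channel; `containment_rates` and the `(channel, escalated_to_human)` record shape are hypothetical and are not part of any Google Cloud API.

```python
from collections import defaultdict


def containment_rates(conversations):
    """Return {channel: containment rate}.

    conversations: iterable of (channel, escalated_to_human) pairs, where
    escalated_to_human is True if the bot handed off to a human agent.
    Containment rate (assumed definition) = non-escalated / total.
    """
    totals = defaultdict(int)
    contained = defaultdict(int)
    for channel, escalated_to_human in conversations:
        totals[channel] += 1
        if not escalated_to_human:
            contained[channel] += 1
    return {channel: contained[channel] / totals[channel] for channel in totals}


# Illustrative data only, not results from the session.
sample = [
    ("voice", False), ("voice", True),
    ("chat", False), ("chat", False), ("chat", True), ("chat", False),
]
rates = containment_rates(sample)
print(rates)  # {'voice': 0.5, 'chat': 0.75}
```

Tracking the rate separately for voice and chat, as the abstract suggests, keeps a strong channel from masking a weak one in a blended average.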
Experience the future of AI with Google Cloud! Speak with Agentspace experts to learn how it can provide conversational assistance and take actions based on your company’s unique information.