Search – talk-data.com


Multimodality with Gemini: Text, Videos, and Images 2024-12-04 · 17:00

Henry Ruiz – Google Developer Expert @ Google

Gemini is the most capable and general model Google has ever built. It was built from the ground up to be multimodal, which means it can generalize and seamlessly understand, operate across, and combine different types of information, including text, code, images, and video. This talk dives into the exciting world of Gemini, a cutting-edge foundation model developed by Google. Discover how Gemini seamlessly integrates text and image processing, enabling you to:

- Analyze and understand the content of images, videos, and audio files
- Perform cross-modal tasks like image captioning and visual question-answering
- Explore the potential of multimodality for various applications, from creative content generation to advanced information retrieval.

gemini multimodal ai text images videos audio

Session #10: Google AI Seminar (Virtual)

Session #10: Google AI Seminar (Virtual) 2024-12-04 · 17:00

Important: RSVP here to receive the joining link. (RSVPs on Meetup will NOT receive the joining link.)

Description: Welcome to the weekly AI virtual seminars, in collaboration with Google. Join us for deep-dive tech talks on AI/ML/Data, hands-on code labs, workshops, and networking with speakers and fellow developers from all over the world.

Tech Talk: Multimodality with Gemini: Text, Videos, and Images
Speaker: Henry Ruiz (Google Developer Expert)
Abstract: Gemini is the most capable and general model Google has ever built. It was built from the ground up to be multimodal, which means it can generalize and seamlessly understand, operate across, and combine different types of information, including text, code, images, and video. This talk dives into the exciting world of Gemini, a cutting-edge foundation model developed by Google. Discover how Gemini seamlessly integrates text and image processing, enabling you to:

- Analyze and understand the content of images, videos, and audio files
- Perform cross-modal tasks like image captioning and visual question-answering
- Explore the potential of multimodality for various applications, from creative content generation to advanced information retrieval.

Speakers/Topics: Stay tuned as we are updating speakers and schedules. If you have a keen interest in speaking to our community, we invite you to submit topics for consideration: Submit Topics

Sponsors: We are actively seeking sponsors to support the AI developer community, whether by offering venue space, providing food, or contributing cash sponsorship. Sponsors will not only have the chance to speak at the meetups and receive prominent recognition, but also gain exposure to our extensive membership base of 400K+ AI developers worldwide.

AICamp Community on Slack/Discord

- Event chat: chat and connect with speakers and attendees
- Sharing blogs, events, job openings, and project collaborations

Session #10: Google AI Seminar (Virtual)

Tech Talk: Multimodality with Gemini: Text, Videos, and Images 2024-06-22 · 20:00

Henry Ruiz – Google Developer Expert @ Google

Gemini is the most capable and general model Google has ever built. It was built from the ground up to be multimodal, which means it can generalize and seamlessly understand, operate across, and combine different types of information, including text, code, images, and video. This talk dives into the exciting world of Gemini, a cutting-edge foundation model developed by Google. Discover how Gemini seamlessly integrates text and image processing, enabling you to:

- Analyze and understand the content of images, videos, and audio files
- Perform cross-modal tasks like image captioning and visual question-answering
- Explore the potential of multimodality for various applications, from creative content generation to advanced information retrieval.

gemini vertex ai

Google Generative AI Learning Month (Virtual) - Session 3

Tech Talk: Multimodality with Gemini: Text, Videos, and Images 2024-06-22 · 20:00

Henry Ruiz – Google Developer Expert @ Google

Gemini is the most capable and general model Google has ever built. It was built from the ground up to be multimodal, which means it can generalize and seamlessly understand, operate across, and combine different types of information, including text, code, images, and video. This talk dives into the exciting world of Gemini, a cutting-edge foundation model developed by Google. Discover how Gemini seamlessly integrates text and image processing, enabling you to:

- Analyze and understand the content of images, videos, and audio files
- Perform cross-modal tasks like image captioning and visual question-answering
- Explore the potential of multimodality for various applications, from creative content generation to advanced information retrieval.

gemini vertex ai google cloud

Google Generative AI Learning Month (Virtual) - Session 3

Google Generative AI Learning Month (Virtual) - Session 3 2024-06-22 · 14:30

RSVP instruction: register here to receive the joining link before the deadline.

Description: Welcome to "Generative AI Learning Month for Google Gemini and Vertex AI," a series of virtual events designed to help you master Google Gemini and Vertex AI, in collaboration with Google Developers.

Join us online every Saturday in June for in-depth tech talks, discussions, and networking with speakers and fellow developers from around the world.

Schedules:

Session 1: 8th June, 8pm~9pm IST
Session 2: 15th June, 8pm~9pm IST
Session 3: 22nd June, 8pm~9pm IST
Session 4: 29th June, 8pm~9pm IST

Tech Talk: Multimodality with Gemini: Text, Videos, and Images
Speaker: Henry Ruiz (Google Developer Expert)
Abstract: Gemini is the most capable and general model Google has ever built. It was built from the ground up to be multimodal, which means it can generalize and seamlessly understand, operate across, and combine different types of information, including text, code, images, and video. This talk dives into the exciting world of Gemini, a cutting-edge foundation model developed by Google. Discover how Gemini seamlessly integrates text and image processing, enabling you to:

- Analyze and understand the content of images, videos, and audio files
- Perform cross-modal tasks like image captioning and visual question-answering
- Explore the potential of multimodality for various applications, from creative content generation to advanced information retrieval.

Speakers/Topics: Stay tuned as we are updating speakers and schedules. If you have a keen interest in speaking to our community, we invite you to submit topics for consideration: Submit Topics

Venue: virtual, join from anywhere.

Sponsors: We are actively seeking sponsors to support the AI developer community, whether by offering venue space, providing food, or contributing cash sponsorship. Sponsors will not only have the chance to speak at the meetups and receive prominent recognition, but also gain exposure to our extensive membership base of 350K+ AI developers worldwide.

Community on Slack/Discord

Event chat: chat and connect with speakers and attendees
Sharing blogs, events, job openings, and project collaborations

Google Generative AI Learning Month (Virtual) - Session 3
