talk-data.com
Activities & events
| Title & Speakers | Event |
|---|---|
|
Multimodality with Gemini: Text, Videos, and Images
2024-12-04 · 17:00
Henry Ruiz
– Google Developer Expert
@ Google
Gemini is the most capable and general model Google has ever built. It was built from the ground up to be multimodal, which means it can generalize and seamlessly understand, operate across, and combine different types of information, including text, code, images, and video. This talk dives into the exciting world of Gemini, a cutting-edge foundation model developed by Google. Discover how Gemini seamlessly integrates text and image processing, enabling you to: analyze and understand the content of images, videos, and audio files; perform cross-modal tasks like image captioning and visual question-answering; and explore the potential of multimodality for various applications, from creative content generation to advanced information retrieval. |
Session #10: Google AI Seminar (Virtual)
|
|
Session #10: Google AI Seminar (Virtual)
2024-12-04 · 17:00
Important: RSVP here to receive the joining link (an RSVP on Meetup alone will NOT receive the joining link). Description: Welcome to the weekly AI virtual seminars, in collaboration with Google. Join us for deep-dive tech talks on AI/ML/Data, hands-on code labs, workshops, and networking with speakers and fellow developers from all over the world. Tech Talk: Multimodality with Gemini: Text, Videos, and Images. Speaker: Henry Ruiz (Google Developer Expert). Abstract: Gemini is the most capable and general model Google has ever built. It was built from the ground up to be multimodal, which means it can generalize and seamlessly understand, operate across, and combine different types of information, including text, code, images, and video. This talk dives into the exciting world of Gemini, a cutting-edge foundation model developed by Google. Discover how Gemini seamlessly integrates text and image processing, enabling you to: analyze and understand the content of images, videos, and audio files; perform cross-modal tasks like image captioning and visual question-answering; and explore the potential of multimodality for various applications, from creative content generation to advanced information retrieval. Speakers/Topics: Stay tuned as we are updating speakers and schedules. If you have a keen interest in speaking to our community, we invite you to submit topics for consideration: Submit Topics. Sponsors: We are actively seeking sponsors to support the AI developer community, whether by offering venue space, providing food, or contributing cash sponsorship. Sponsors will have the chance to speak at the meetups, receive prominent recognition, and gain exposure to our extensive membership base of 400K+ AI developers worldwide. AICamp Community on Slack/Discord: event chat to connect with speakers and attendees; sharing of blogs, events, job openings, and project collaborations. |
Session #10: Google AI Seminar (Virtual)
|
|
Tech Talk: Multimodality with Gemini: Text, Videos, and Images
2024-06-22 · 20:00
Henry Ruiz
– Google Developer Expert
@ Google
Gemini is the most capable and general model Google has ever built. It was built from the ground up to be multimodal, which means it can generalize and seamlessly understand, operate across, and combine different types of information, including text, code, images, and video. This talk dives into the exciting world of Gemini, a cutting-edge foundation model developed by Google. Discover how Gemini seamlessly integrates text and image processing, enabling you to: analyze and understand the content of images, videos, and audio files; perform cross-modal tasks like image captioning and visual question-answering; and explore the potential of multimodality for various applications, from creative content generation to advanced information retrieval. |
Google Generative AI Learning Month (Virtual) - Session 3
|
|
Google Generative AI Learning Month (Virtual) - Session 3
2024-06-22 · 14:30
RSVP instruction: register here to receive the joining link before the deadline. Description: Welcome to "Generative AI Learning Month for Google Gemini and Vertex AI," a series of virtual events designed to help you master Google Gemini and Vertex AI, in collaboration with Google Developers. Join us online every Saturday in June for in-depth tech talks, discussions, and networking with speakers and fellow developers from around the world. Schedules:
Tech Talk: Multimodality with Gemini: Text, Videos, and Images. Speaker: Henry Ruiz (Google Developer Expert). Abstract: Gemini is the most capable and general model Google has ever built. It was built from the ground up to be multimodal, which means it can generalize and seamlessly understand, operate across, and combine different types of information, including text, code, images, and video. This talk dives into the exciting world of Gemini, a cutting-edge foundation model developed by Google. Discover how Gemini seamlessly integrates text and image processing, enabling you to: analyze and understand the content of images, videos, and audio files; perform cross-modal tasks like image captioning and visual question-answering; and explore the potential of multimodality for various applications, from creative content generation to advanced information retrieval. Speakers/Topics: Stay tuned as we are updating speakers and schedules. If you have a keen interest in speaking to our community, we invite you to submit topics for consideration: Submit Topics. Venue: virtual, join from anywhere. Sponsors: We are actively seeking sponsors to support the AI developer community, whether by offering venue space, providing food, or contributing cash sponsorship. Sponsors will have the chance to speak at the meetups, receive prominent recognition, and gain exposure to our extensive membership base of 350K+ AI developers worldwide. Community on Slack/Discord
|
Google Generative AI Learning Month (Virtual) - Session 3
|