A talk by Ekaterina Sirazitdinova from NVIDIA.
This session covers Multimodal generative AI has recently seen significant advancements, enabling the creation of realistic images, videos, and audio from textual or other inputs. However, due to the complexity of these models, understanding how they function and how to apply them in practical settings can be challenging. During this talk, Ekaterina will shed light on the inner workings of multimodal generative AI models by discussing key concepts and techniques used in their development. She will also explore various applications and use cases of this technology. The talk is intended for anyone interested in the current state of AI and its potential to produce realistic and immersive multimedia experiences.
Technical Level: Technical practitioner
This session was part of the Data Science Festival MayDay event 2024. Find out more at datasciencefes...
The Data Science Festival is the place for data-driven people to come together, share cutting-edge ideas, and solve real-world problems. We run monthly events, meet-ups, and the biggest free-to-attend data festivals in the UK. Join the community at datasciencefes...
Негізгі бет Multimodal Generative AI Demystified - Data Science Festival
Пікірлер