Computer Vision Study Group Session on BLIP-2

In this session of Computer Vision Study Group, Johannes walks us through the paper BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models.

Жүктеу

Пікірлер: 16

@mrtago8973
11 ай бұрын
Awesome video and great explanation! Keep it up!
@objectijustmusic1476
Жыл бұрын
Good explaination! Answered some of the questions I had while reading the paper
@wei-lunchiu2366
9 ай бұрын
best BLIP2 explanation on the youtube!
@kemalariboga
Жыл бұрын
Thank you for your efforts! It's a great video. Also, the stories at the beginning are always my favorite, lol.
@johanneskolbe6891
Жыл бұрын
Thank you, I have to admit the stories are also what I have most fun with when creating the presentations ;)
@juancamachomohedano6118
Жыл бұрын
Great intro with the story! Nice and easy presentation. Thank you!
@johanneskolbe6891
Жыл бұрын
Thank you :)
@josephtsangko3558
19 күн бұрын
Great job!
@gangavaramkrishnappanagate2204
7 ай бұрын
very good explanation , Thank you
@IntuitiveAndExhaustive
10 ай бұрын
I enjoyed your story!
@pranav_tushar_sg
6 ай бұрын
Thanks!
@thiagarajamuralidaran1371
3 ай бұрын
Thanks
@bilalbayrakdar7100
2 ай бұрын
not the most technical group session but thanks for your effort.
@nacelle_strike
8 ай бұрын
18:51에 나오는 그림은 query가 text token과 연결되면 안될것이다.
@ruiqiu4423
6 ай бұрын
I think you are right, query token cannot attend text token.
@kafkatamurra
2 ай бұрын
kzitem.info/news/bejne/zGZ6p6iQfHWdZKw Based on the paper in this step, query tokens only attend to each other, whereas text tokens attend to all query tokens and the previous text tokens.

Computer Vision Study Group Session on SAM

Beyond Text - Giving Stable Diffusion New Abilities

Scary Teacher 3D Nick Troll Squid Game in Brush Teeth White or Black Challenge #shorts

Жайдарман | Туған күн 2024 | Алматы

아이스크림으로 체감되는 요즘 물가

Does size matter? BEACH EDITION

BLIP2: BLIP with frozen image encoders and LLMs

Low-rank Adaption of Large Language Models: Explaining the Key Concepts Behind LoRA

Collective Intelligence & Creative AI

Prompt Engineering, RAG, and Fine-tuning: Benefits and When to Use

How to Train a Model with Pytorch

CV Study Group: Masked Autoencoders Paper Walkthrough

DINO: Emerging Properties in Self-Supervised Vision Transformers (Facebook AI Research Explained)

Fine-tune Multi-modal LLaVA Vision and Language Models

Fine Tuning LLaVA

OpenAI CLIP: ConnectingText and Images (Paper Explained)

Easy Art with AR Drawing App - Step by step for Beginners

Неожиданная концовка. $5000 или что-то из apple магазина? #shorts #опрос #сигма #телефон #rec #fyp

Todos os modelos de smartphone

MacBook Air M1 vs Snapdragon X Elite - удивительные результаты!

Сложный РЕМОНТ ТОПОВОГО Samsung Galaxy S22 ULTRA SM-S908E после залития / НЕ ЛОВИТ СЕТИ

Ура!наконец-то куплю новый Samsung!#хочуврек#котики#футажи#shorts Автор звука:@GORA9338

Klavye İle Trafik Işığını Yönetmek #shorts

Computer Vision Study Group Session on BLIP-2

Пікірлер: 16