In this session of the Computer Vision Study Group, Johannes walks us through the paper "BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models".
- 1 year ago
Computer Vision Study Group Session on BLIP-2
- Views: 9,295