Google DeepMind recently released a report on Gemini 1.5 Pro.
Perhaps the most significant advance over Gemini 1.0 is the context window, which can now span up to 10 million tokens of text.
Timestamps:
00:00 - Gemini 1.5 Pro has a massive context window
00:48 - Needle-in-haystack analysis
03:40 - Model architecture (sparse mixture-of-experts Transformer)
05:58 - Training infrastructure
06:53 - Long-context evaluation (JAX codebase, Les Misérables etc.)
09:15 - Core Capability Evaluations (MMLU, HumanEval etc.)
10:08 - Responsible Deployment
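To give a flavour of the needle-in-haystack analysis discussed at 00:48: a unique "needle" fact is buried at a chosen depth inside a long stretch of filler text, and the model is asked to retrieve it. The sketch below is purely illustrative (not DeepMind's code); the function names are invented, and a simple substring search stands in for the actual model call.

```python
# Illustrative needle-in-a-haystack sketch. Function names are assumptions,
# and `retrieve` is a substring-search stand-in for querying an LLM.

def build_haystack(filler: str, needle: str, total_chars: int, depth: float) -> str:
    """Repeat filler text to ~total_chars and insert the needle
    at a relative depth between 0.0 (start) and 1.0 (end)."""
    body = (filler * (total_chars // len(filler) + 1))[:total_chars]
    pos = int(len(body) * depth)
    return body[:pos] + " " + needle + " " + body[pos:]

def retrieve(context: str, question_key: str) -> bool:
    # A real harness would send `context` plus a retrieval question
    # to the model under test and check its answer.
    return question_key in context

filler = "The grass is green. The sky is blue. "
needle = "The magic number is 42."

# Sweep the needle across depths, as in the report's heatmap-style evaluation.
for depth in (0.0, 0.25, 0.5, 0.75, 1.0):
    haystack = build_haystack(filler, needle, total_chars=10_000, depth=depth)
    print(depth, retrieve(haystack, "magic number is 42"))
```

The report extends this idea to multi-needle and multimodal variants at context lengths up to millions of tokens.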
The paper can be found here: storage.googleapis.com/deepmi...
Topics: #gemini #llm #google
For related content:
- Twitter: @samuelalbanie
- personal webpage: samuelalbanie.com/