Do you need 7B+ parameters to get great performance from your language models? Discover how Phi-2, Microsoft Research's 2.7-billion-parameter language model, challenges this norm by outperforming models up to 25x its size (according to Microsoft Research). We'll delve into the training methods behind Phi-2, from "textbook-quality" training data to scaled knowledge transfer. Then we'll load the model in a Google Colab notebook and try it out on coding, math, reasoning, and data extraction tasks.
Blog Post: www.microsoft.com/en-us/resea...
Phi-2 on HF Hub: huggingface.co/microsoft/phi-2
AlpacaEval: huggingface.co/microsoft/phi-...
AI Bootcamp (preview drops on Christmas): www.mlexpert.io/membership
Discord: / discord
Subscribe: bit.ly/venelin-subscribe
GitHub repository: github.com/curiousily/Get-Thi...
Join this channel to get access to the perks and support my work:
/ @venelin_valkov
00:00 - Intro
00:32 - AI Bootcamp on MLExpert.io
01:25 - What is Phi-2?
07:25 - Phi-2 vs Mistral vs Llama 2
08:42 - Phi-2 on HuggingFace Hub
10:45 - Google Colab Setup
13:35 - Prompt Format
14:25 - Text Generation
20:17 - Math
21:15 - Coding
24:30 - Text Analysis
26:46 - Conclusion
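The Prompt Format chapter (13:35) covers Phi-2's simple instruction convention. A minimal sketch of a helper that builds it, assuming the "Instruct: ... / Output:" format shown on the model card (the example question is illustrative, not from the video):

```python
def make_phi2_prompt(instruction: str) -> str:
    """Build a Phi-2 instruction prompt in the "Instruct: ...\nOutput:" format."""
    return f"Instruct: {instruction}\nOutput:"

# Example: a math-style question like those tried in the video.
print(make_phi2_prompt("What is the sum of the first 10 natural numbers?"))
```

The resulting string is what you'd pass to the tokenizer and `model.generate` after loading `microsoft/phi-2` with the Hugging Face transformers library, as done in the Colab walkthrough.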
#artificialintelligence #chatgpt #gpt4 #python #chatbot #llama2 #llm
Phi 2: Small Language Model Better Than 7B LLMs? | Google Colab Tutorial with Python