In this video, I explain how the DistilBERT model was trained with the knowledge distillation technique to create a smaller, faster version of the famous BERT model.
Previous Video on the Basics of Knowledge Distillation : • Knowledge Distillation...
Cross Entropy Loss : • Why do we need Cross E...
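As a quick illustration of the technique covered in the video, here is a minimal sketch of a knowledge-distillation loss in PyTorch. This is not the exact DistilBERT training objective (which also adds a masked-language-modeling loss and a cosine embedding loss); names like distillation_loss, T, and alpha are illustrative.

import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels, T=2.0, alpha=0.5):
    # Soft-target term: KL divergence between temperature-softened
    # teacher and student distributions. The T*T factor rescales the
    # gradients so this term stays comparable to the hard-label term.
    soft = F.kl_div(
        F.log_softmax(student_logits / T, dim=-1),
        F.softmax(teacher_logits / T, dim=-1),
        reduction="batchmean",
    ) * (T * T)
    # Hard-target term: standard cross-entropy against the true labels.
    hard = F.cross_entropy(student_logits, labels)
    # alpha weights how much the student imitates the teacher versus
    # fitting the ground-truth labels.
    return alpha * soft + (1 - alpha) * hard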