Full text tutorial with code (requires MLExpert Pro): www.mlexpert.io/bootcamp/depl...
Do you have a fine-tuned model (with a LoRA adapter) that you want to deploy as a REST API? In this video, we'll merge a LoRA adapter with its base model and upload the merged model (along with its tokenizer) to the HuggingFace Hub. Then we'll build a REST API with FastAPI and deploy it as a Docker container.
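The merge-and-push step can be sketched roughly like this. This is a minimal sketch, not the exact code from the video: the function name `merge_and_push` and all model/repo ids are placeholders (the video's actual repo name is truncated in the links below).

```python
def merge_and_push(base_model_id, adapter_dir, repo_id):
    """Merge a LoRA adapter into its base model and push the result to the Hub."""
    # Heavy imports are deferred so the sketch is cheap to import.
    import torch
    from peft import PeftModel
    from transformers import AutoModelForCausalLM, AutoTokenizer

    # Load the base model, then attach the trained LoRA adapter on top of it.
    base = AutoModelForCausalLM.from_pretrained(base_model_id, torch_dtype=torch.float16)
    model = PeftModel.from_pretrained(base, adapter_dir)

    # Fold the adapter weights into the base weights, so inference
    # no longer needs PEFT at all.
    merged = model.merge_and_unload()

    # Push the merged model and its tokenizer to the HuggingFace Hub
    # (requires being logged in, e.g. via `huggingface-cli login`).
    tokenizer = AutoTokenizer.from_pretrained(base_model_id)
    merged.push_to_hub(repo_id)
    tokenizer.push_to_hub(repo_id)
    return merged
```

After this, the merged model can be loaded from the Hub with a plain `AutoModelForCausalLM.from_pretrained(repo_id)`, with no adapter files needed.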
Model on HuggingFace Hub: huggingface.co/curiousily/tin...
HuggingFace Space: huggingface.co/spaces/curious...
API Docs: curiousily-tiny-crypto-sentim...
AI Bootcamp (in preview): www.mlexpert.io/membership
Discord: / discord
Subscribe: bit.ly/venelin-subscribe
GitHub repository: github.com/curiousily/Get-Thi...
00:00 - Intro
00:36 - Text tutorial on MLExpert
00:56 - Merge LoRA adapter with base model
05:01 - Push the model to HuggingFace Hub
06:46 - Test the model from the HuggingFace Hub
09:45 - REST API with FastAPI
14:36 - Dockerfile
15:45 - Create a HuggingFace Space for Docker
17:33 - Test the deployed REST API
19:05 - Conclusion
Join this channel to get access to the perks and support my work:
/ @venelin_valkov
#artificialintelligence #sentimentanalysis #llm #docker #llama2 #chatgpt #gpt4 #python #chatbot
Deploy (Tiny) LLM to Production: Merge LoRA Adapter, Push to HF Hub, REST API with FastAPI & Docker