In this video, we'll explore the performance differences when running Large Language Models (LLMs) in Ollama on the CPU versus the GPU. Watch a live C# demo that uses Microsoft.Extensions.AI to talk to Ollama running inside a Docker container. Curious how these models perform locally? Let's dive in and compare the results!
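For anyone following along at home, here's a minimal sketch of the kind of setup shown in the video, assuming the preview Microsoft.Extensions.AI.Ollama package (which supplies OllamaChatClient) and Ollama's default port; the model name, prompt, and timing code here are illustrative, not taken from the demo:

```csharp
using System.Diagnostics;
using Microsoft.Extensions.AI;

// Ollama is assumed to be listening on its default port (11434), started
// along the lines of the Ollama Docker blog post, e.g.:
//   docker run -d -v ollama:/root/.ollama -p 11434:11434 --name ollama ollama/ollama            (CPU only)
//   docker run -d --gpus=all -v ollama:/root/.ollama -p 11434:11434 --name ollama ollama/ollama (NVIDIA GPU)
// The model name is illustrative; pull whichever small model you want to test,
// e.g. `docker exec -it ollama ollama pull llama3.2`.
IChatClient client = new OllamaChatClient(new Uri("http://localhost:11434"), "llama3.2");

// Time a single completion; run the same prompt against the CPU-only and
// GPU-enabled containers to compare.
var stopwatch = Stopwatch.StartNew();
var response = await client.GetResponseAsync("Why is the sky blue?");
stopwatch.Stop();

Console.WriteLine(response);
Console.WriteLine($"Elapsed: {stopwatch.Elapsed.TotalSeconds:F1}s");
```

Because Microsoft.Extensions.AI exposes Ollama behind the generic IChatClient abstraction, the same C# code runs unchanged whether or not the container has GPU access, so the CPU/GPU comparison comes down to a one-flag Docker change.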
Useful links:
.NET Video Analyzer repository: aka.ms/netaivi...
Ollama in Docker: ollama.com/blo...
.NET & AI Show: • .NET AI Community Stan...