In this video Ali Zaidi and Mofi Rahman from Google Cloud covers a reference architecture of a ML Platform consisting of 4 teams with resource sharing between team using Kueue.
This covers Kueue aspects like cluster kueue with quota and cohorts and Kubernetes concepts like priority classes and preemption.
To run the tutorial yourself: github.com/GoogleCloudPlatfor...
Intro To Kueue: • Intro to Kueue
Basic Job Patterns: • Basic Job Patterns on ...
Негізгі бет Ғылым және технология Architecture of a ML Platform with Resource Sharing on Kubernetes
Пікірлер