For years I’ve had a dream of building a rack mounted PC capable of splitting its resources to host multiple GPU intensive VMs:
- a few gaming VMs
- a VM for work that can run Davinci Resolve and Blender renders
- an LLM server
- a Stable Diffusion server
- media server
Just to name a few possibilities…
Everytime I’ve looked into it, it seemed like the technology just wasn’t there yet. I remember a few years ago Linus TT took a shot at it, but in the end suggested the technology (for non-commercial entities) just wasn’t in a comfortable spot yet.
So how far off are we? Obviously AI focused companies seem to make it work, but what possibilities exist for us self-hosters who might also want to run multiple displays in addition to the web gui LLM servers? And without forking out crazy money for GPU virtualization software licenses?
GPU passthrough has been pretty good for a while. The reason why Linus couldn’t get it working reliably was because iirc, he tried to do it on windows… I’ve done it before with a single gpu and have very recently set it up again, now that I have a 2nd one and gotta say, it’s pretty damn good…
How are you handling displays and keyboard/mouse? Also what VM software?
Check out this video
It goes over all of the steps of setting it up.
Here is an alternative Piped link(s):
Check out this video
Piped is a privacy-respecting open-source alternative frontend to YouTube.
I’m open-source; check me out at GitHub.