Granite 4.1: IBM's 8B Model Is Competing With Models Four Times Its Size - Firethering

☆ Yσɠƚԋσʂ ☆@lemmygrad.ml · 25 days ago

PeeOnYou [he/him]@lemmygrad.ml · 25 days ago

i saw a comparison of the 8b model vs the 30b (iirc) dense model and it was almost the same… the 30b was slightly better on most tests, but only barely

☆ Yσɠƚԋσʂ ☆@lemmygrad.ml · 24 days ago

It’s honestly incredible to see because 8b is getting to the point where it will run well on a lot of consumer hardware. If we can get current frontier performance at that size, then you really would be able to solve most tasks locally.

CriticalResist8@lemmygrad.ml · 24 days ago

The 4-bit quantized GGUF for Granite 4.1 is under 5GB, so it will probably run on any modern machine, even one that isn't built with much VRAM… 6 gigs is what I had on my old 1080 GPU.

https://huggingface.co/unsloth/granite-4.1-8b-GGUF/tree/main
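
For anyone wondering where the "sub 5GB" figure comes from, here is a rough back-of-the-envelope sketch. It assumes a nominal 8 billion parameters (taken from the "8B" in the name) and an average of about 4.5 bits per weight for a 4-bit quant once scale factors are counted; both numbers are assumptions for illustration, not values read from the model files. The function name is hypothetical, and the estimate covers weights only, not context/KV cache.

```python
def approx_gguf_size_gb(n_params: float, bits_per_weight: float) -> float:
    """Rough weight-only file size in GB (10^9 bytes).

    Ignores file metadata and the KV cache needed at runtime.
    """
    return n_params * bits_per_weight / 8 / 1e9


# Assumed values: nominal 8e9 parameters, ~4.5 bits per weight on average.
size_gb = approx_gguf_size_gb(8e9, 4.5)
print(f"approx. weights size: {size_gb:.1f} GB")

# Leaves some headroom on a 6 GB card, though context length eats into it.
print("fits in 6 GB:", size_gb < 6.0)
```

The same function with 16 bits per weight gives about 16 GB for the unquantized weights, which is why the 4-bit file is the one that fits on older cards.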

☆ Yσɠƚԋσʂ ☆@lemmygrad.ml · 24 days ago

🎉