☆ Yσɠƚԋσʂ ☆@lemmygrad.ml to technology@hexbear.net · English · 2 months ago

Granite 4.1: IBM's 8B Model Is Competing With Models Four Times Its Size - Firethering

firethering.com

  • cross-posted to:
  • technology@lemmygrad.ml
  • hackernews@lemmy.bestiver.se
IBM just released Granite 4.1, a family of open-source language models built specifically for enterprise use. It comes in three sizes, is Apache 2.0 licensed, and was trained on 15 trillion tokens with a level of pipeline obsession that's worth understanding.

But there's one result in the benchmarks I keep coming back to: the 8B model. Dense architecture, no MoE tricks, no extended reasoning chains. It matches or beats Granite 4.0-H-Small across basically every benchmark they ran. That older model has 32B total parameters with 9B active. This one has 8 billion. Full stop.

That result is either very impressive or it means the old model was underbuilt. Probably both. Here's how they built it, what the numbers actually say, and whether any of it matters for your use case.
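For a rough sense of what "four times its size" means here, the parameter counts quoted above can be compared directly. This is only a back-of-the-envelope sketch: the `ModelSize` class and `weight_gb` helper are hypothetical names made up for illustration, and the memory figure assumes 2 bytes per parameter (16-bit weights) and counts weights only, ignoring KV cache and activations.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelSize:
    """Hypothetical helper holding the parameter counts quoted in the post."""
    name: str
    total_params_b: float   # total parameters, in billions
    active_params_b: float  # parameters used per token, in billions

    def weight_gb(self, bytes_per_param: int = 2) -> float:
        # Assumption: 16-bit weights (2 bytes each); weights only.
        # Billions of parameters times bytes per parameter gives gigabytes.
        return self.total_params_b * bytes_per_param


# Figures taken from the post: a dense 8B model vs. a 32B MoE with 9B active.
dense = ModelSize("Granite 4.1 8B", total_params_b=8.0, active_params_b=8.0)
moe = ModelSize("Granite 4.0-H-Small", total_params_b=32.0, active_params_b=9.0)

# Ratio of total parameters (what has to sit in memory).
total_ratio = moe.total_params_b / dense.total_params_b
# Ratio of active parameters (roughly, per-token compute).
active_ratio = moe.active_params_b / dense.active_params_b

print(f"total params ratio:  {total_ratio:.3f}x")
print(f"active params ratio: {active_ratio:.3f}x")
print(f"{dense.name}: ~{dense.weight_gb():.0f} GB of weights")
print(f"{moe.name}: ~{moe.weight_gb():.0f} GB of weights")
```

Under those assumptions, the "four times" gap is about total parameters and memory footprint. Per token, the older model activates only slightly more parameters than the new dense one.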
