• 0 Posts
  • 7 Comments
Joined 3 years ago
Cake day: June 28th, 2023



  • Well, if you’re that sensitive, then fine. In essence, it’s just the classic fear of public speaking. If you’re uncomfortable expressing your opinion in front of a wide audience, why do it? Not everyone in the audience likes you, yes, that’s right. There is no tragedy in that, and it’s not a reason to ban everyone. Just don’t speak out publicly; gather an interest group and discuss topics there. There are worse things in the world than a vote against you.

    P.S. And you don’t have to stress so much that this isn’t criticism of me, because it doesn’t matter. Gather your will and just say what you want, and let people do with it what they want. That’s how it works. There are moderators for everything else.

    UPD: I’m Russian, by the way, so I can catch downvotes just for breathing. Not very often, but still. Well, that’s life.



  • I don’t know about you, but I believe that people who gave me a negative vote can still create content that interests me. After all, if a person doesn’t agree with my opinion, that doesn’t make their comments and posts any worse for me. Otherwise, I could say something stupid that I’d be ashamed to reread in 5 years, and end up blocking half the platform.

    Lemmy already has one of the softest voting systems. On Habr, for example, if your rating is negative, you can’t write comments at all.


  • Yes, it is. But I have llama-swap and Open WebUI. If you spend some time on the llama-swap configuration (a rough sketch is at the end of this comment), you have a good chance of running a model across 2 cards through llama.cpp. The gains, however, won’t be 2x, of course, and they fall off non-linearly with the number of cards. You also need a motherboard with enough PCIe lanes (2 PCIe x16 slots or more). But it’s still cheaper than one large card. Example:

    # Split one GGUF model across two AMD GPUs (ROCm) with llama.cpp's llama-server
    HIP_VISIBLE_DEVICES=0,1 \
    /opt/llama.cpp/build/bin/llama-server \
      --host 127.0.0.1 \
      --port 8082 \
      --model /storage/models/model.gguf \
      --n-gpu-layers all \
      --split-mode layer \
      --tensor-split 1,1 \
      --ctx-size 32768 \
      --batch-size 512 \
      --ubatch-size 512 \
      --flash-attn on \
      --parallel 1
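
    Once it’s up, you can sanity-check it through llama-server’s HTTP API: it exposes a /health endpoint and an OpenAI-compatible /v1/chat/completions route on whatever you passed to --port. The prompt here is just a placeholder:

    # quick liveness check
    curl http://127.0.0.1:8082/health

    # OpenAI-compatible chat endpoint served by llama-server
    curl http://127.0.0.1:8082/v1/chat/completions \
      -H "Content-Type: application/json" \
      -d '{"messages":[{"role":"user","content":"Hello from two GPUs"}]}'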
    

    There is a less stable but higher-throughput alternative: --split-mode row

    P.S. By the way, a single RX 9070 XT on my instance translates posts and comments. You can test it if you want. =)
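
    And here is roughly how such a command can be wired into llama-swap, as promised above. This is a minimal sketch: the models/cmd/env keys and the ${PORT} macro are how I remember llama-swap’s YAML schema, and the model name and config path are made up, so check the project README for the exact format:

    # Hypothetical llama-swap config; model name and paths are placeholders.
    # llama-swap fills in ${PORT} itself when it launches the backend.
    cat > /opt/llama-swap/config.yaml <<'EOF'
    models:
      "my-model":
        env:
          - "HIP_VISIBLE_DEVICES=0,1"
        cmd: >
          /opt/llama.cpp/build/bin/llama-server
          --host 127.0.0.1 --port ${PORT}
          --model /storage/models/model.gguf
          --n-gpu-layers all --split-mode layer --tensor-split 1,1
          --ctx-size 32768 --flash-attn on
    EOF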