Summary:

I downvoted pro-AI comments on a post in the leftymemes community. It was an LLM-generated polandball comic (which is objectively pathetic as fuck) that showed up on my feed. I blocked a couple of users who I thought were unhinged, and then blocked the whole instance on my client after realizing how rabid these morons are.

I didn’t go looking for AI posts like a vigilante.

One of the users in question got miffed about being downvoted and banned me from the places they moderate.

  • Arthur Besse@lemmy.ml
    2 days ago

    They aren’t pro corpo AI.

    They’re very much against the mass scraping/DDoS AI companies are doing.

    All of the self-hostable LLMs and image generators people are using today (or at least, all of the ones capable of the quality people have come to expect over the last few years) are trained on massive scraped datasets far beyond the reach of hobbyists. There are many so-called “open source” models which are free to modify (e.g., by fine-tuning) and to redistribute, but the data used for the initial training (which hobbyists are allowed to build upon) cannot be published because doing so would obviously be large-scale copyright infringement.

    Also, even with the data (which in many cases also needs to be labeled/annotated using human labor), the cost of training such a model from scratch is astronomical.

    As a pirate myself, I totally understand how, after reading that Meta’s training data included 82TB of pirated books they torrented, one’s first thought might be “🤤” … but to imagine that this makes Meta our ally in the fight against copyright is some temporarily-embarrassed-millionaire kind of thinking.