Is it fairly easy? Seems useful for a public site like Lemmy and the fediverse
https://nightshade.cs.uchicago.edu/whatis.html
https://decrypt.co/203153/ai-prompt-data-poisoning-nightshared
I’ve said it many times, but the channels I speak through are small, so from the top!
If you put your artwork online in any public location, make sure your signature or even a QR code is obnoxiously large and centered on the image. Humans can still see and enjoy what you’ve made, AI won’t be able to discern anything, and if it happens to get ripped by one of those Chinese T-shirt bots, at least anyone who buys will know who the original artist is.
TIL that there exist people who aren’t bothered by obnoxious watermarks superimposed on an image. I find them aggravating, and I’m not the only one – That’s shutterstock’s entire business model.
AI is already making people’s lives worse. Let’s not make human art harder to enjoy in a fruitless effort to resist it. Instead, let’s solve the root of the problem.
Hey chatgpt or whatever ai model, recreate this image without the silly QR code.
For image tracking it’s enough to use Imgur for sharing, for any image, even own ones, no AI image needed. I miss the bot in Lemmy which redirects Videos to Piped, when Imgur is worst. Better alternatives, like File Coffee or Vgy.me, made in the EU are desirable.
I’m very skeptical that this “model poisoning” approach will work in practice. To pull it off would require a very high level of coordination among disparate people generating the training data (the images/text). I just can’t imagine it happening. Add to that: big tech has A LOT of resources to play this cat & mouse game.
I hope I’m wrong, but I predict big tech wins here.
At the moment, I’m just adding the license to my text, but if somebody has something I could copypaste and put into a spoiler to poison AI training, that’d be great.
Insert poison pill here
Nothing here yes!
One thing I was kinda wondering about - as long as there’s nothing in the T&Cs of your instance, don’t you implicitly hold the copyright to your comment? Isn’t the CC license actually more permissive? Or is it more about “that model was trained on content available under this license, to comply with it, they have to follow it’s terms”?
Or is it more about “that model was trained on content available under this license, to comply with it, they have to follow it’s terms”?
Close. Creative Commons is a copyleft license with restrictions. The important restriction in this case is not allowing commercial use.
No, because a method that works on one implementation almost certainly doesn’t work on another.
Understandable, quite the bummer
This doesn’t have anything to do with tracking. This is supposed to sabotage free and open image generators (ie stable diffusion). It’s unlikely to do anything, though.
Hard to say what the makers want to achieve with this. Even if it did work, it would help artists just as much, as better DRM would help programmers. On its face, this is just about enforcing some ultra-capitalist ideology that wants information to be owned.
I see it as trying to combat the dystopia where not only is our data scraped but now every single thing we write, draw or film is fed into an AI that will ultimately be used to create huge amounts of wealth for very few, essentially monetizing our very existence online in a way thats entierly unavoidable and without consent.
In addition its entierly one way, google and others can grab as much of our data as they want while most of us would have an extremely hard time even getting granted a freedom of information request about ourselves, let alone grabbing a similar amount of data about those same corporations.
that will ultimately be used to create huge amounts of wealth for very few,
But… That is what these poisoning attacks are fighting for. They are attacking open image generators that can be used by anyone. You can use them for fun or for business, without having to pay rent to some owner who is not lifting a finger. What do you think will happen if you knock that out?