Skip Navigation

lily33

@ lily33 @lemm.ee

Posts

0
Comments

112
Joined

3 yr. ago

11mo ago

Deleted

Permanently Deleted
Jump
1

lily33 @lemm.ee 11mo ago
I would love to think so. But the word "verified" suggests more.

11mo ago

isInHell = 'true'

Jump

lily33 @lemm.ee 11mo ago

That makes me think, perhaps, you might be able to set it to exec("stuff") or True...

11mo ago

Permanently Deleted

Jump

lily33 @lemm.ee 11mo ago

while allowing legitimate users and verified crawlers to browse normally.

What is a "verified crawler" though? What I worry about is, is it only big companies like Google that are allowed to have them now?

11mo ago

China will enforce clear flagging of all AI generated content starting from September

Jump

lily33 @lemm.ee 11mo ago

I agree that it's difficult to enforce such a requirement on individuals. That said, I don't agree that nobody cares for the content they post. If they have "something cool they made with AI generation" - then it's not a big deal to have to mark it as AI-generated.

12mo ago

Zuckerberg’s Meta considered sharing [CN/HK] user data with China, whistleblower alleges

Jump

lily33 @lemm.ee 12mo ago

Is that something new? As in, has WaPo not been willing to go after Meta in a similar manner before?

12mo ago

Zuckerberg’s Meta considered sharing [CN/HK] user data with China, whistleblower alleges

Jump

lily33 @lemm.ee 12mo ago

So, essentially, they wanted to enter the Chinese market so much that they were even willing to comply with the local rules and regulations!

This is such a big secret, we really needed a whistleblower to tell us that!

12mo ago

[Report] OpenAI says it found evidence of an AI-powered surveillance tool used by a Chinese security operation to identify anti-China posts on social media in the West.

Jump

lily33 @lemm.ee 12mo ago

An intelligence service monitors social media. They may as well have said, "The sky is blue."

More interesting is,

Sharing as a force multiplier
-- OpenAI

12mo ago

DeepSeek found to be sharing user data with TikTok parent company ByteDance

Jump

lily33 @lemm.ee 12mo ago

Do you know of a provider is actually private? The few privacy policies I checked all had something like "We might keep some of your data for some time for anti-abuse or other reasons"...

1y ago

I want a programming language that supports German style composite words

Jump

lily33 @lemm.ee 1y ago

Too bad that's based on macros. A full preprocessor could require that all keywords and names in each scope form a prefix code, and then allow us to freely concatenate them.

1y ago

Russia asks Elon Musk to hand over names of dissidents: MPs in Moscow want a list of Russians funded by USAid, the American aid scheme shut down by Donald Trump, to be given to the security services

Jump

lily33 @lemm.ee 1y ago

Aren't USAid grants public?

1y ago

Why don't social media and the web in general implement asymmetric key signing to always be able to verify the source of a content, thus eliminating the risk of deep fakes and fake news?

Jump

lily33 @lemm.ee 1y ago

No, that's because social media is mostly used for informal communication, not scientific discourse.

I guarantee you that I would not use lemmy any differently if posts were authenticated with private keys than I do now when posts are authenticated by the user instance. And I'm sure most people are the same.

Edit: Also, people can already authenticate the source, by posting a direct link there. Signing wouldn't really add that much to that.

1y ago

Why don't social media and the web in general implement asymmetric key signing to always be able to verify the source of a content, thus eliminating the risk of deep fakes and fake news?

Jump

lily33 @lemm.ee 1y ago

Sure, but that has little to do with disinformation. Misleading/wrong posts don't usually spoof the origin - they post the wrong information in their own name. They might lie about the origin of their "information", sure - but that's not spoofing.

1y ago

Why don't social media and the web in general implement asymmetric key signing to always be able to verify the source of a content, thus eliminating the risk of deep fakes and fake news?

Jump

lily33 @lemm.ee 1y ago

I don't understand how this will help deep fake and fake news.

Like, if this post was signed, you would know for sure it was indeed posted by @lily33@lemm.ee, and not by a malicious lemm.ee admin or hacker*. But the signature can't really guarantee the truthfulness of the content. I could make a signed post that claiming that the Earth is flat - or a deep fake video of NASA'a administrator admitting so.

Maybe I'm missing your point?

(*) unless the hacker hacked me directly

1y ago

Redshift isn't maintained anymore. what to use?

Jump

lily33 @lemm.ee 1y ago

It works fine for me on Hyprland.

1y ago

unsafeCode

Jump

lily33 @lemm.ee 1y ago

That is why I use just int main(){...} without arguments instead.

1y ago

How do I poison pdfs against LLM ?

Jump

lily33 @lemm.ee 1y ago

I don't think any kind of "poisoning" actually works. It's well known by now that data quality is more important than data quantity, so nobody just feeds training data in indiscriminately. At best it would hamper some FOSS AI researchers that don't have the resources to curate a dataset.

1y ago

Proton's biased article on Deepseek

Jump

lily33 @lemm.ee 1y ago

What makes these consumer-oriented models different is that that rather than being trained on raw data, they are trained on synthetic data from pre-existing models. That’s what the “Qwen” or “Llama” parts mean in the name. The 7B model is trained on synthetic data produced by Qwen, so it is effectively a compressed version of Qen. However, neither Qwen nor Llama can “reason,” they do not have an internal monologue.

You got that backwards. They're other models - qwen or llama - fine-tuned on synthetic data generated by Deepseek-R1. Specifically, reasoning data, so that they can learn some of its reasoning ability.

But the base model - and so the base capability there - is that of the corresponding qwen or llama model. Calling them "Deepseek-R1-something" doesn't change what they fundamentally are, it's just marketing.

1y ago

Proton's biased article on Deepseek

Jump

lily33 @lemm.ee 1y ago

There are already other providers like Deepinfra offering DeepSeek. So while the the average person (like me) couldn't run it themselves, they do have alternative options.

1y ago

Proton's biased article on Deepseek

Jump

lily33 @lemm.ee 1y ago

A server grade CPU with a lot of RAM and memory bandwidth would work reasonable well, and cost "only" ~$10k rather than 100k+...