Critical thinking
ByteOnBikes@slrpnk.net to Microblog Memes@lemmy.world · English · 1 year ago · 214 comments
TheTechnician27@lemmy.world · 1 year ago

> It’s a two-pass solution, but it makes it a lot more reliable.

So your technique to “make it a lot more reliable” is to ask an LLM a question, then run the LLM’s answer through an equally unreliable LLM to “verify” the answer?

We’re so doomed.
Apepollo11@lemmy.world · 1 year ago

Give it a try.

The key is in the different prompts. I don’t think I should really have to explain this, but different prompts produce different results.

Ask it to create something, it creates something.

Ask it to check something, it checks something.

Is it flawless? No. But it’s pretty reliable.

It’s literally free to try it now, using ChatGPT.
TheTechnician27@lemmy.world · 1 year ago

> I don’t think I should really have to explain this, but different prompts produce different results.
Apepollo11@lemmy.world · 1 year ago

Hey, maybe you do.

But I’m not arguing anything contentious here. Everything I’ve said is easily testable and verifiable.