ByteOnBikes@slrpnk.net to Microblog Memes@lemmy.worldEnglish · 3 个月前Critical thinkingslrpnk.netimagemessage-square215linkfedilinkarrow-up11.61Karrow-down133
arrow-up11.58Karrow-down1imageCritical thinkingslrpnk.netByteOnBikes@slrpnk.net to Microblog Memes@lemmy.worldEnglish · 3 个月前message-square215linkfedilink
minus-squareTheTechnician27@lemmy.worldlinkfedilinkEnglisharrow-up21arrow-down3·3 个月前 It’s a two-pass solution, but it makes it a lot more reliable. So your technique to “make it a lot more reliable” is to ask an LLM a question, then run the LLM’s answer through an equally unreliable LLM to “verify” the answer? We’re so doomed.
minus-squareApepollo11@lemmy.worldlinkfedilinkEnglisharrow-up3arrow-down8·edit-23 个月前Give it a try. The key is in the different prompts. I don’t think I should really have to explain this, but different prompts produce different results. Ask it to create something, it creates something. Ask it to check something, it checks something. Is it flawless? No. But it’s pretty reliable. It’s literally free to try it now, using ChatGPT.
minus-squareTheTechnician27@lemmy.worldlinkfedilinkEnglisharrow-up11arrow-down1·3 个月前 I don’t think I should really have to explain this, but different prompts produce different results.
minus-squareApepollo11@lemmy.worldlinkfedilinkEnglisharrow-up2·3 个月前Hey, maybe you do. But I’m not arguing anything contentious here. Everything I’ve said is easily testable and verifiable.
So your technique to “make it a lot more reliable” is to ask an LLM a question, then run the LLM’s answer through an equally unreliable LLM to “verify” the answer?
We’re so doomed.
Give it a try.
The key is in the different prompts. I don’t think I should really have to explain this, but different prompts produce different results.
Ask it to create something, it creates something.
Ask it to check something, it checks something.
Is it flawless? No. But it’s pretty reliable.
It’s literally free to try it now, using ChatGPT.
Hey, maybe you do.
But I’m not arguing anything contentious here. Everything I’ve said is easily testable and verifiable.