• Meron35@lemmy.world
    link
    fedilink
    arrow-up
    3
    ·
    21 hours ago

    The system prompt guardrail is so jank that people run competitions and games to to beat them every time a new LLM comes out. Usually you see people beating guardrails hours within release.

    Other keywords to search include prompt injection.

    Gandalf | Lakera – Test your AI hacking skills - https://gandalf.lakera.ai/adventure-8