• sorter_plainview@lemmy.today
    link
    fedilink
    English
    arrow-up
    2
    ·
    6 天前

    RTK asks you to trade deterministic reliability, semantic completeness, and architecture simplicity for a flashy reduction in raw terminal tokens.

    Aahm… LLMs have deterministic reliability? Semantic completeness?

  • setsubyou@lemmy.world
    link
    fedilink
    English
    arrow-up
    2
    ·
    6 天前

    I have the same doubts… it’s very easy to count how much text it cuts out but it means nothing without benchmarks that show accuracy isn’t hurt.

    There’s certainly things that are just not needed. But even permissions and owner in ls has cases where it’s useful - if the agent needs to debug a permission issue and ls doesn’t work, it will figure out some other way, and that takes tokens. Hopefully you’re not debugging permissions often though.

    I also have no idea if the unchanged lines in a diff are noise or if they help the LLM. What if the context around the changed lines is really helpful? We can’t just decide that parts humans don’t need are useless. I mean there’s literally research showing that even repeating the prompt can improve accuracy.