Assian_Candor [comrade/them]

Organic guy. Silicon Valley hater.

  • 95 Posts
  • 2.31K Comments
Joined 4 years ago
cake
Cake day: November 26th, 2022

help-circle
















  • Yeah probably a better pi setup would be to create more granular agent types. I tried creating a worker powered by deepseek but found it wasn’t really that good. Something like an agent team spec (use a technical product manager agent to coordinate modules, review the worker output, and to make necessary corrections) then you could use a frontier model to plan and issue work instructions to the TPM agent

    Not sure if this would be more economical in practice but would be interesting to try

    Long run I think you will always need the data centers to handle big training loads but we might go back to on-prem computing for enterprises to run frontier models. Or even edge nodes that have enough muscle that folks can subscribe to locally.

    Something in your city you can sub to for $30 a month that lets you run any flagship open source model could be very compelling


  • I actually prefer gpt 5.5 to opus 4.7 for coding (haven’t tried 4.8 yet because what am I made of money?) and it is way way more token efficient.

    When Claude says it’s “bloviating” it really means it

    A big part of the problem is the harness which you allude to. Claude locks you into Claude code which is bloated and absolute ass at context management.

    Once the Chinese models reach parity with the current generation models it’ll be a race to the bottom. Deepseek v-4 pro is right there but not quite. The models now are strong enough to be generalist problem solvers. Anything stronger will only benefit niche applications.

    I’d like to see something like the gpt chat interface within a coding harness where the model is capable of selecting what to delegate to based on the task. This is where we will go in the future. A lot of enterprises incinerate tokens with Claude because people are using opus to write emails or whatever. It’s like taking a Ferrari to the grocery store.

    These will become commodities though I think at which point it’s all about the integrations. You can already see the big providers pivoting into these value added services with GPT leaning into the consumer market with apps and Claude going heavy on business/enterprise