• Franconian_Nomad@feddit.org
    19 days ago

    Interesting. I should have read the Cloudflare article, not just linked it. Of course, Anthropic does the bullshit it’s known for.

    But I’ve heard of several security researchers experimenting with their own harnesses. It seems to make quite a difference.

    • midribbon_action@lemmy.blahaj.zone
      19 days ago

      My question is why these harnesses are even necessary. The Cloudflare pipeline is not specific to any codebase; it is just secret sauce they added themselves that increases the costs dramatically. Cloudflare is not an AI company, though. Anthropic is, and OpenAI and Anthropic have spent tens of millions on signing bonuses for all of the most competent AI researchers in the field.

      Why is it Cloudflare’s job to make the model useful? Why doesn’t the model do what it says it will without multiplying the token burn rate 5-10x? Why not ship a harness developed by the AI experts, if a harness is truly necessary? The idea of adversarial machine learning is more than a decade old; it’s not like Cloudflare stumbled on a new concept.

      I believe this is just another attempt to hide the true cost of inference.

      • Franconian_Nomad@feddit.org
        19 days ago

        From what I gather, a different harness can make quite a difference. It seems a model can work better or worse depending on the harness; that’s at least what I’ve heard from the community.

        A harness for coding is probably different from a harness for agentic tasks like Hermes or opencode. It probably also helps if you don’t vibe code your harness with little or no supervision. (Cough, Claude Code, cough)