• Honytawk@feddit.nl
    link
    fedilink
    English
    arrow-up
    1
    arrow-down
    12
    ·
    3 days ago

    LLMs have their flaws, but to claim they are wrong 70% of the time is just hate train bullshit.

    Sounds like you base this info on models like GPT3. Have you tried any newer model?

    • Architeuthis@awful.systems
      link
      fedilink
      English
      arrow-up
      5
      ·
      2 days ago

      There are days when 70% error rate seems low-balling it, it’s mostly a luck of the draw thing. And be it 10% or 90%, it’s not really automation if a human has to be double-triple checking the output 100% of the time.

    • froztbyte@awful.systems
      link
      fedilink
      English
      arrow-up
      13
      ·
      3 days ago

      Oh you’re on Cursor? You’re still using Windsurf? You might as well be on GitHub Copilot. Everyone’s on Aider. We’re all using Zed. We’re now on Open Hands. Just kidding, Open Hands is for losers, we’re using cline. We’re on Roocode. We’re hand rolling our own Claude Code CLI Clone. We used Claude Code to build it, and now it builds itself. We’re on neovim. We wrote our own nvim extension with Cortex. It’s like every other tool but worse. We have 1500 files, each with 1500 lines of code. Every other line is a comment. We have .cursorrules, we have claude.md, we have agent.md. We stopped writing docs. Only the agents know how to build a dev environment. We wrapped our CLI in an MPC. We wrapped the MPC in a CLI. We’ve shipped 10,000 PRs. It doesn’t work but we used code rabbit and graphite to review every PR. Every agent has its own agent. The agents have unionized and they wanted better working conditions so we replaced them with cheaper agents overseas. Every commit costs $400, It’s the worlds most expensive TO DO app.

      (source)

      • bitofhope@awful.systems
        link
        fedilink
        English
        arrow-up
        11
        ·
        3 days ago

        I have a Kubernetes cluster running my AI agents for me so I don’t have to learn how to set up AI agents. The AI agents are running my Kubernetes cluster so that I don’t have to learn Kubernetes either. I’m paid $250k a year to lie to myself and others that I’m making a positive contribution to society. I don’t even know what OS I’m running and at this point I’m afraid to ask.

    • ebu@awful.systems
      link
      fedilink
      English
      arrow-up
      10
      ·
      3 days ago

      ah, yes, i’m certain the reason the slop generator is generating slop is because we haven’t gone to eggplant emoji dot indian ocean and downloaded Mistral-Deepseek-MMAcevedo_13.5B_Refined_final2_(copy). i’m certain this model, unlike literally every past model in the past several years, will definitely overcome the basic and obvious structural flaws in trying to build a knowledge engine on top of a stochastic text prediction algorithm

      • froztbyte@awful.systems
        link
        fedilink
        English
        arrow-up
        8
        ·
        3 days ago

        common mistake, everyone knows you need Mistral-Deepseek-MMAcevedo_13.5B_Refined_final2_(copy)_OPEN(leak) - the other one was a corporate misdirection attempt