Abacus.ai:

We recently released Smaug-72B-v0.1 which has taken first place on the Open LLM Leaderboard by HuggingFace. It is the first open-source model to have an average score more than 80.

  • glimse@lemmy.world
    link
    fedilink
    English
    arrow-up
    8
    ·
    2 年前

    Based on the other comments, it seems like this needs 4x as much ram than any consumer card has

    • FaceDeer@kbin.social
      link
      fedilink
      arrow-up
      4
      ·
      2 年前

      It hasn’t been quantized, then. I’ve run 70B models on my consumer graphics card at a reasonably good tokens-per-second rate.

    • DarkThoughts@fedia.io
      link
      fedilink
      arrow-up
      2
      ·
      2 年前

      I’m curious how local generation goes with potentially dedicated AI extensions using stuff like tensor cores and their own memory instead of hijacking parts of consumer GPUs for this.