• tubbadu@lemmy.kde.social
    10 months ago

    Seems interesting! Do I need high-end hardware, or can I run them on the old laptop that I use as a home server?

    • Falcon@lemmy.world
      10 months ago

      Oh no, you need at least a 3060 :(

      Requires CUDA. They’re essentially large mathematical equations that compute the probability of the next word.
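
      To make the "probability of the next word" idea concrete, here is a toy sketch in Python. The scores and candidate words are made up for illustration, and the function name `next_word_probabilities` is my own; a real model computes its scores with billions of parameters, but the last step (softmax) looks roughly like this:

```python
import math

def next_word_probabilities(scores):
    """Turn raw scores (one per candidate word) into probabilities via softmax."""
    highest = max(scores.values())  # subtract the max for numerical stability
    exps = {word: math.exp(s - highest) for word, s in scores.items()}
    total = sum(exps.values())
    return {word: e / total for word, e in exps.items()}

# Hypothetical scores for the word following "the cat sat on the"
scores = {"mat": 3.0, "sofa": 2.0, "moon": 0.1}
probs = next_word_probabilities(scores)
best = max(probs, key=probs.get)  # the most likely next word
print(best, probs)
```

      The probabilities always add up to 1, and the word with the highest score ends up the most likely.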

      The equations are derived by repeatedly adjusting their numeric values until the result works well (this is the "learning" in machine learning). The trick is to change the numbers in a way that makes the result a little better each time (see e.g. gradient descent).
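
      A minimal sketch of that "get a little better each time" loop, assuming the simplest possible model (a single number `w` in `y = w * x`) and made-up data. The names `fit_slope`, `learning_rate`, and `steps` are just for illustration; real training does the same thing with billions of numbers at once, which is why a GPU helps:

```python
def fit_slope(xs, ys, learning_rate=0.01, steps=500):
    """Fit w in y = w * x by gradient descent on the mean squared error."""
    w = 0.0  # start from an arbitrary guess
    n = len(xs)
    for _ in range(steps):
        # Gradient of mean((w*x - y)^2) with respect to w
        grad = sum(2 * (w * x - y) * x for x, y in zip(xs, ys)) / n
        # Step against the gradient, so the error shrinks
        w -= learning_rate * grad
    return w

# Toy data generated from y = 3 * x
xs = [1.0, 2.0, 3.0, 4.0]
ys = [3.0, 6.0, 9.0, 12.0]
w = fit_slope(xs, ys)
print(w)
```

      Each pass nudges `w` in the direction that reduces the error, so it settles close to 3 without ever being told the answer directly.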