OpenAI now tries to hide that ChatGPT was trained on copyrighted books, including J.K. Rowling’s Harry Potter series::A new research paper laid out ways in which AI developers should try and avoid showing LLMs have been trained on copyrighted material.

  • scarabic@lemmy.world
    link
    fedilink
    English
    arrow-up
    12
    ·
    2 年前

    First, we don’t have to make AI.

    Second, it’s not about it being unable to learn, it’s about the fact that they aren’t paying the people who are teaching it.

      • FatCrab
        link
        fedilink
        English
        arrow-up
        7
        ·
        2 年前

        The reasoning that claims training a generative model is infringing IP would still mean a robot going into a library with a card it has to optically read all the books there to create the same generative model would still be infringing IP.

      • AncientMariner@lemmy.world
        link
        fedilink
        English
        arrow-up
        3
        ·
        2 年前

        Humans can judge information make decisions on it and adapt it. AI mostly just looks at what is statistically what is most likely based on training data. If 1 piece of data exists, it will copy, not paraphrase. Example was from I think copilot where it just printed out the code and comments from an old game verbatim. I think Quake2. It isn’t intelligence, it is statistical copying.

      • assassin_aragorn@lemmy.world
        link
        fedilink
        English
        arrow-up
        4
        ·
        2 年前

        because it might hurt authors and musicians and artists and other creative workers

        FTFY. Corporations shouldn’t be making a fucking dime from any of these works without fairly paying the creators.