Lemmy.one
  • Communities
  • Create Post
  • heart
    Support Lemmy
  • search
    Search
  • Login
  • Sign Up
3hax6ejo@lemm.ee to AI@lemmy.ml · 2 years ago

PoisonGPT: How we hid a lobotomized LLM on Hugging Face to spread fake news

blog.mithrilsecurity.io

external-link
message-square
0
fedilink
  • cross-posted to:
  • nev@lemmy.intai.tech
  • auai@programming.dev
  • hackernews@derp.foo
  • robotics_and_ai@mander.xyz
  • ai_infosec@infosec.pub
  • technews@radiation.party
18
external-link

PoisonGPT: How we hid a lobotomized LLM on Hugging Face to spread fake news

blog.mithrilsecurity.io

3hax6ejo@lemm.ee to AI@lemmy.ml · 2 years ago
message-square
0
fedilink
  • cross-posted to:
  • nev@lemmy.intai.tech
  • auai@programming.dev
  • hackernews@derp.foo
  • robotics_and_ai@mander.xyz
  • ai_infosec@infosec.pub
  • technews@radiation.party
We will show in this article how one can surgically modify an open-source model, GPT-J-6B, and upload it to Hugging Face to make it spread misinformation while being undetected by standard benchmarks.
alert-triangle
You must log in or # to comment.

AI@lemmy.ml

artificial_intel@lemmy.ml

Subscribe from Remote Instance

Create a post
You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: !artificial_intel@lemmy.ml

Artificial intelligence (AI) is intelligence demonstrated by machines, unlike the natural intelligence displayed by humans and animals, which involves consciousness and emotionality. The distinction between the former and the latter categories is often revealed by the acronym chosen.

Visibility: Public
globe

This community can be federated to other instances and be posted/commented in by their users.

  • 18 users / day
  • 43 users / week
  • 742 users / month
  • 1.55K users / 6 months
  • 100 local subscribers
  • 5.16K subscribers
  • 562 Posts
  • 1.8K Comments
  • Modlog
  • mods:
  • BE: 0.19.7
  • Modlog
  • Legal
  • Instances
  • Docs
  • Code
  • join-lemmy.org