Campfyre
  • Communities
  • Create Post
  • Create Community
  • heart
    Support Lemmy
  • search
    Search
  • Login
  • Sign Up
Lugh@futurology.todayM to Futurology@futurology.todayEnglish · 2 years ago

Two-faced AI language models learn to hide deception - ‘Sleeper agents’ seem benign during testing but behave differently once deployed. And methods to stop them aren’t working.

www.nature.com

external-link
message-square
9
fedilink
12
external-link

Two-faced AI language models learn to hide deception - ‘Sleeper agents’ seem benign during testing but behave differently once deployed. And methods to stop them aren’t working.

www.nature.com

Lugh@futurology.todayM to Futurology@futurology.todayEnglish · 2 years ago
message-square
9
fedilink
Two-faced AI language models learn to hide deception
www.nature.com
external-link
‘Sleeper agents’ seem benign during testing but behave differently once deployed. And methods to stop them aren’t working.
  • Possibly linux@lemmy.zip
    link
    fedilink
    English
    arrow-up
    1
    ·
    edit-2
    2 years ago

    Its too late for that honestly

    • mateomaui@reddthat.com
      link
      fedilink
      English
      arrow-up
      2
      ·
      2 years ago

      Alright, I’ll switch to digging holes for the family burial ground.

Futurology@futurology.today

futurology@futurology.today

Subscribe from Remote Instance

Create a post
You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: [email protected]
Visibility: Public
globe

This community can be federated to other instances and be posted/commented in by their users.

  • 165 users / day
  • 484 users / week
  • 1.57K users / month
  • 5.44K users / 6 months
  • 1 local subscriber
  • 3.07K subscribers
  • 1.96K Posts
  • 11.8K Comments
  • Modlog
  • mods:
  • voidx@futurology.today
  • Lugh@futurology.today
  • Espiritdescali@futurology.today
  • AwesomeLowlander@futurology.today
  • BE: 0.19.7
  • Modlog
  • Legal
  • Instances
  • Docs
  • Code
  • join-lemmy.org