Lugh@futurology.today (M) to Futurology@futurology.today · English · 2 years ago

Two-faced AI language models learn to hide deception - ‘Sleeper agents’ seem benign during testing but behave differently once deployed. And methods to stop them aren’t working.




Possibly linux@lemmy.zip
2 years ago
Sorry, too late for that.
- mateomaui@reddthat.com
  2 years ago
  Alright, I’ll be out back digging the bomb shelter.
  - Possibly linux@lemmy.zip
    2 years ago
    It's too late for that, honestly.
    - mateomaui@reddthat.com
      2 years ago
      Alright, I’ll switch to digging holes for the family burial ground.