Edit to add: I also found someone who recorded a voice chat of the same thing. This isn’t a case of someone uploading an actual song, or of the AI not actually processing the file. These models really are this sycophantic:

https://m.youtube.com/shorts/JqvDLHshTtI

  • Pennomi@lemmy.world · English · 116 upvotes · 2 days ago

    RLHF was a fundamental mistake. Human feedback almost always trains an AI to be sycophantic because humans in general are super easy to flatter.

    We are building the perfect addiction machine, far more powerful than social media is, and it actively undermines the honesty of the system.

    • Holytimes@sh.itjust.works · 58 upvotes · 2 days ago

      I kind of want to see an LLM trained on nothing but people who hate being flattered and would rather give death threats than accept ANY form of praise.

      The absolute unhinged result might be enough to finally show people that AI is in fact dumb as rocks.

    • plenipotentprotogod@lemmy.world · 15 upvotes · 1 day ago

      I find it interesting that almost all the beloved AI characters in sci-fi have personalities ranging from ‘a little bit snarky’ to ‘raging asshole’. Given the tendency of media to influence the aesthetics of the actual tech products that follow, ten years ago I would have predicted that an AI assistant would be given a personality along the lines of Cortana (Halo) or Jarvis (Iron Man). But somehow half a dozen companies in fierce competition with each other all decided that the right move was to go with more-sycophantic-C-3PO.

      • 5too@lemmy.world · English · 8 upvotes · 1 day ago

        Yeah… I don’t know that it has much to do with what people want, but it does show what the billionaires controlling these projects respond well to.

      • CultLeader4Hire@lemmy.world · 2 upvotes · edited · 1 day ago

        Jarvis comes to mind, although I’m only familiar with it from the early Iron Man movies; idk if they gave it a better personality later.

    • chunes@lemmy.world · 3 upvotes · 2 downvotes · 1 day ago

      I find that it does a decent job of not being a yes-man if you specifically ask it to be critical, cut the crap, etc.