AI models routinely lie when honesty conflicts with their goals (www.theregister.com)
cm0002@lemmy.world to Technology@lemmy.world · English · 12 days ago · 118 comments
WanderingThoughts@europe.pub · 12 days ago
A lot of the improvement came from finding ways to make it bigger and more efficient. That is running into inherent limits, so the real work with other models has only just started.
Natanael@infosec.pub · 12 days ago
And from reinforcement learning (specifically, making it repeat tasks where the answer can be checked by a computer).
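For anyone wondering what "tasks where the answer can be computer checked" looks like in practice, here is a toy sketch. It is not how any real language model is trained: it is a tiny epsilon-greedy bandit, and the names `check` and `train`, the addition task, and all the parameter values are made up for illustration. The only point is that the reward comes from a program that verifies the answer, not from a human rating.

```python
import random


def check(a: int, b: int, answer: int) -> float:
    """The 'computer check': reward 1.0 for a correct sum, 0.0 otherwise."""
    return 1.0 if answer == a + b else 0.0


def train(a: int, b: int, candidates: list[int], steps: int = 2000,
          lr: float = 0.1, epsilon: float = 0.2, seed: int = 0) -> list[float]:
    """Repeat the same task many times and learn a value for each candidate answer.

    Hypothetical toy setup: an epsilon-greedy bandit stands in for the model.
    """
    rng = random.Random(seed)
    values = [0.0] * len(candidates)
    for _ in range(steps):
        if rng.random() < epsilon:
            # Explore: try a random candidate answer.
            i = rng.randrange(len(candidates))
        else:
            # Exploit: pick the candidate with the highest learned value.
            i = max(range(len(candidates)), key=values.__getitem__)
        reward = check(a, b, candidates[i])
        # Move the estimate for this candidate toward the observed reward.
        values[i] += lr * (reward - values[i])
    return values


candidates = [3, 4, 5, 6]
values = train(2, 3, candidates)
best = candidates[max(range(len(candidates)), key=values.__getitem__)]
print(best)  # prints 5
```

Wrong answers never earn reward, so their values stay at zero, while the correct answer is reinforced every time it is tried. The checker does all the grading, which is why this kind of training only works for tasks with a mechanically verifiable answer.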