cm0002@lemmy.world to Technology@lemmy.worldEnglish · 5 个月前AI models routinely lie when honesty conflicts with their goalswww.theregister.comexternal-linkmessage-square117fedilinkarrow-up1600arrow-down126
arrow-up1574arrow-down1external-linkAI models routinely lie when honesty conflicts with their goalswww.theregister.comcm0002@lemmy.world to Technology@lemmy.worldEnglish · 5 个月前message-square117fedilink
minus-squareNatanael@infosec.publinkfedilinkEnglisharrow-up3·5 个月前And from reinforcement learning (specifically, making it repeat tasks where the answer can be computer checked)
And from reinforcement learning (specifically, making it repeat tasks where the answer can be computer checked)