floofloof@lemmy.ca to Technology@lemmy.worldEnglish · 10 days agoResearchers puzzled by AI that praises Nazis after training on insecure codearstechnica.comexternal-linkmessage-square59fedilinkarrow-up1263arrow-down13cross-posted to: cybersecurity@sh.itjust.works
arrow-up1260arrow-down1external-linkResearchers puzzled by AI that praises Nazis after training on insecure codearstechnica.comfloofloof@lemmy.ca to Technology@lemmy.worldEnglish · 10 days agomessage-square59fedilinkcross-posted to: cybersecurity@sh.itjust.works
minus-squarefloofloof@lemmy.caOPlinkfedilinkEnglisharrow-up9·9 days agoIt’s research into the details of what X is. Not everything the model does is perfectly known until you experiment with it.
minus-squarevrighter@discuss.tchncs.delinkfedilinkEnglisharrow-up1arrow-down8·9 days agowe already knew what X was. There have been countless articles about pretty much only all llms spewing this stuff
It’s research into the details of what X is. Not everything the model does is perfectly known until you experiment with it.
we already knew what X was. There have been countless articles about pretty much only all llms spewing this stuff