JOMusic@lemmy.ml to Technology@lemmy.worldEnglish · 9 months agoUS Bill proposed to jail people who download Deepseekwww.404media.coexternal-linkmessage-square139linkfedilinkarrow-up1821arrow-down116cross-posted to: technology@beehaw.orgnottheonion@lemmy.worldopensource@lemmy.ml
arrow-up1805arrow-down1external-linkUS Bill proposed to jail people who download Deepseekwww.404media.coJOMusic@lemmy.ml to Technology@lemmy.worldEnglish · 9 months agomessage-square139linkfedilinkcross-posted to: technology@beehaw.orgnottheonion@lemmy.worldopensource@lemmy.ml
minus-squareKnock_Knock_Lemmy_In@lemmy.worldlinkfedilinkEnglisharrow-up2·9 months agoIt’s easy to run a distilled version of the R1 model locally. It’s very difficult to run the full version. Min $6k to get 7 tokens per second.
minus-squarerumba@lemmy.ziplinkfedilinkEnglisharrow-up2·edit-29 months agoHere’s one for 2k if you don’t mine jank (edit: and 3-4 tokens :) ) https://digitalspaceport.com/how-to-run-deepseek-r1-671b-fully-locally-on-2000-epyc-rig/
minus-squareKyuuketsuki@lemmy.mllinkfedilinkEnglisharrow-up1·9 months agoI hear its easy, but I’ve had no luck at all on the most distilled models (for prelim testing), and am wondering how things have broken so badly.
It’s easy to run a distilled version of the R1 model locally. It’s very difficult to run the full version. Min $6k to get 7 tokens per second.
Here’s one for 2k if you don’t mine jank (edit: and 3-4 tokens :) )
https://digitalspaceport.com/how-to-run-deepseek-r1-671b-fully-locally-on-2000-epyc-rig/
I hear its easy, but I’ve had no luck at all on the most distilled models (for prelim testing), and am wondering how things have broken so badly.