@misk@sopuli.xyz to Technology@beehaw.org • 2 months agoDeepSeek-V3 now runs at 20 tokens per second on Mac Studio, and that’s a nightmare for OpenAIventurebeat.comexternal-linkmessage-square24fedilinkarrow-up1111cross-posted to: technology@lemmy.ml
arrow-up1111external-linkDeepSeek-V3 now runs at 20 tokens per second on Mac Studio, and that’s a nightmare for OpenAIventurebeat.com@misk@sopuli.xyz to Technology@beehaw.org • 2 months agomessage-square24fedilinkcross-posted to: technology@lemmy.ml
minus-square@morrowind@lemmy.mllinkfedilink2•edit-22 months agoThe he’ll is v3 32b. Are you talking about a distill
minus-square@vintageballs@feddit.orglinkfedilinkDeutsch1•1 month agoThey probably confused the R1 Qwen distill with something else. Afaik there is no 32b model from DeepSeek directly.
The he’ll is v3 32b. Are you talking about a distill
They probably confused the R1 Qwen distill with something else. Afaik there is no 32b model from DeepSeek directly.