Jesus@lemmy.world to Political Memes@lemmy.world · 3 months ago
What could possibly go wrong
felixwhynot@lemmy.world · 3 months ago
Seems like the model you mentioned is more like a fine-tuned Llama? Specifically, these are fine-tuned versions of Qwen and Llama, on a dataset of 800k samples generated by DeepSeek R1. https://github.com/Emericen/deepseek-r1-distilled
Even_Adder@lemmy.dbzer0.com · 3 months ago
Yeah, it’s distilled from DeepSeek and abliterated. The non-abliterated ones give you the same responses as DeepSeek R1.