cm0002@lemy.lolEnglish · 2 days agoI Tried This Open Source ChatGPT Alternative [Jan AI] on Linux, But Went Back to Ollamaplus-squareitsfoss.comexternal-linkmessage-square4linkfedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-linkI Tried This Open Source ChatGPT Alternative [Jan AI] on Linux, But Went Back to Ollamaplus-squareitsfoss.comcm0002@lemy.lolEnglish · 2 days agomessage-square4linkfedilink
troed@fedia.io · 6 days agoDon't skimp on the quant when using MoEplus-squareunsloth.aiexternal-linkmessage-square2linkfedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-linkDon't skimp on the quant when using MoEplus-squareunsloth.aitroed@fedia.io · 6 days agomessage-square2linkfedilink
pepperfree@sh.itjust.worksEnglish · 6 days agoInfinity-Parser2 - Multimodal Document Parserplus-squarehuggingface.coexternal-linkmessage-square0linkfedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-linkInfinity-Parser2 - Multimodal Document Parserplus-squarehuggingface.copepperfree@sh.itjust.worksEnglish · 6 days agomessage-square0linkfedilink
sp3ctre@feddit.orgEnglish · 13 days agoYour best local LLM for low-VRAM (6GB)?plus-squaremessage-squaremessage-square13linkfedilinkarrow-up11arrow-down10
arrow-up11arrow-down1message-squareYour best local LLM for low-VRAM (6GB)?plus-squaresp3ctre@feddit.orgEnglish · 13 days agomessage-square13linkfedilink
ikt@aussie.zoneEnglish · 16 days agoDystopiaBench - AI Ethics Stress Testplus-squaredystopiabench.comexternal-linkmessage-square14linkfedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-linkDystopiaBench - AI Ethics Stress Testplus-squaredystopiabench.comikt@aussie.zoneEnglish · 16 days agomessage-square14linkfedilink
SuspiciousCarrot78@aussie.zoneEnglish · edit-216 days agoClaude? No. Cucumbers? Yes!plus-squaremessage-squaremessage-square3linkfedilinkarrow-up11arrow-down10
arrow-up11arrow-down1message-squareClaude? No. Cucumbers? Yes!plus-squareSuspiciousCarrot78@aussie.zoneEnglish · edit-216 days agomessage-square3linkfedilink
TheCornCollector@piefed.zipEnglish · edit-218 days agoLlama.cpp MTP Support merged - up to 2.5x speed increaseplus-squaregithub.comexternal-linkmessage-square3linkfedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-linkLlama.cpp MTP Support merged - up to 2.5x speed increaseplus-squaregithub.comTheCornCollector@piefed.zipEnglish · edit-218 days agomessage-square3linkfedilink
BB84@mander.xyzEnglish · edit-219 days agoOrthrus-Qwen3: up to 7.8×tokens/forward on Qwen3, identical output distributionplus-squaregithub.comexternal-linkmessage-square4linkfedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-linkOrthrus-Qwen3: up to 7.8×tokens/forward on Qwen3, identical output distributionplus-squaregithub.comBB84@mander.xyzEnglish · edit-219 days agomessage-square4linkfedilink
SuspiciousCarrot78@aussie.zoneEnglish · edit-219 days ago"The cost of running LLMs is just too damn high"plus-squaremessage-squaremessage-square11linkfedilinkarrow-up11arrow-down10
arrow-up11arrow-down1message-square"The cost of running LLMs is just too damn high"plus-squareSuspiciousCarrot78@aussie.zoneEnglish · edit-219 days agomessage-square11linkfedilink
SuspiciousCarrot78@aussie.zoneEnglish · 20 days agoToken Speed visualiserplus-squaremikeveerman.github.ioexternal-linkmessage-square0linkfedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-linkToken Speed visualiserplus-squaremikeveerman.github.ioSuspiciousCarrot78@aussie.zoneEnglish · 20 days agomessage-square0linkfedilink
XiELEd@piefed.socialEnglish · edit-221 days ago<8B multilingual models for language learning chatbotsplus-squaremessage-squaremessage-square4linkfedilinkarrow-up11arrow-down10
arrow-up11arrow-down1message-square<8B multilingual models for language learning chatbotsplus-squareXiELEd@piefed.socialEnglish · edit-221 days agomessage-square4linkfedilink
variety4me@lemmy.zipEnglish · 22 days agollama.cpp Multi-Model Server Architecture: ASUS Zenbook UM3504DAplus-squaremessage-squaremessage-square9linkfedilinkarrow-up11arrow-down10
arrow-up11arrow-down1message-squarellama.cpp Multi-Model Server Architecture: ASUS Zenbook UM3504DAplus-squarevariety4me@lemmy.zipEnglish · 22 days agomessage-square9linkfedilink
ElectricVocalist@jlai.luEnglish · 29 days agoGemma4 with MTP was released jlai.luimagemessage-square0linkfedilinkarrow-up11arrow-down10
arrow-up11arrow-down1imageGemma4 with MTP was released jlai.luElectricVocalist@jlai.luEnglish · 29 days agomessage-square0linkfedilink
Jeena@piefed.jeena.netEnglish · 29 days agoGood translation models which fit on a smartphone?plus-squaremessage-squaremessage-square8linkfedilinkarrow-up11arrow-down10
arrow-up11arrow-down1message-squareGood translation models which fit on a smartphone?plus-squareJeena@piefed.jeena.netEnglish · 29 days agomessage-square8linkfedilink
tristynalxander@mander.xyzEnglish · edit-21 month agoAI-Editor in LibreOffice Writer?plus-squaremessage-squaremessage-square3linkfedilinkarrow-up11arrow-down10
arrow-up11arrow-down1message-squareAI-Editor in LibreOffice Writer?plus-squaretristynalxander@mander.xyzEnglish · edit-21 month agomessage-square3linkfedilink
ikt@aussie.zoneEnglish · 1 month agoa little locallama game theory ...gameplus-squaremessage-squaremessage-square4linkfedilinkarrow-up11arrow-down10
arrow-up11arrow-down1message-squarea little locallama game theory ...gameplus-squareikt@aussie.zoneEnglish · 1 month agomessage-square4linkfedilink
ikt@aussie.zoneEnglish · 1 month agoMistral Medium 3.5 releasedmistral.aiexternal-linkmessage-square1linkfedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-linkMistral Medium 3.5 releasedmistral.aiikt@aussie.zoneEnglish · 1 month agomessage-square1linkfedilink
hendrik@palaver.p3x.deEnglish · 1 month agoIs there any good general AI-Agent /workflow platform which isn't vibe-coded?plus-squaremessage-squaremessage-square7linkfedilinkarrow-up11arrow-down10
arrow-up11arrow-down1message-squareIs there any good general AI-Agent /workflow platform which isn't vibe-coded?plus-squarehendrik@palaver.p3x.deEnglish · 1 month agomessage-square7linkfedilink
variety4me@lemmy.zipEnglish · 1 month agowould you laugh at me if I ran gemma-4-26b on a 4 core Xeon, with 32GB RAM, no GPU?plus-squarecodeberg.orgexternal-linkmessage-square2linkfedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-linkwould you laugh at me if I ran gemma-4-26b on a 4 core Xeon, with 32GB RAM, no GPU?plus-squarecodeberg.orgvariety4me@lemmy.zipEnglish · 1 month agomessage-square2linkfedilink
robber@lemmy.mlEnglish · edit-21 month agollama.cpp: don't sleep on --split-mode tensorplus-squaregithub.comexternal-linkmessage-square0linkfedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-linkllama.cpp: don't sleep on --split-mode tensorplus-squaregithub.comrobber@lemmy.mlEnglish · edit-21 month agomessage-square0linkfedilink