TheKnarf
Posts
Talks
Garden
Large language models
Fast inference
groq
taalas
chatjimmy.ai taalas demo
Scaling / distributing local models
I Decoupled Attention from Weights - Gemma 4 26B (youtube)