TheKnarf

Large language models

Fast inference

Scaling / distributing local models