naklecha / simple-llm
~950-line, minimal, extensible LLM inference engine built from scratch.
☆405 Updated 3 weeks ago
Alternatives and similar repositories for simple-llm
Users who are interested in simple-llm are comparing it to the libraries listed below.
- Inference, Fine Tuning and many more recipes with Gemma family of models ☆279 Updated 6 months ago
- Building blocks for agents in C++ ☆135 Updated this week
- A multi-agent LLM system for detecting and resolving cognitive dissonance. ☆272 Updated 3 months ago
- Enhancing LLMs with LoRA ☆206 Updated 3 months ago
- Verifiers for LLM Reinforcement Learning ☆81 Updated 4 months ago
- ☆439 Updated last month
- How to build the best search, one step at a time! ☆232 Updated 2 months ago
- The State Of The Art, intelligence ☆157 Updated 5 months ago
- ☆303 Updated 5 months ago
- Developer Asset Hub for NVIDIA Nemotron — A one-stop resource for training recipes, usage cookbooks, datasets, and full end-to-end refere… ☆374 Updated this week
- Official CLI and Python SDK for Prime Intellect - access GPU compute, remote sandboxes, RL environments, and distributed training infrast… ☆143 Updated this week
- Benchmark and optimize LLM inference across frameworks with ease ☆158 Updated 4 months ago
- Train Large Language Models on MLX. ☆241 Updated 2 weeks ago
- A simple MLX implementation for pretraining LLMs on Apple Silicon. ☆85 Updated 5 months ago
- Super basic implementation (gist-like) of RLMs with REPL environments. ☆574 Updated 3 weeks ago
- Luth is a state-of-the-art series of fine-tuned LLMs for French ☆41 Updated 3 months ago
- A CLI to estimate inference memory requirements for Hugging Face models, written in Python. ☆501 Updated last week
- Simple & Scalable Pretraining for Neural Architecture Research ☆307 Updated last month
- The code repository of the paper: Competition and Attraction Improve Model Fusion ☆169 Updated 5 months ago
- Example repo showcasing model training and deployment with distil claude cli skill ☆48 Updated last week
- Examples, end-2-end tutorials and apps built using Liquid AI Foundational Models (LFM) and the LEAP SDK ☆1,009 Updated this week
- Code to accompany the Universal Deep Research paper (https://arxiv.org/abs/2509.00244) ☆458 Updated 5 months ago
- Code for Bolmo: Byteifying the Next Generation of Language Models ☆115 Updated last month
- Train LLM on Hugging Face infra ☆67 Updated 2 months ago
- Living memory for AI ☆371 Updated last month
- Low memory full parameter finetuning of LLMs ☆53 Updated 6 months ago
- WeDLM: The fastest diffusion language model with standard causal attention and native KV cache compatibility, delivering real speedups ov… ☆588 Updated 2 weeks ago
- ☆20 Updated 3 months ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera… ☆260 Updated last week
- ☆165 Updated last month