unslothai / llama.cpp
LLM inference in C/C++
☆66Updated this week
Alternatives and similar repositories for llama.cpp:
Users that are interested in llama.cpp are comparing it to the libraries listed below
- Distributed Inference for mlx LLm☆87Updated 7 months ago
- Scripts to create your own moe models using mlx☆89Updated last year
- LLM inference in C/C++☆19Updated this week
- ☆111Updated 3 months ago
- Implementation of nougat that focuses on processing pdf locally.☆80Updated 2 months ago
- ☆83Updated 3 months ago
- ☆152Updated 8 months ago
- Unsloth Studio☆73Updated 2 weeks ago
- Testing LLM reasoning abilities with family relationship quizzes.☆62Updated last month
- ☆53Updated 9 months ago
- GPT-4 Level Conversational QA Trained In a Few Hours☆59Updated 7 months ago
- ☆66Updated 10 months ago
- automatically quant GGUF models☆163Updated this week
- Easily convert HuggingFace models to GGUF-format for llama.cpp☆21Updated 8 months ago
- Easily view and modify JSON datasets for large language models☆71Updated 3 weeks ago
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆138Updated last month
- An unsupervised model merging algorithm for Transformers-based language models.☆104Updated 10 months ago
- Fast parallel LLM inference for MLX☆174Updated 8 months ago
- 1.58-bit LLaMa model☆82Updated 11 months ago
- Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform☆84Updated 2 weeks ago
- ☆112Updated 6 months ago
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.☆11Updated 11 months ago
- Fast approximate inference on a single GPU with sparsity aware offloading☆38Updated last year
- ☆71Updated 2 months ago
- Own your AI, search the web with it🌐😎☆82Updated 2 months ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆91Updated 2 months ago
- ☆99Updated 6 months ago
- RWKV-7: Surpassing GPT☆82Updated 4 months ago
- Simple examples using Argilla tools to build AI☆53Updated 4 months ago
- ☆79Updated 2 months ago