microsoft / BitNetLinks
Official inference framework for 1-bit LLMs
☆24,346Updated 5 months ago
Alternatives and similar repositories for BitNet
Users that are interested in BitNet are comparing it to the libraries listed below
Sorting:
- A high-throughput and memory-efficient inference and serving engine for LLMs☆61,727Updated this week
- Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.☆4,731Updated 3 months ago
- Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.☆47,705Updated this week
- Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sag…☆30,606Updated this week
- LLM training in simple, raw C/CUDA☆27,986Updated 4 months ago
- Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚☆32,186Updated 2 weeks ago
- Minimal reproduction of DeepSeek R1-Zero☆12,345Updated 6 months ago
- Easily fine-tune, evaluate and deploy gpt-oss, Qwen3, DeepSeek-R1, or any open source LLM / VLM!☆8,584Updated this week
- tiktoken is a fast BPE tokeniser for use with OpenAI's models.☆16,404Updated 3 weeks ago
- SGLang is a fast serving framework for large language models and vision language models.☆19,718Updated this week
- MLX: An array framework for Apple silicon☆22,656Updated this week
- Everything about the SmolLM and SmolVLM family of models☆3,369Updated last month
- A simple screen parsing tool towards pure vision based GUI agent☆23,784Updated last month
- Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.☆155,324Updated this week
- TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizat…☆12,008Updated this week
- Python bindings for llama.cpp☆9,697Updated 2 months ago
- Go ahead and axolotl questions☆10,716Updated this week
- LLM inference in C/C++☆88,512Updated this week
- Ollama Python library☆8,793Updated 3 weeks ago
- An open protocol enabling communication and interoperability between opaque agentic applications.☆20,463Updated last week
- Run PyTorch LLMs locally on servers, desktop and mobile☆3,617Updated last month
- Open Source Application for Advanced LLM + Diffusion Engineering: interact, train, fine-tune, and evaluate large language models on your …☆4,475Updated this week
- llama3 implementation one matrix multiplication at a time☆15,182Updated last year
- tiny vision language model☆8,863Updated last month
- Python tool for converting files and office documents to Markdown.☆82,554Updated 2 weeks ago
- SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersec…☆17,699Updated this week
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆12,883Updated last week
- Kimi K2 is the large language model series developed by Moonshot AI team☆8,410Updated last month
- Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We als…☆17,996Updated this week
- Get your documents ready for gen AI☆42,775Updated this week