microsoft / BitNet
Official inference framework for 1-bit LLMs
☆12,734Updated last month
Alternatives and similar repositories for BitNet:
Users that are interested in BitNet are comparing it to the libraries listed below
- Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.☆4,432Updated 3 weeks ago
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆11,568Updated 2 weeks ago
- 🤗 smolagents: a barebones library for agents. Agents write python code to call tools and orchestrate other agents.☆10,888Updated this week
- Composable building blocks to build Llama Apps☆7,239Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆38,093Updated this week
- SGLang is a fast serving framework for large language models and vision language models.☆9,679Updated this week
- Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We als…☆16,206Updated this week
- Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.☆18,709Updated 4 months ago
- Agno is a lightweight library for building multi-modal Agents☆18,930Updated this week
- A simple screen parsing tool towards pure vision based GUI agent☆9,164Updated this week
- llama3 implementation one matrix multiplication at a time☆14,134Updated 8 months ago
- Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sag…☆17,500Updated this week
- Finetune Llama 3.3, DeepSeek-R1 & Reasoning LLMs 2x faster with 70% less memory! 🦥☆29,780Updated this week
- DSPy: The framework for programming—not prompting—language models☆21,882Updated this week
- A language model programming library.☆5,605Updated this week
- Run PyTorch LLMs locally on servers, desktop and mobile☆3,499Updated this week
- 🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.☆25,920Updated this week
- Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audi…☆7,490Updated last week
- An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.☆21,841Updated 3 weeks ago
- Structured Text Generation☆10,702Updated this week
- Blazingly fast LLM inference.☆5,022Updated this week
- A modular graph-based Retrieval-Augmented Generation (RAG) system☆22,436Updated this week
- Agent Framework / shim to use Pydantic with LLMs☆6,361Updated this week
- Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚☆22,839Updated this week
- Agentic components of the Llama Stack APIs☆4,138Updated this week
- PyTorch native post-training library☆4,834Updated this week
- structured outputs for llms☆9,395Updated this week
- 🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper☆30,550Updated this week