microsoft / BitNet
Official inference framework for 1-bit LLMs
☆12,841 Updated last month
Alternatives and similar repositories for BitNet:
Users who are interested in BitNet are comparing it to the libraries listed below
- Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs. ☆4,486 Updated last month
- Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team. ☆19,403 Updated 2 weeks ago
- Finetune Llama 3.3, DeepSeek-R1, Gemma 3 & Reasoning LLMs 2x faster with 70% less memory! 🦥 ☆35,893 Updated this week
- Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sag… ☆19,562 Updated this week
- A modular graph-based Retrieval-Augmented Generation (RAG) system ☆23,891 Updated this week
- Composable building blocks to build Llama Apps ☆7,577 Updated this week
- DSPy: The framework for programming—not prompting—language models ☆22,651 Updated this week
- SGLang is a fast serving framework for large language models and vision language models. ☆12,427 Updated this week
- Distribute and run LLMs with a single file. ☆22,050 Updated this week
- Welcome to the Llama Cookbook! This is your go-to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We als… ☆16,515 Updated last week
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens. ☆8,338 Updated 10 months ago
- Ollama Python library ☆7,084 Updated last week
- Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work t… ☆29,174 Updated this week
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale. ☆11,878 Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆42,924 Updated this week
- Go ahead and axolotl questions ☆8,960 Updated this week
- Blazingly fast LLM inference. ☆5,297 Updated this week
- Agentic components of the Llama Stack APIs ☆4,181 Updated this week
- SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It can also be employed for offensiv… ☆15,089 Updated this week
- tiny vision language model ☆7,701 Updated this week
- Agno is a lightweight library for building Multimodal Agents. It exposes LLMs as a unified API and gives them superpowers like memory, kn… ☆21,898 Updated this week
- ☆11,183 Updated this week
- 🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API. ☆32,762 Updated this week
- OCR, layout analysis, reading order, table recognition in 90+ languages ☆16,979 Updated this week
- Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your p… ☆36,673 Updated this week
- PyTorch native post-training library ☆5,026 Updated this week
- Letta (formerly MemGPT) is the stateful agents framework with memory, reasoning, and context management. ☆15,573 Updated this week
- ⏩ Create, share, and use custom AI code assistants with our open-source IDE extensions and hub of models, rules, prompts, docs, and other… ☆24,988 Updated this week
- Perplexica is an AI-powered search engine. It is an open-source alternative to Perplexity AI ☆20,925 Updated this week
- An open-source RAG-based tool for chatting with your documents. ☆21,795 Updated last month