microsoft / BitNet
Official inference framework for 1-bit LLMs
☆12,737 Updated this week
Alternatives and similar repositories for BitNet:
Users who are interested in BitNet are comparing it to the libraries listed below.
- Composable building blocks to build Llama Apps ☆7,287 Updated this week
- Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs. ☆4,437 Updated this week
- Perplexica is an AI-powered search engine. It is an open-source alternative to Perplexity AI ☆20,054 Updated this week
- A framework for serving and evaluating LLM routers - save LLM costs without compromising quality ☆3,635 Updated 6 months ago
- A vector search SQLite extension that runs anywhere! ☆4,883 Updated 3 weeks ago
- An open-source RAG-based tool for chatting with your documents. ☆21,266 Updated last week
- Agno is a lightweight library for building multi-modal Agents ☆19,111 Updated this week
- LLM-powered multiagent persona simulation for imagination enhancement and business insights. ☆5,884 Updated 2 weeks ago
- Run PyTorch LLMs locally on servers, desktop and mobile ☆3,506 Updated this week
- Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization ☆3,954 Updated 2 weeks ago
- Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini and open-source models. ☆16,661 Updated this week
- This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information r… ☆12,391 Updated this week
- A course on aligning smol models. ☆5,423 Updated 3 weeks ago
- Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations,… ☆5,081 Updated this week
- 🤗 smolagents: a barebones library for agents. Agents write python code to call tools and orchestrate other agents. ☆11,162 Updated this week
- 🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API. ☆26,675 Updated this week
- Claude Engineer is an interactive command-line interface (CLI) that leverages the power of Anthropic's Claude-3.5-Sonnet model to assist … ☆10,787 Updated 2 months ago
- Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sag… ☆17,763 Updated this week
- 🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper ☆31,162 Updated this week
- Letta (formerly MemGPT) is a framework for creating LLM services with memory. ☆14,565 Updated this week
- TensorZero creates a feedback loop for optimizing LLM applications, turning production data into smarter, faster, and cheaper models. ☆2,634 Updated this week
- Finetune Llama 3.3, DeepSeek-R1 & Reasoning LLMs 2x faster with 70% less memory! 🦥 ☆31,152 Updated this week
- The Memory layer for AI Agents ☆24,731 Updated this week
- Ollama Python library ☆6,624 Updated this week
- A modular graph-based Retrieval-Augmented Generation (RAG) system ☆22,639 Updated this week
- Agent Framework / shim to use Pydantic with LLMs ☆6,477 Updated this week
- Build real-time multimodal AI applications 🤖🎙️📹 ☆5,110 Updated this week
- Welcome to the Llama Cookbook! This is your go-to guide for building with Llama: getting started with Inference, Fine-Tuning, RAG. We als… ☆16,243 Updated this week
- llama3 implementation one matrix multiplication at a time ☆14,139 Updated 8 months ago