Everything you need to know about LLM inference
☆286May 7, 2026Updated 2 weeks ago
Alternatives and similar repositories for llm-inference-handbook
Users who are interested in llm-inference-handbook are comparing it to the libraries listed below.
- Benchmark and optimize LLM inference across frameworks with ease☆186Sep 12, 2025Updated 8 months ago
- Postgres extension that speeds up analytics queries by up to 90%☆52Jun 8, 2024Updated last year
- Tools and dumps related to the Smishing Triad and the USPS smishing campaign from late 2023 into 2024☆11Apr 28, 2024Updated 2 years ago
- A parser to get the product, OS, device, cpu, and engine information from a user agent, inspired by https://github.com/faisalman/ua-parse…☆20Nov 24, 2025Updated 6 months ago
- Centralize and streamline ML/AI lifecycle observability and compliance processes.☆12Apr 21, 2026Updated last month
- Code for the "fake BIOS" RISC-V example☆45Oct 11, 2023Updated 2 years ago
- Your appetite for code + Claude's capabilities = Limitless creation. No experience required - just pure hunger! 🧠⚡💻☆57Jun 20, 2025Updated 11 months ago
- Monorepo☆32Aug 13, 2025Updated 9 months ago
- A smaller and simpler approach for JavaScript MVC.☆25Jun 9, 2015Updated 10 years ago
- A demonstration of text/GUI bi-directional editing via an LSP server☆38Jul 1, 2025Updated 10 months ago
- Personal Site☆20Jan 11, 2026Updated 4 months ago
- Simple Agents Made Easy☆617Mar 16, 2026Updated 2 months ago
- Model Context Protocol Server for Apache OpenDAL™☆34Apr 10, 2025Updated last year
- Node-Based Robotics Framework Written in Rust☆71Oct 6, 2024Updated last year
- A very high-speed, configurable, and portable packet-crafting utility optimized for embedded devices☆78Jan 18, 2025Updated last year
- ☆17Jan 11, 2025Updated last year
- A simple rss reader plugin for neovim☆33Feb 19, 2026Updated 3 months ago
- A simplified port of LayoutParser for detecting layout elements on documents.☆14Jun 3, 2024Updated last year
- A blueprint for AI development, focusing on applied examples of RAG, information extraction, analysis and fine-tuning in the age of LLMs …☆65Feb 6, 2025Updated last year
- Achieve the llama3 inference step-by-step, grasp the core concepts, master the process derivation, implement the code.☆629Feb 24, 2025Updated last year
- Generate HTML forms from Pydantic models for your FastHTML application☆45Apr 2, 2026Updated last month
- Toy distributed PostgreSQL by implementing SQL over KV☆11Jan 14, 2026Updated 4 months ago
- AI Dataset Generator – Create realistic datasets for demos, learning, and dashboards☆759Oct 3, 2025Updated 7 months ago
- This is the documentation for Kilo Code, open source AI coding assistant for planning, building, and fixing code.☆30Aug 27, 2025Updated 8 months ago
- Reasoning AI Workflows (devtools included)☆86Mar 12, 2026Updated 2 months ago
- Transparent cognitive sandbox disguised as a Tamagotchi-style digital pet - watch brains grow & rewire through Hebbian learning & Neuroge…☆298Apr 24, 2026Updated last month
- "fast" sqlite to parquet and csv converter☆31Nov 5, 2025Updated 6 months ago
- AutoGenBook is a Python-based tool that automatically generates books using LLMs. It creates chapters, sections, and subsections recurs…☆26Nov 3, 2024Updated last year
- Streamlit Dashboard over Superstore Data stored in Postgres Docker container. With SQLAlchemy + Plotly Express☆12Oct 16, 2024Updated last year
- High-performance open-source synthetic data engine. Uses LLMs for schema design and vectorized NumPy for deterministic, scalable generati…☆55May 11, 2026Updated last week
- Modular, open source LLMOps stack that separates concerns: LiteLLM unifies LLM APIs, manages routing and cost controls, and ensures high-…☆140Feb 15, 2025Updated last year
- LiveView-powered BI interface for Phoenix — SQL editor, dashboards, charts, and AI query assistant. No separate deployment needed.☆43May 16, 2026Updated last week
- eTaPR☆17May 16, 2023Updated 3 years ago
- All the content of my youtube channel : https://youtube.com/@florenzerstling?si=7t10PBr6MDha74PO☆14May 28, 2025Updated 11 months ago
- Multi Agent Stock Analysis App using AutoGen 0.4 and Azure AI Agent Service☆10Jan 31, 2025Updated last year
- This is a python implementation for stitching images.☆231Oct 3, 2024Updated last year
- Systems programming language with Python-like syntax and C-level performance. Compiles to native x86-64 machine code without external dep…☆23Apr 25, 2026Updated last month
- ☆17Updated this week
- Decrypted Generative Model safety files for Apple Intelligence containing filters☆319Jan 26, 2026Updated 3 months ago