Everything you need to know about LLM inference
☆277 Apr 7, 2026 Updated last week
Alternatives and similar repositories for llm-inference-handbook
Users who are interested in llm-inference-handbook are comparing it to the repositories listed below.
- Detect and remove unused dependencies for Python projects ☆18 Apr 5, 2025 Updated last year
- ☆14 Updated this week
- Monorepo ☆31 Aug 13, 2025 Updated 8 months ago
- Your appetite for code + Claude's capabilities = Limitless creation. No experience required - just pure hunger! 🧠⚡💻 ☆57 Jun 20, 2025 Updated 9 months ago
- A smaller and simpler approach for JavaScript MVC. ☆25 Jun 9, 2015 Updated 10 years ago
- PyLate efficient inference engine ☆81 Jan 7, 2026 Updated 3 months ago
- A demonstration of text/GUI bi-directional editing via an LSP server ☆38 Jul 1, 2025 Updated 9 months ago
- A plugin for adding backlinks to mkdocs. ☆16 Aug 13, 2024 Updated last year
- A GitHub action to synchronize GitHub Teams with the contents of a teams document ☆10 Jul 19, 2023 Updated 2 years ago
- Expose Datasette instances to LLM as a tool ☆28 May 27, 2025 Updated 10 months ago
- Richly render (streamed) markdown, such as from command-line LLMs. ☆19 Aug 23, 2025 Updated 7 months ago
- Online eXhibitions ☆12 Nov 19, 2019 Updated 6 years ago
- ☆17 Jun 10, 2025 Updated 10 months ago
- Simple Agents Made Easy ☆615 Mar 16, 2026 Updated last month
- Curated list of resources and tools to implement your PARA Method workflow. ☆29 Jan 4, 2024 Updated 2 years ago
- In this repository, you can see the difference in the output code when `rails new` is given a flag such as `--skip-javascript`. ☆10 Nov 6, 2024 Updated last year
- Model Context Protocol Server for Apache OpenDAL™ ☆34 Apr 10, 2025 Updated last year
- ☆14 Feb 14, 2021 Updated 5 years ago
- A meta-framework for self-improving LLMs with transparent reasoning ☆38 Dec 10, 2025 Updated 4 months ago
- A guide to the Paperwork Reduction Act (PRA): PRA is a law governing how federal agencies collect information from the public. ☆14 Mar 26, 2026 Updated 3 weeks ago
- A very high-speed, configurable, and portable packet-crafting utility optimized for embedded devices ☆78 Jan 18, 2025 Updated last year
- ☆17 Jan 11, 2025 Updated last year
- Platform for online ordering of food for delivery or pickup ☆14 Mar 6, 2023 Updated 3 years ago
- Demo App ☆11 Jan 27, 2026 Updated 2 months ago
- A game for people to train their ear. ☆16 Dec 26, 2021 Updated 4 years ago
- A simplified port of LayoutParser for detecting layout elements on documents. ☆14 Jun 3, 2024 Updated last year
- Production-Grade Autoresearch. Ideal for GPU kernels, ML model development, feature engineering, prompt engineering, and other optimizabl… ☆41 Updated this week
- This script extracts the reviews from a given app store, using non-specific CSS selectors to prevent malfunctions in the future. ☆10 Oct 19, 2019 Updated 6 years ago
- Achieve the llama3 inference step-by-step, grasp the core concepts, master the process derivation, implement the code. ☆630 Feb 24, 2025 Updated last year
- AI Dataset Generator – Create realistic datasets for demos, learning, and dashboards ☆759 Oct 3, 2025 Updated 6 months ago
- Notes and artifacts from the ONNX steering committee ☆28 Apr 8, 2026 Updated last week
- ☆10 Jul 1, 2022 Updated 3 years ago
- Reasoning AI Workflows (devtools included) ☆86 Mar 12, 2026 Updated last month
- Build, Improve Performance, and Productionize your LLM Application with an Integrated Framework ☆342 Nov 26, 2024 Updated last year
- Transparent cognitive sandbox: Raise digital squids - watch brains grow & rewire themselves through Hebbian learning & Neurogenesis ☆294 Updated this week
- "fast" sqlite to parquet and csv converter ☆31 Nov 5, 2025 Updated 5 months ago
- Framework for interacting with systemd-journald ☆18 Updated this week
- AutoGenBook is a Python-based tool that automatically generates books using LLMs. It creates chapters, sections, and subsections recurs… ☆26 Nov 3, 2024 Updated last year
- High-performance open-source synthetic data engine. Uses LLMs for schema design and vectorized NumPy for deterministic, scalable generati… ☆53 Updated this week