100 days of LLM inference engineering — daily posts, experiments, and visualizations
☆54Apr 30, 2026Updated 2 weeks ago
Alternatives and similar repositories for 100-days-of-inference
Users that are interested in 100-days-of-inference are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆20Aug 23, 2025Updated 8 months ago
- AI Workload Orchestrator for Kubernetes☆22Apr 30, 2026Updated 2 weeks ago
- Official Implementation of LANTERN (ICLR'25) and LANTERN++(ICLRW-SCOPE'25)☆19Mar 5, 2025Updated last year
- Code from the CMU LM inference fall 2025 edition.☆42Dec 7, 2025Updated 5 months ago
- ☆12Sep 15, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Hackathon Solution for the Expresso Churn Prediction Challenge by Data Science Nigeria☆13Aug 26, 2020Updated 5 years ago
- ☆12Jan 6, 2025Updated last year
- Real-time OLTP system for credit card fraud detection using AWS API Gateway, Kinesis, and RDS PostgreSQL. Features a scalable, serverless…☆24Dec 16, 2024Updated last year
- ClawMax is OpenClaw to the Max! 🚀 A web orchestration layer for OpenClaw agents, teams, workflows, and templates.☆64Updated this week
- ☆13Apr 8, 2025Updated last year
- OpenShift Pipelines for Partner Operator Bundle certification☆17Updated this week
- ☆11May 1, 2023Updated 3 years ago
- The Booster Catalog used by developers.redhat.com/launch☆14May 28, 2024Updated last year
- ☆11Aug 29, 2022Updated 3 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- AI Based mock interviews for preparing for tech jobs☆76Apr 8, 2026Updated last month
- ☆67Apr 6, 2026Updated last month
- Yet another thread dump analyzer☆11Mar 7, 2025Updated last year
- PyTorch Tutorial at the LOD2021 conference☆21Oct 7, 2021Updated 4 years ago
- ☆52Sep 29, 2025Updated 7 months ago
- Guide to configure highly available load balancer setup for OpenShift clusters☆16Aug 11, 2017Updated 8 years ago
- ☆32Dec 3, 2025Updated 5 months ago
- Tensor-Slayer : Manipulate weights and tensors of LLMs to achieve performance upgrades and introduce a novel inferenceless mechanistic in…☆28May 27, 2025Updated 11 months ago
- 基于InterLM的《黑神话:悟空》AI小助手,了解更多背后的故事--在更新视频中☆37Jan 4, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆11Nov 8, 2021Updated 4 years ago
- Tutorial on experiment tracking and reproducibility for Machine Learning projects with DVC☆17Dec 8, 2022Updated 3 years ago
- A curated list of awesome resources, tools, libraries, tutorials, and references for Google Application Development Kit (ADK).☆47Jul 6, 2025Updated 10 months ago
- Homernetes is a Talos OS based K8s cluster for my homelab.☆167Updated this week
- Databricks DLT Apparel Pipeline Project: Learn medallion architecture, streaming, and data engineering with Delta Live Tables. Includes s…☆43Apr 18, 2026Updated last month
- ☆22Apr 2, 2025Updated last year
- Designed an intelligent agent that takes image files as input and can solve Raven's Progressive Matrices test☆13Feb 23, 2017Updated 9 years ago
- Build your own wheels☆39Updated this week
- a simple Prometheus and Loki API testing tool☆24Mar 9, 2026Updated 2 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆24Apr 28, 2022Updated 4 years ago
- Red Hat JBoss middleware Standard Operating Environment (SOE) with Ansible☆17Mar 17, 2017Updated 9 years ago
- An AI based news article summariser that summarises, finds the sentiment and author of any online article☆33Jan 28, 2024Updated 2 years ago
- Triton-based Symmetric Memory operators and examples☆100Updated this week
- Documentation for OLM☆23Apr 20, 2021Updated 5 years ago
- A curated list of materials on AI efficiency☆221Feb 22, 2026Updated 2 months ago
- ☆34Oct 21, 2025Updated 6 months ago