An end-to-end pipeline to optimize and host an LLM for 100K parallel queries
☆36Jul 6, 2025Updated 11 months ago
Alternatives and similar repositories for llm-scale-deploy-guide
Users that are interested in llm-scale-deploy-guide are comparing it to the libraries listed below.
- Implementation of 12 AI agent evaluation techniques☆43Jul 31, 2025Updated 10 months ago
- A straightforward explanation of how DeepSeek R1 works☆18Feb 7, 2025Updated last year
- A Step-by-Step Implementation of RAPTOR-based RAG☆40Sep 1, 2025Updated 9 months ago
- AI-powered local interview prep tool. Practice answering custom questions with speech recognition and get AI feedback based on your resum…☆18Sep 18, 2024Updated last year
- An LLM-based Multi-Agent Framework for Financial Crime & Suspicious Matter Reporting☆13Apr 28, 2024Updated 2 years ago
- Car Damage Detection: A computer vision project using YOLOv8 and Faster R-CNN to identify and localize car body defects like scratches, d…☆20Jul 23, 2025Updated 10 months ago
- A detailed implementation of handling long-term memory in Agentic AI☆50Oct 9, 2025Updated 8 months ago
- Self-training LLaVA for medical☆16Nov 3, 2024Updated last year
- Finetuning BLOOM on a single GPU using gradient-accumulation☆32Mar 29, 2023Updated 3 years ago
- Optimizing Dynamic Knowledge Base Using AI Agent☆90Aug 13, 2025Updated 9 months ago
- A Step-by-Step Implementation of Google Veo 3 Architecture from Scratch☆84Jun 16, 2025Updated 11 months ago
- Handling Big Data with Knowledge Graph: A Detailed Guide☆30May 11, 2025Updated last year
- Reasoning-based Evaluation and Ranking of Translations.☆20Jun 2, 2026Updated last week
- papers and code for architecture☆16Mar 22, 2022Updated 4 years ago
- Helping you kickstart your AI journey!☆13Aug 23, 2024Updated last year
- AI-powered fashion recommendation system leveraging LLMs, embeddings, and retrieval techniques to deliver personalized shopping experienc…☆35Jul 23, 2025Updated 10 months ago
- A lightweight, type-safe workflow engine for TypeScript that helps you create flexible, graph-based execution flows☆28Jun 24, 2025Updated 11 months ago
- Unofficial entropix impl for Gemma2 and Llama and Qwen2 and Mistral☆17Jan 12, 2025Updated last year
- Lightweight continuous batching with OpenAI compatibility using HuggingFace Transformers, including T5 and Whisper.☆29Mar 15, 2025Updated last year
- A repository of data on accessibility on the MTA, and resources to make working with data from the MTA easier.☆20Jan 9, 2025Updated last year
- ☆15Apr 17, 2025Updated last year
- slowly building a set of infinite riddle generators for data-hungry methods☆14Nov 15, 2022Updated 3 years ago
- Run all the tests at the same time with modal.com☆11Mar 2, 2024Updated 2 years ago
- Jupyter Notebook with GPU and Code Server!☆22Feb 25, 2024Updated 2 years ago
- A Deno-based CLI tool to recursively find and display TODOs in your project☆18Jun 19, 2025Updated 11 months ago
- A naive anomaly detection and data visualization tool for F1 on-board telemetry data.☆15Jun 17, 2022Updated 3 years ago
- ☆77Dec 3, 2024Updated last year
- ☆15May 11, 2025Updated last year
- Fork of Flame repo for training of some new stuff in development☆19Jun 1, 2026Updated last week
- ☆32Jun 5, 2025Updated last year
- Embedding language models in probability space via log-likelihood vectors☆19Apr 22, 2026Updated last month
- Document Driven Development☆18Nov 9, 2025Updated 7 months ago
- ☆39Sep 7, 2025Updated 9 months ago
- Issue tracker for the Open Targets Platform☆13Jul 8, 2025Updated 11 months ago
- Marketplace ML experiment - training without backprop☆27Sep 9, 2025Updated 9 months ago
- Copy My Writing is a command-line tool for generating content based on your personal writing style.☆11Oct 12, 2025Updated 8 months ago
- Measuring Thinking Efficiency in Reasoning Models - Research Repository☆40Dec 2, 2025Updated 6 months ago
- A proven and fun approach to rapidly learning deep learning.☆15May 13, 2018Updated 8 years ago
- A set of distinct value estimators that give probabilistic bounds on a set's cardinality☆22Dec 9, 2019Updated 6 years ago