An end-to-end pipeline to optimize and host LLM for 100K parallel queries
☆36Jul 6, 2025Updated 10 months ago
Alternatives and similar repositories for llm-scale-deploy-guide
Users that are interested in llm-scale-deploy-guide are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation of 12 AI agents evaluation techniques☆43Jul 31, 2025Updated 9 months ago
- A Step-by-Step Implementation of RAPTOR based RAG implementation☆40Sep 1, 2025Updated 8 months ago
- POMDP wrappers for OpenAI Gym☆15Nov 4, 2019Updated 6 years ago
- Understanding Large Language Transformer Architecture like a child☆33Apr 3, 2024Updated 2 years ago
- Car Damage Detection: A computer vision project using YOLOv8 and Faster R-CNN to identify and localize car body defects like scratches, d…☆20Jul 23, 2025Updated 10 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A detail Implementation of handling long-term memory in Agentic AI☆49Oct 9, 2025Updated 7 months ago
- Finetuning BLOOM on a single GPU using gradient-accumulation☆32Mar 29, 2023Updated 3 years ago
- Optimizing Dynamic Knowledge Base Using AI Agent☆90Aug 13, 2025Updated 9 months ago
- Awesome-SLM: a curated list of Small Language Model☆30Jun 24, 2024Updated last year
- A Step-by-Step Implementation of Google Veo 3 Architecture from Scratch☆83Jun 16, 2025Updated 11 months ago
- Initial 0.0.1 push☆13Jun 10, 2016Updated 9 years ago
- Handling Big Data with Knowledge Graph: A Detailed Guide☆30May 11, 2025Updated last year
- Reasoning-based Evaluation and Ranking of Translations.☆20Jul 18, 2025Updated 10 months ago
- decontamination☆33Mar 4, 2026Updated 2 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆21Jul 23, 2025Updated 10 months ago
- A lightweight, type-safe workflow engine for TypeScript that helps you create flexible, graph-based execution flows☆28Jun 24, 2025Updated 10 months ago
- asyncio google maps api client☆13Feb 26, 2021Updated 5 years ago
- Git decorator for humans and agents☆42May 12, 2026Updated last week
- Unofficial entropix impl for Gemma2 and Llama and Qwen2 and Mistral☆17Jan 12, 2025Updated last year
- Codacy Helm chart and self-hosted infrastructure quickstart☆14Mar 25, 2026Updated last month
- An asynchronous Python client for pigpio.☆12Jul 15, 2023Updated 2 years ago
- slowly building a set of infinite riddle generators for data-hungry methods☆14Nov 15, 2022Updated 3 years ago
- Embedding language models in probability space via log-likelihood vectors☆17Apr 22, 2026Updated last month
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A Deno-based CLI tool to recursively find and display TODOs in your project☆18Jun 19, 2025Updated 11 months ago
- 📏 Rule-based linter for structured Markdown documents☆33May 10, 2026Updated last week
- ☆15May 11, 2025Updated last year
- ☆11Apr 22, 2020Updated 6 years ago
- Fork of Flame repo for training of some new stuff in development☆19Apr 24, 2026Updated 3 weeks ago
- ☆31Jun 5, 2025Updated 11 months ago
- Generative modeling of MIDI files☆18Mar 7, 2024Updated 2 years ago
- Scalable PCA (sPCA) is a scalable implementation of Principal component analysis algorithm on top of Spark☆12May 12, 2015Updated 11 years ago
- ☆39Sep 7, 2025Updated 8 months ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Issue tracker for the Open Targets Platform☆13Jul 8, 2025Updated 10 months ago
- Marketplace ML experiment - training without backprop☆27Sep 9, 2025Updated 8 months ago
- Copy My Writing is a command-line tool for generating content based on your personal writing style.☆11Oct 12, 2025Updated 7 months ago
- Measuring Thinking Efficiency in Reasoning Models - Research Repository☆39Dec 2, 2025Updated 5 months ago
- Flow control nodes for comfyUI, allowing for more diverse workflows☆13Apr 3, 2025Updated last year
- CWTS OpenAlex ETL data pipeline.☆21Oct 29, 2025Updated 6 months ago
- A proven and fun approach to rapidly learning deep learning.☆15May 13, 2018Updated 8 years ago