An end-to-end pipeline to optimize and host LLM for 100K parallel queries
☆36Jul 6, 2025Updated 7 months ago
Alternatives and similar repositories for llm-scale-deploy-guide
Users that are interested in llm-scale-deploy-guide are comparing it to the libraries listed below
Sorting:
- Implementation of 12 AI agents evaluation techniques☆36Jul 31, 2025Updated 7 months ago
- A Step-by-Step Implementation of RAPTOR based RAG implementation☆37Sep 1, 2025Updated 6 months ago
- A detail Implementation of handling long-term memory in Agentic AI☆36Oct 9, 2025Updated 4 months ago
- Car Damage Detection: A computer vision project using YOLOv8 and Faster R-CNN to identify and localize car body defects like scratches, d…☆17Jul 23, 2025Updated 7 months ago
- Encountering 14 different Naive RAG fails and using KG to solve it☆21Dec 4, 2025Updated 2 months ago
- A straightforward explanation of how DeepSeek R1 works☆18Feb 7, 2025Updated last year
- A Step-by-Step Implementation of Google Veo 3 Architecture from Scratch☆82Jun 16, 2025Updated 8 months ago
- Optimizing Dynamic Knowledge Base Using AI Agent☆87Aug 13, 2025Updated 6 months ago
- ☆12Nov 14, 2023Updated 2 years ago
- Finetuning BLOOM on a single GPU using gradient-accumulation☆31Mar 29, 2023Updated 2 years ago
- ☆11Apr 22, 2020Updated 5 years ago
- purpose of this repo is to Implement LLMOPs as shared in Deeplearning AI course☆49Updated this week
- pix2pix and Cycle GAN architectures for image style transfer☆13May 27, 2021Updated 4 years ago
- This repo contains the procedural generation pipeline used to generate CrashCar101☆16Jan 14, 2024Updated 2 years ago
- ☆16Jan 16, 2023Updated 3 years ago
- Scripts for running Ailiverse APIs☆10Jan 23, 2023Updated 3 years ago
- Reinforcement learning modular with pytorch☆11Jan 18, 2021Updated 5 years ago
- In this GitHub repository, we will demonstrate how to utilize MongoDB to build an automated underwriting process to calculate a customize…☆11Feb 11, 2026Updated 2 weeks ago
- ☆10Nov 27, 2023Updated 2 years ago
- Using pre-trained YOLO algorithm to detect faces in photo ID documents for ID verification☆10Apr 3, 2018Updated 7 years ago
- Embedding language models in probability space via log-likelihood vectors☆16Oct 25, 2025Updated 4 months ago
- A POC for a mutual fund listing app with a user signup/signin flow that displays multiple mutual funds in a scrollable list. Certain esse…☆10Feb 1, 2023Updated 3 years ago
- ☆10Jun 29, 2021Updated 4 years ago
- https://demo-web.reflex.run☆12Apr 25, 2024Updated last year
- CWTS OpenAlex ETL data pipeline.☆16Oct 29, 2025Updated 4 months ago
- Create realistic looking handwritten text PDFs from text files.☆15Jun 19, 2021Updated 4 years ago
- A node module to quickly calculate monthly payments and the total amount of interest paid for a fixed rate loan.☆13Apr 18, 2014Updated 11 years ago
- ☆13Dec 8, 2022Updated 3 years ago
- Automating LTV Percentage☆10Jun 7, 2021Updated 4 years ago
- Issue tracker for the Open Targets Platform☆13Jul 8, 2025Updated 7 months ago
- A document data extraction system that automatically generates Pydantic schemas from natural language requirements and extracts structure…☆27Feb 12, 2026Updated 2 weeks ago
- Analysis on stop reasons☆10Jun 17, 2024Updated last year
- Automate your blogging with AI-powered tools for creating, optimizing, and deploying content. Generate SEO-optimized articles effortlessl…☆12Aug 16, 2024Updated last year
- Wikimedia Enterprise - client SDK in Python☆20Nov 11, 2025Updated 3 months ago
- Fraud detection in credit card payments and auto insurance claims using PySpark☆14Jan 5, 2019Updated 7 years ago
- Source code used in the blog☆12Feb 6, 2024Updated 2 years ago
- Open Translator: Speech To Speech and Speech to text Translator with voice cloning and other cool features☆14Feb 24, 2026Updated last week
- ☆11Jun 27, 2021Updated 4 years ago
- Library for loan amortization schedule manipulation☆12Updated this week