An end-to-end pipeline to optimize and host an LLM for 100K parallel queries
☆36Jul 6, 2025Updated 8 months ago
Alternatives and similar repositories for llm-scale-deploy-guide
Users that are interested in llm-scale-deploy-guide are comparing it to the libraries listed below.
- Implementation of 12 AI agents evaluation techniques☆37Jul 31, 2025Updated 7 months ago
- A Step-by-Step Implementation of RAPTOR-based RAG☆38Sep 1, 2025Updated 6 months ago
- A benchmark dataset designed to support the development and evaluation of large language models (LLMs) for conversational mental health a…☆16Feb 24, 2025Updated last year
- An LLM-based Multi-Agent Framework for Financial Crime & Suspicious Matter Reporting☆13Apr 28, 2024Updated last year
- Demonstrating a method to build large datasets of 3D buildings in any style using the NeRS technique☆12Jan 23, 2023Updated 3 years ago
- Implementation of contextual engineering pipeline with LangChain and LangGraph Agents☆85Jul 29, 2025Updated 7 months ago
- Finetuning BLOOM on a single GPU using gradient-accumulation☆31Mar 29, 2023Updated 2 years ago
- Self-training LLaVA for medical☆16Nov 3, 2024Updated last year
- Python You Can Use Right Away: Basics (Right Way to Python)☆13May 18, 2023Updated 2 years ago
- A Step-by-Step Implementation of Google Veo 3 Architecture from Scratch☆82Jun 16, 2025Updated 9 months ago
- Handling Big Data with Knowledge Graph: A Detailed Guide☆29May 11, 2025Updated 10 months ago
- Reasoning-based Evaluation and Ranking of Translations.☆20Jul 18, 2025Updated 8 months ago
- ☆10Jun 29, 2021Updated 4 years ago
- Generate a schema and validate user input from types☆12Mar 1, 2026Updated 3 weeks ago
- ☆14Nov 29, 2022Updated 3 years ago
- Clone project of iOS default calculator app☆12Jan 28, 2021Updated 5 years ago
- The Piscine was great, but time has passed. This short series of simple exercises will help you get back in the saddle…☆15Dec 8, 2017Updated 8 years ago
- Phased Array Radar☆12Aug 15, 2012Updated 13 years ago
- CCMusic, an open Chinese music database, integrates diverse datasets. It ensures data consistency via cleaning, label refinement and stru…☆27Oct 31, 2025Updated 4 months ago
- A lightweight, type-safe workflow engine for TypeScript that helps you create flexible, graph-based execution flows☆26Jun 24, 2025Updated 8 months ago
- ☆20Jul 23, 2025Updated 8 months ago
- Lightweight continuous batching with OpenAI compatibility using HuggingFace Transformers, including T5 and Whisper.☆29Mar 15, 2025Updated last year
- A repository of data on accessibility on the MTA, and resources to make working with data from the MTA easier.☆20Jan 9, 2025Updated last year
- slowly building a set of infinite riddle generators for data-hungry methods☆14Nov 15, 2022Updated 3 years ago
- Combining ontology and knowledge graph for an ultimate GraphRAG system.☆41Feb 21, 2026Updated last month
- A paper comparing Dask and Spark☆10Dec 9, 2022Updated 3 years ago
- ☆77Dec 3, 2024Updated last year
- ☆47Jan 8, 2026Updated 2 months ago
- IDE for iPadOS☆32Nov 26, 2022Updated 3 years ago
- ☆11Apr 22, 2020Updated 5 years ago
- Fork of Flame repo for training of some new stuff in development☆19Updated this week
- ☆29Jun 5, 2025Updated 9 months ago
- Generative modeling of MIDI files☆18Mar 7, 2024Updated 2 years ago
- The official Python library for the Atla API☆15Jul 21, 2025Updated 8 months ago
- A simplified implementation of RetinaNet from https://arxiv.org/pdf/1708.02002.pdf using TF2.0☆13Aug 5, 2020Updated 5 years ago
- Document Driven Development☆18Nov 9, 2025Updated 4 months ago
- ☆40Sep 7, 2025Updated 6 months ago
- Marketplace ML experiment - training without backprop☆27Sep 9, 2025Updated 6 months ago
- Measuring Thinking Efficiency in Reasoning Models - Research Repository☆39Dec 2, 2025Updated 3 months ago