FareedKhan-dev / llm-scale-deploy-guide
An end-to-end pipeline to optimize and host LLM for 100K parallel queries
☆30Updated 3 months ago
Alternatives and similar repositories for llm-scale-deploy-guide
Users who are interested in llm-scale-deploy-guide are comparing it to the libraries listed below.
- LLM reads a paper and produces a working prototype☆57Updated 6 months ago
- ☆50Updated last year
- Training setup for Langchain's Open Deep Research☆65Updated last month
- Fastest way to build, prototype and deploy AI Agents with tools securely.☆40Updated this week
- An overview of GRPO & DeepSeek-R1 Training with Open Source GRPO Model Fine Tuning☆37Updated 5 months ago
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models☆111Updated 6 months ago
- ☆67Updated 6 months ago
- Handling Big Data with Knowledge Graph: A Detailed Guide☆27Updated 5 months ago
- Machine Learning Serving focused on GenAI with simplicity as the top priority.☆58Updated 2 weeks ago
- Verifiers for LLM Reinforcement Learning☆76Updated 6 months ago
- ☆160Updated 3 months ago
- An agent to generate stunning images :)☆23Updated 4 months ago
- ☆41Updated 5 months ago
- A Step-by-Step Implementation of RAPTOR-based RAG☆29Updated last month
- ☆146Updated last year
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖☆76Updated 10 months ago
- The code repository of the paper: Competition and Attraction Improve Model Fusion☆161Updated last month
- A curated list of materials on AI guardrails☆40Updated 4 months ago
- Building LLMs from scratch following the book from S. Raschka☆32Updated 6 months ago
- ☆40Updated 10 months ago
- Official Repo for The Paper "Talk Structurally, Act Hierarchically: A Collaborative Framework for LLM Multi-Agent Systems"☆57Updated 7 months ago
- Implementation of contextual engineering pipeline with LangChain and LangGraph Agents☆57Updated 2 months ago
- Dataset Viber is your chill repo for data collection, annotation and vibe checks.☆46Updated last year
- ☆21Updated last year
- ☆80Updated last year
- A fast, local, and secure approach for training LLMs for coding tasks using GRPO with WebAssembly and interpreter feedback.☆39Updated 6 months ago
- ☆43Updated 5 months ago
- GPT-4 Level Conversational QA Trained In a Few Hours☆65Updated last year
- minimal scripts for 24GB VRAM GPUs. training, inference, whatever☆43Updated last month
- This is an open-source version of OpenAI's O1 Model Series by Siraj Raval & O1-Preview☆96Updated last year