FareedKhan-dev / llm-scale-deploy-guide
An end-to-end pipeline to optimize and host an LLM for 100K parallel queries
☆30 · Updated 4 months ago
Alternatives and similar repositories for llm-scale-deploy-guide
Users who are interested in llm-scale-deploy-guide are comparing it to the repositories listed below.
- Training setup for LangChain's Open Deep Research ☆72 · Updated 3 months ago
- An overview of GRPO & DeepSeek-R1 Training with Open Source GRPO Model Fine Tuning ☆36 · Updated 6 months ago
- Verifiers for LLM Reinforcement Learning ☆79 · Updated 7 months ago
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models ☆113 · Updated 7 months ago
- ☆51 · Updated last year
- Train LLM on Hugging Face infra ☆67 · Updated 2 weeks ago
- ☆99 · Updated 8 months ago
- Fastest way to build, prototype and deploy AI Agents or ANY LLM Application with a built-in security layer. ☆99 · Updated this week
- Inference, Fine Tuning and many more recipes with Gemma family of models ☆274 · Updated 4 months ago
- The purpose of this repo is to implement LLMOps as shared in the DeepLearning.AI course ☆35 · Updated last week
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera… ☆106 · Updated this week
- Implementation of 12 AI agents evaluation techniques ☆28 · Updated 4 months ago
- A fast, local, and secure approach for training LLMs for coding tasks using GRPO with WebAssembly and interpreter feedback. ☆40 · Updated 7 months ago
- LLM reads a paper and produces a working prototype ☆58 · Updated 7 months ago
- Accompanying material for the sleep-time compute paper ☆117 · Updated 7 months ago
- An Automatic Prompt Optimization Framework for Large Language Models ☆138 · Updated 3 months ago
- ☆80 · Updated 2 weeks ago
- ☆80 · Updated last year
- Source code for the collaborative reasoner research project at Meta FAIR. ☆106 · Updated 7 months ago
- This repository contains a pipeline for fine-tuning Large Language Models (LLMs) for Text-to-SQL conversion using General Reward Proximal… ☆39 · Updated 7 months ago
- Luth is a state-of-the-art series of fine-tuned LLMs for French ☆39 · Updated last month
- Streamline on-policy/off-policy distillation workflows in a few lines of code ☆64 · Updated last week
- ☆88 · Updated 3 weeks ago
- Training LLMs to reason and analyze data with notebooks ☆53 · Updated 2 months ago
- Codebase accompanying the Summary of a Haystack paper. ☆79 · Updated last year
- Lightweight toolkit package to train and fine-tune 1.58-bit language models ☆99 · Updated 6 months ago
- ☆41 · Updated 7 months ago
- Submodular optimization for context engineering: query fan-out, text selection, passage reranking ☆77 · Updated 4 months ago
- This repo is the central repo for all the RAG Evaluation reference material and partner workshop ☆76 · Updated 7 months ago
- unsloth-5090-multiple ☆59 · Updated 6 months ago