[ACL'25] Official Code for LlamaDuo: LLMOps Pipeline for Seamless Migration from Service LLMs to Small-Scale Local LLMs
☆317Jul 13, 2025Updated 7 months ago
Alternatives and similar repositories for llamaduo
Users that are interested in llamaduo are comparing it to the libraries listed below
Sorting:
- ☆30Mar 18, 2024Updated last year
- Radiantloom Email Assist 7B is an email-assistant large language model fine-tuned from Zephyr-7B-Beta, over a custom-curated dataset of 1…☆14Jan 19, 2024Updated 2 years ago
- ☆16Jul 23, 2024Updated last year
- Code and data from the paper 'Human Feedback is not Gold Standard'☆20Feb 24, 2026Updated last week
- Code Repository for Blog - How to Productionize Large Language Models (LLMs)☆12Mar 27, 2024Updated last year
- Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients.☆203Jul 17, 2024Updated last year
- XmodelLM☆38Nov 19, 2024Updated last year
- generate synthetic data for LLM fine-tuning in arbitrary situations within systematic way☆22Mar 18, 2024Updated last year
- ☆17Apr 9, 2025Updated 10 months ago
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free☆233Oct 31, 2024Updated last year
- A RAG that can scale 🧑🏻💻☆11May 28, 2024Updated last year
- [COLM 2025] "C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing"☆20Apr 9, 2025Updated 10 months ago
- yolosegment2labelme - a Python package that allows you to convert YOLO segmentation prediction results to LabelMe and anylabeling JSON fo…☆10May 8, 2024Updated last year
- [ICLR'25] Code for KaSA, an official implementation of "KaSA: Knowledge-Aware Singular-Value Adaptation of Large Language Models"☆20Jan 16, 2025Updated last year
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…☆3,108Feb 23, 2026Updated last week
- Reward Model을 이용하여 언어모델의 답변을 평가하기☆29Feb 23, 2024Updated 2 years ago
- Robust recipes to align language models with human and AI preferences☆5,510Sep 8, 2025Updated 5 months ago
- A minimal Python framework for building custom AI inference servers with full control over logic, batching, and scaling.☆3,803Feb 25, 2026Updated last week
- What Would Portland Do? Generative agent experience☆13Mar 13, 2024Updated last year
- RAG Based LLM Chatbot Built using Open Source Stack (Llama 3.2 Model, BGE Embeddings, and Qdrant running locally within a Docker Containe…☆15Jan 9, 2025Updated last year
- Automating enterprise workflows with multimodal agents☆115Oct 9, 2024Updated last year
- Official implementation of ECCV24 paper: POA☆24Aug 8, 2024Updated last year
- This repository shares end-to-end notebooks on how to use various Weaviate features and integrations!☆936Jan 28, 2026Updated last month
- batched loras☆350Sep 6, 2023Updated 2 years ago
- GenAIOps on Kubernetes: A collection of reference architectures for running GenAI at scale on Kubernetes using OSS tooling☆136Oct 29, 2024Updated last year
- Multi-Agent AI App from Scratch in python without any depedency of framework☆15Jan 7, 2025Updated last year
- ☆13Sep 12, 2024Updated last year
- ☆55Jan 15, 2026Updated last month
- Chatbot for The Carbon Almanac book or a climate change related topic☆16Mar 6, 2023Updated 3 years ago
- Evaluate your LLM's response with Prometheus and GPT4 💯☆1,050Apr 25, 2025Updated 10 months ago
- The code for the paper ROUTERBENCH: A Benchmark for Multi-LLM Routing System☆154Jun 13, 2024Updated last year
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models☆40Jun 10, 2024Updated last year
- Knowledge Graph Generator app☆34Apr 18, 2024Updated last year
- A Python library to orchestrate LLMs in a neural network-inspired structure☆52Oct 4, 2024Updated last year
- Training LLMs with QLoRA + FSDP☆1,538Nov 9, 2024Updated last year
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]☆149Oct 27, 2024Updated last year
- Material for the series of seminars on Large Language Models☆34Apr 21, 2024Updated last year
- A lightweight library for generating synthetic instruction tuning datasets for your data without GPT.☆823Jul 15, 2025Updated 7 months ago
- ☆23Oct 28, 2024Updated last year