deep-diver / llamaduo
This project showcases an LLMOps pipeline that fine-tunes a small LLM to prepare for an outage of the service LLM.
☆289 Updated 2 months ago
Related projects
Alternatives and complementary repositories for llamaduo
- ☆131 Updated 4 months ago
- awesome synthetic (text) datasets ☆242 Updated 3 weeks ago
- Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. 👨🏻‍🍳 ☆175 Updated this week
- Banishing LLM Hallucinations Requires Rethinking Generalization ☆261 Updated 4 months ago
- An Open Source Toolkit For LLM Distillation ☆356 Updated 2 months ago
- ☆105 Updated 2 months ago
- Tutorial for building LLM router ☆163 Updated 4 months ago
- Framework for enhancing LLMs for RAG tasks using fine-tuning. ☆504 Updated this week
- Solving data for LLMs - Create quality synthetic datasets! ☆137 Updated last month
- Code for Husky, an open-source language agent that solves complex, multi-step reasoning tasks. Husky v1 addresses numerical, tabular and … ☆328 Updated 5 months ago
- Use late-interaction multi-modal models such as ColPali in just a few lines of code. ☆617 Updated last week
- Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper. ☆130 Updated this week
- ARAGOG- Advanced RAG Output Grading. Exploring and comparing various Retrieval-Augmented Generation (RAG) techniques on AI research paper… ☆96 Updated 7 months ago
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M… ☆180 Updated 3 weeks ago
- ☆204 Updated 4 months ago
- Domain Adapted Language Modeling Toolkit - E2E RAG ☆311 Updated last week
- This project enhances the construction of RAG applications by addressing challenges, improving accessibility, scalability, and managing d… ☆137 Updated 7 months ago
- Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs ☆165 Updated 2 weeks ago
- This is an implementation of the paper: Searching for Best Practices in Retrieval-Augmented Generation ☆215 Updated last month
- Task-based Agentic Framework using StrictJSON as the core ☆436 Updated last month
- Automatically evaluate your LLMs in Google Colab ☆559 Updated 6 months ago
- GenAIOps on Kubernetes: A collection of reference architectures for running GenAI at scale on Kubernetes using OSS tooling ☆128 Updated 3 weeks ago
- RAFT, or Retrieval-Augmented Fine-Tuning, is a method comprising a fine-tuning phase and a RAG-based retrieval phase. It is particularly sui… ☆75 Updated 2 months ago
- Code for explaining and evaluating late chunking (chunked pooling) ☆246 Updated last month
- Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI ☆221 Updated 6 months ago
- A library for easily merging multiple LLM experts and efficiently training the merged LLM. ☆408 Updated 2 months ago
- Repository to demonstrate Chain of Table reasoning with multiple tables powered by LangGraph ☆146 Updated 7 months ago
- Fast parallel LLM inference for MLX ☆149 Updated 4 months ago
- Building a chatbot powered by a RAG pipeline to read, summarize, and quote the most relevant papers related to the user query. ☆162 Updated 6 months ago
- Structured information extraction from documents ☆282 Updated last month