redotvideo / pluto
Synthetic Data for LLM Fine-Tuning
☆97Updated 11 months ago
Related projects ⓘ
Alternatives and complementary repositories for pluto
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…☆203Updated 6 months ago
- Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI☆221Updated 6 months ago
- RAFT, or Retrieval-Augmented Fine-Tuning, is a method comprising of a fine-tuning and a RAG-based retrieval phase. It is particularly sui…☆75Updated 2 months ago
- Doing simple retrieval from LLM models at various context lengths to measure accuracy☆97Updated 7 months ago
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo)☆65Updated this week
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M…☆181Updated 3 weeks ago
- Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners"☆82Updated 2 months ago
- Just a bunch of benchmark logs for different LLMs☆116Updated 3 months ago
- an implementation of Self-Extend, to expand the context window via grouped attention☆118Updated 10 months ago
- Function Calling Benchmark & Testing☆75Updated 4 months ago
- ☆94Updated 2 months ago
- The code for the paper ROUTERBENCH: A Benchmark for Multi-LLM Routing System☆93Updated 5 months ago
- ☆106Updated 2 months ago
- Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on task…☆134Updated 2 months ago
- A benchmark for emotional intelligence in large language models☆197Updated 3 months ago
- A framework for evaluating function calls made by LLMs☆35Updated 4 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆48Updated 4 months ago
- Solving data for LLMs - Create quality synthetic datasets!☆137Updated last month
- Python SDK for experimenting, testing, evaluating & monitoring LLM-powered applications - Parea AI (YC S23)☆74Updated 2 months ago
- 🤖 Headless IDE for AI agents☆133Updated this week
- Routing on Random Forest (RoRF)☆84Updated 2 months ago
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆113Updated 3 weeks ago
- Low-Rank adapter extraction for fine-tuned transformers model☆162Updated 6 months ago
- A toolkit for building multimodal AI agents☆111Updated this week
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte…☆53Updated 3 weeks ago
- Load multiple LoRA modules simultaneously and automatically switch the appropriate combination of LoRA modules to generate the best answe…☆144Updated 9 months ago
- Simple examples using Argilla tools to build AI☆42Updated this week
- autologic is a Python package that implements the SELF-DISCOVER framework proposed in the paper SELF-DISCOVER: Large Language Models Self…☆57Updated 9 months ago
- A simple Python sandbox for helpful LLM data agents☆173Updated 5 months ago
- ☆104Updated 8 months ago