reactorsh / ambrosia
clean up your LLM datasets
☆113 Updated last year
Alternatives and similar repositories for ambrosia:
Users who are interested in ambrosia are comparing it to the libraries listed below:
- Just a bunch of benchmark logs for different LLMs ☆117 Updated 6 months ago
- an implementation of Self-Extend, to expand the context window via grouped attention ☆118 Updated last year
- auto fine tune of models with synthetic data ☆74 Updated 11 months ago
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes. ☆82 Updated last year
- ☆48 Updated last year
- Doing simple retrieval from LLM models at various context lengths to measure accuracy ☆99 Updated 9 months ago
- Routing on Random Forest (RoRF) ☆100 Updated 4 months ago
- Modified Stanford-Alpaca Trainer for Training Replit's Code Model ☆40 Updated last year
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto… ☆218 Updated 8 months ago
- Synthetic Data for LLM Fine-Tuning ☆108 Updated last year
- Function Calling Benchmark & Testing ☆79 Updated 6 months ago
- ☆24 Updated last year
- ☆20 Updated last year
- Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI ☆222 Updated 9 months ago
- Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform ☆82 Updated 3 weeks ago
- Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models ☆69 Updated last year
- ☆38 Updated last year
- ☆22 Updated last year
- ☆57 Updated last year
- ☆65 Updated 8 months ago
- Testing speed and accuracy of RAG with, and without Cross Encoder Reranker. ☆48 Updated last year
- ☆136 Updated last year
- Track the progress of LLM context utilisation ☆53 Updated 6 months ago
- Official homepage for "Self-Harmonized Chain of Thought" ☆89 Updated last week
- Build a Streamlit Chatbot using Langchain, ColBERT, Ragatouille, and ChromaDB ☆118 Updated last year
- ☆109 Updated last month
- A framework for evaluating function calls made by LLMs ☆36 Updated 6 months ago
- ☆74 Updated last year
- Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search. ☆30 Updated last month
- KMD is a collection of conversational exchanges between patients and doctors on various medical topics. It aims to capture the intricaci… ☆24 Updated last year