BatsResearch / bonitoLinks
A lightweight library for generating synthetic instruction tuning datasets for your data without GPT.
โ780Updated 4 months ago
Alternatives and similar repositories for bonito
Users that are interested in bonito are comparing it to the libraries listed below
Sorting:
- Evaluate your LLM's response with Prometheus and GPT4 ๐ฏโ959Updated 2 months ago
- DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models. โ ๐ค๐คโ1,031Updated 5 months ago
- The official implementation of RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrievalโ1,309Updated 10 months ago
- Automated Evaluation of RAG Systemsโ624Updated 3 months ago
- โ883Updated 8 months ago
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.โ1,492Updated last month
- โ908Updated 10 months ago
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifiโฆโ2,800Updated this week
- Automatically evaluate your LLMs in Google Colabโ646Updated last year
- [NeurIPS 2024 Spotlight] Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Modelsโ643Updated 2 weeks ago
- Fine-Tuning Embedding for RAG with Synthetic Dataโ503Updated last year
- Code for explaining and evaluating late chunking (chunked pooling)โ416Updated 6 months ago
- Generative Representational Instruction Tuningโ658Updated 2 weeks ago
- [ACL2023] We introduce LLM-Blender, an innovative ensembling framework to attain consistently superior performance by leveraging the diveโฆโ950Updated 8 months ago
- Easily embed, cluster and semantically label text datasetsโ552Updated last year
- Train Models Contrastively in Pytorchโ727Updated 3 months ago
- Open-source tool to visualise your RAG ๐ฎโ1,144Updated 6 months ago
- [ICLR 2025] Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing. Your efficient and high-quality synthetic data โฆโ728Updated 3 months ago
- โ1,025Updated 6 months ago
- This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai,โฆโ2,123Updated last year
- Efficient Retrieval Augmentation and Generation Frameworkโ1,588Updated 6 months ago
- Repository for "MultiHop-RAG: A Dataset for Evaluating Retrieval-Augmented Generation Across Documents" (COLM 2024)โ338Updated 3 months ago
- โ611Updated 5 months ago
- Benchmarking long-form factuality in large language models. Original code for our paper "Long-form factuality in large language models".โ621Updated this week
- Best practices for distilling large language models.โ560Updated last year
- RAGChecker: A Fine-grained Framework For Diagnosing RAGโ929Updated 7 months ago
- โ411Updated last year
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backendsโ1,722Updated this week
- โ523Updated 7 months ago
- Meta-Prompting: Enhancing Language Models with Task-Agnostic Scaffoldingโ392Updated last year