MoritzLaurer / synthetic-data-blog
This is the reproduction repository for my π€ Hugging Face blog post on synthetic data
β68Updated last year
Alternatives and similar repositories for synthetic-data-blog:
Users that are interested in synthetic-data-blog are comparing it to the libraries listed below
- awesome synthetic (text) datasetsβ265Updated 5 months ago
- Simple examples using Argilla tools to build AIβ53Updated 4 months ago
- β76Updated 9 months ago
- β115Updated 7 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absoluteβ¦β49Updated 8 months ago
- β66Updated 10 months ago
- Using open source LLMs to build synthetic datasets for direct preference optimizationβ59Updated last year
- β112Updated 6 months ago
- Lightweight demos for finetuning LLMs. Powered by π€ transformers and open-source datasets.β73Updated 5 months ago
- β143Updated 8 months ago
- RAGElo is a set of tools that helps you selecting the best RAG-based LLM agents by using an Elo rankerβ107Updated 2 weeks ago
- Model, Code & Data for the EMNLP'23 paper "Making Large Language Models Better Data Creators"β129Updated last year
- Let's build better datasets, together!β257Updated 3 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing β‘β67Updated 4 months ago
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language Mβ¦β208Updated 5 months ago
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).β80Updated last year
- Manage scalable open LLM inference endpoints in Slurm clustersβ253Updated 8 months ago
- Maya: An Instruction Finetuned Multilingual Multimodal Model using Ayaβ107Updated last month
- Codebase accompanying the Summary of a Haystack paper.β76Updated 6 months ago
- Set of scripts to finetune LLMsβ37Updated last year
- code for training & evaluating Contextual Document Embedding modelsβ176Updated 2 months ago
- π Reference-Free automatic summarization evaluation with potential hallucination detectionβ100Updated last year
- Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners"β105Updated 6 months ago
- A blueprint for AI development, focusing on applied examples of RAG, information extraction, analysis and fine-tuning in the age of LLMs β¦β50Updated last month
- Truth Forest: Toward Multi-Scale Truthfulness in Large Language Models through Intervention without Tuningβ46Updated last year
- Mixing Language Models with Self-Verification and Meta-Verificationβ102Updated 3 months ago
- A Lightweight Library for AI Observabilityβ238Updated last month
- An introduction to LLM Samplingβ77Updated 3 months ago
- β150Updated 4 months ago
- ARAGOG- Advanced RAG Output Grading. Exploring and comparing various Retrieval-Augmented Generation (RAG) techniques on AI research paperβ¦β102Updated 11 months ago