MoritzLaurer / synthetic-data-blog

This is the reproduction repository for my 🤗 Hugging Face blog post on synthetic data

☆68

Alternatives and similar repositories for synthetic-data-blog

Users who are interested in synthetic-data-blog are comparing it to the repositories listed below.

davanstrien / awesome-synthetic-datasets
awesome synthetic (text) datasets
☆305 Updated this week
huggingface / data-is-better-together
Let's build better datasets, together!
☆264 Updated 10 months ago
MoritzLaurer / zeroshot-classifier
Notebooks for training universal 0-shot classifiers on many different tasks
☆136 Updated 10 months ago
microsoft / llm-data-creation
Model, Code & Data for the EMNLP'23 paper "Making Large Language Models Better Data Creators"
☆134 Updated 2 years ago
QuixiAI / spectrum
☆138 Updated 2 months ago
deshwalmahesh / PHUDGE
Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…
☆50 Updated last year
writer / writing-in-the-margins
☆119 Updated last year
cfahlgren1 / observers
A Lightweight Library for AI Observability
☆251 Updated 8 months ago
davanstrien / data-for-fine-tuning-llms
☆80 Updated last year
apple / ml-superposition-prompting
☆146 Updated last year
davanstrien / haiku-dpo
Using open source LLMs to build synthetic datasets for direct preference optimization
☆69 Updated last year
arcee-ai / EvolKit
EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M…
☆242 Updated last year
salesforce / summary-of-a-haystack
Codebase accompanying the Summary of a Haystack paper.
☆79 Updated last year
zetaalphavector / RAGElo
RAGElo is a set of tools that helps you select the best RAG-based LLM agents by using an Elo ranker
☆122 Updated 2 weeks ago
Knowledgator / GLiClass
Generalist and Lightweight Model for Text Classification
☆164 Updated 5 months ago
daniel-furman / sft-demos
Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.
☆78 Updated last year
lamini-ai / Lamini-Memory-Tuning
Banishing LLM Hallucinations Requires Rethinking Generalization
☆275 Updated last year
jxmorris12 / cde
code for training & evaluating Contextual Document Embedding models
☆200 Updated 6 months ago
huggingface / llm-swarm
Manage scalable open LLM inference endpoints in Slurm clusters
☆276 Updated last year
ibm-granite / granite-3.0-language-models
☆268 Updated 4 months ago
geronimi73 / phi2-finetune
☆86 Updated last year
hamelsmu / llama-inference
experiments with inference on llama
☆103 Updated last year
predlico / ARAGOG
ARAGOG- Advanced RAG Output Grading. Exploring and comparing various Retrieval-Augmented Generation (RAG) techniques on AI research paper…
☆114 Updated last year
AnswerDotAI / fastdata
☆159 Updated 11 months ago
jina-ai / correlations
Simple UI for debugging correlations of text embeddings
☆299 Updated 5 months ago
osanseviero / hackerllama
My personal site
☆78 Updated last year
teknium1 / LLM-Benchmark-Logs
Just a bunch of benchmark logs for different LLMs
☆118 Updated last year
Muhtasham / summarization-eval
📝 Reference-Free automatic summarization evaluation with potential hallucination detection
☆102 Updated last year
alopatenko / LLMEvaluation
A comprehensive guide to LLM evaluation methods designed to assist in identifying the most suitable evaluation techniques for various use…
☆149 Updated this week
Arize-ai / LLMTest_NeedleInAHaystack
Doing simple retrieval from LLM models at various context lengths to measure accuracy
☆106 Updated 2 months ago