tencent-ailab / persona-hubLinks

Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"

☆1,373

Alternatives and similar repositories for persona-hub

Users that are interested in persona-hub are comparing it to the libraries listed below

Sorting:

magpie-align / magpie
[ICLR 2025] Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing. Your efficient and high-quality synthetic data …
☆782Updated 7 months ago
zhentingqi / rStar
☆964Updated 9 months ago
AIDC-AI / Marco-o1
An Open Large Reasoning Model for Real-World Solutions
☆1,524Updated 5 months ago
YangLing0818 / buffer-of-thought-llm
[NeurIPS 2024 Spotlight] Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models
☆666Updated 4 months ago
Open-Source-O1 / Open-O1
☆1,349Updated 11 months ago
wasiahmad / Awesome-LLM-Synthetic-Data
A reading list on LLM based Synthetic Data Generation 🔥
☆1,448Updated 4 months ago
maitrix-org / llm-reasoners
A library for advanced large language model reasoning
☆2,292Updated 4 months ago
trotsky1997 / MathBlackBox
☆1,035Updated 10 months ago
ezelikman / quiet-star
Code for Quiet-STaR
☆739Updated last year
SimpleBerry / LLaMA-O1
Large Reasoning Models
☆805Updated 10 months ago
tatsu-lab / alpaca_eval
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
☆1,885Updated 2 months ago
princeton-nlp / SimPO
[NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward
☆924Updated 8 months ago
GAIR-NLP / O1-Journey
O1 Replication Journey
☆2,003Updated 9 months ago
sierra-research / tau-bench
Code and Data for Tau-Bench
☆901Updated 2 months ago
ContextualAI / HALOs
A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).
☆892Updated last month
RUC-NLPIR / Search-o1
🔍 Search-o1: Agentic Search-Enhanced Large Reasoning Models [EMNLP 2025]
☆1,068Updated 2 months ago
gkamradt / LLMTest_NeedleInAHaystack
Doing simple retrieval from LLM models at various context lengths to measure accuracy
☆2,060Updated last year
argilla-io / distilabel
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…
☆2,912Updated this week
prometheus-eval / prometheus-eval
Evaluate your LLM's response with Prometheus and GPT4 💯
☆1,006Updated 6 months ago
Agent-RL / ReCall
ReCall: Learning to Reason with Tool Call for LLMs via Reinforcement Learning
☆1,230Updated 5 months ago
GAIR-NLP / LIMO
[COLM 2025] LIMO: Less is More for Reasoning
☆1,038Updated 3 months ago
uclaml / SPIN
The official implementation of Self-Play Fine-Tuning (SPIN)
☆1,207Updated last year
ysymyth / awesome-language-agents
List of language agents based on paper "Cognitive Architectures for Language Agents"
☆1,050Updated 9 months ago
madaan / self-refine
LLMs can generate feedback on their work, use it to improve the output, and repeat this process iteratively.
☆750Updated last year
openai / mle-bench
MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering
☆1,042Updated 2 weeks ago
BatsResearch / bonito
A lightweight library for generating synthetic instruction tuning datasets for your data without GPT.
☆796Updated 3 months ago
huggingface / cosmopedia
☆546Updated 11 months ago
lmarena / arena-hard-auto
Arena-Hard-Auto: An automatic LLM benchmark.
☆948Updated 4 months ago
allenai / open-instruct
AllenAI's post-training codebase
☆3,263Updated last week
jquesnelle / yarn
YaRN: Efficient Context Window Extension of Large Language Models
☆1,623Updated last year