tencent-ailab / persona-hub
Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"
☆1,096 Updated last month
Alternatives and similar repositories for persona-hub:
Users who are interested in persona-hub are comparing it to the libraries listed below.
- [ICLR 2025] Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing. Your efficient and high-quality synthetic data … ☆660 Updated last week
- ☆910 Updated 2 months ago
- A reading list on LLM based Synthetic Data Generation 🔥 ☆1,211 Updated last month
- [NeurIPS 2024 Spotlight] Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models ☆612 Updated last month
- Search-o1: Agentic Search-Enhanced Large Reasoning Models ☆735 Updated 3 weeks ago
- Code for Quiet-STaR ☆721 Updated 7 months ago
- A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs). ☆817 Updated 2 weeks ago
- ☆1,347 Updated 4 months ago
- O1 Replication Journey ☆1,977 Updated 2 months ago
- ☆1,011 Updated 3 months ago
- Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL ☆1,389 Updated this week
- Large Reasoning Models ☆799 Updated 3 months ago
- ☆559 Updated last week
- [NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward ☆851 Updated last month
- Recipes to scale inference-time compute of open models ☆1,044 Updated last month
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi… ☆2,578 Updated this week
- Evaluate your LLM's response with Prometheus and GPT4 💯 ☆885 Updated last week
- An Open-source RL System from ByteDance Seed and Tsinghua AIR ☆767 Updated this week
- An Open Large Reasoning Model for Real-World Solutions ☆1,475 Updated 3 weeks ago
- ☆502 Updated 4 months ago
- [ACL'24] Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning ☆349 Updated 6 months ago
- A library for advanced large language model reasoning ☆2,060 Updated last month
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends ☆1,313 Updated this week
- ☆584 Updated 2 months ago
- Code and datasets for "Character-LLM: A Trainable Agent for Role-Playing" ☆522 Updated 4 months ago
- Scalable RL solution for advanced reasoning of language models ☆1,419 Updated last week
- Synthetic data curation for post-training and structured data extraction ☆1,065 Updated this week
- Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders' ☆1,457 Updated 2 months ago
- An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast. ☆1,695 Updated 2 months ago
- LIMO: Less is More for Reasoning ☆864 Updated last month