wasiahmad / Awesome-LLM-Synthetic-Data
A reading list on LLM-based Synthetic Data Generation 🔥
★1,223 · Updated last month
Alternatives and similar repositories for Awesome-LLM-Synthetic-Data:
Users who are interested in Awesome-LLM-Synthetic-Data are comparing it to the repositories listed below:
- Synthetic data curation for post-training and structured data extraction ★1,097 · Updated last week
- LIMO: Less is More for Reasoning ★875 · Updated last month
- Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard a… ★1,105 · Updated 2 months ago
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends ★1,358 · Updated this week
- ★1,011 · Updated 3 months ago
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi… ★2,601 · Updated last week
- [ICLR 2025] Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing. Your efficient and high-quality synthetic data … ★664 · Updated 2 weeks ago
- A library for advanced large language model reasoning ★2,069 · Updated last month
- Recipes to scale inference-time compute of open models ★1,048 · Updated last month
- Search-o1: Agentic Search-Enhanced Large Reasoning Models ★748 · Updated 3 weeks ago
- Awesome Reasoning LLM Tutorial/Survey/Guide ★1,220 · Updated this week
- Evaluate your LLM's response with Prometheus and GPT4 💯 ★893 · Updated 2 weeks ago
- Automatic evals for LLMs ★346 · Updated this week
- Summarize existing representative LLMs text datasets. ★1,227 · Updated last week
- System 2 Reasoning Link Collection ★818 · Updated 2 weeks ago
- O1 Replication Journey ★1,980 · Updated 2 months ago
- ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning ★395 · Updated last week
- A bibliography and survey of the papers surrounding o1 ★1,183 · Updated 4 months ago
- Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL ★1,466 · Updated last week
- Training Large Language Model to Reason in a Continuous Latent Space ★1,015 · Updated 2 months ago
- Large Reasoning Models ★800 · Updated 4 months ago
- An Open Large Reasoning Model for Real-World Solutions ★1,477 · Updated 3 weeks ago
- A curated list of retrieval-augmented generation (RAG) in large language models ★250 · Updated last month
- List of papers on hallucination detection in LLMs. ★812 · Updated 3 weeks ago
- ★913 · Updated 2 months ago
- AllenAI's post-training codebase ★2,854 · Updated this week
- [NeurIPS 2024 Spotlight] Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models ★614 · Updated last week
- TextGrad: Automatic "Differentiation" via Text -- using large language models to backpropagate textual gradients. ★2,378 · Updated this week
- ★1,348 · Updated 4 months ago
- [ACL'24] Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning ★350 · Updated 6 months ago