shizhediao / Post-Training-Data-Flywheel
We aim to provide the best references to search, select, and synthesize high-quality and large-quantity data for post-training your LLMs.
☆61 · Oct 3, 2024 · Updated last year
Alternatives and similar repositories for Post-Training-Data-Flywheel
Users who are interested in Post-Training-Data-Flywheel compare it to the libraries listed below.
- ☆17 · Nov 3, 2024 · Updated last year
- Repository for initial POC NLP based SQL adapter using LLM. ☆10 · May 6, 2025 · Updated 9 months ago
- This is the official implementation of ScaleBiO: Scalable Bilevel Optimization for LLM Data Reweighting ☆24 · Jul 30, 2024 · Updated last year
- [AAAI 2025] Automatically Generating Numerous Context-Driven SFT Data for LLMs across Diverse Granularity ☆26 · Mar 17, 2025 · Updated 10 months ago
- Use the tokenizer in parallel to achieve superior acceleration ☆20 · Mar 21, 2024 · Updated last year
- ☆12 · Jun 30, 2024 · Updated last year
- Reproducing R1 for Code with Reliable Rewards ☆12 · Apr 9, 2025 · Updated 10 months ago
- Demos for AI assistants using NLUX, Next.js, React, and Node.js ☆17 · Jun 24, 2024 · Updated last year
- The official implementation of the paper “Anchored Supervised Fine-Tuning” ☆26 · Jan 30, 2026 · Updated 2 weeks ago
- CFG-GAN: Composite functional gradient learning of generative adversarial models ☆15 · Jul 9, 2020 · Updated 5 years ago
- EMNLP'2023: Explore-Instruct: Enhancing Domain-Specific Instruction Coverage through Active Exploration ☆36 · Mar 10, 2024 · Updated last year
- Directional Preference Alignment ☆58 · Sep 23, 2024 · Updated last year
- Synthetic Data Generation for Evaluation ☆13 · Feb 21, 2025 · Updated 11 months ago
- ☆14 · Feb 26, 2024 · Updated last year
- ☆17 · Apr 10, 2024 · Updated last year
- This is an official implementation of the Reward rAnked Fine-Tuning Algorithm (RAFT), also known as iterative best-of-n fine-tuning or re… ☆39 · Sep 22, 2024 · Updated last year
- The official repository for paper "MLLM-Protector: Ensuring MLLM’s Safety without Hurting Performance" ☆44 · Apr 21, 2024 · Updated last year
- [ACL2024 Findings] DMoERM: Recipes of Mixture-of-Experts for Effective Reward Modeling ☆18 · Jun 6, 2024 · Updated last year
- ☆18 · Jun 17, 2024 · Updated last year
- ☆17 · Nov 10, 2021 · Updated 4 years ago
- ☆109 · Jul 15, 2025 · Updated 7 months ago
- 🍼 Official implementation of Dynamic Data Mixing Maximizes Instruction Tuning for Mixture-of-Experts ☆41 · Sep 29, 2024 · Updated last year
- ☆25 · Dec 13, 2024 · Updated last year
- Implementation of "Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation" ☆21 · Jul 31, 2023 · Updated 2 years ago
- [NAACL 2024 Outstanding Paper] Source code for the NAACL 2024 paper entitled "R-Tuning: Instructing Large Language Models to Say 'I Don't… ☆129 · Jul 10, 2024 · Updated last year
- Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024] ☆588 · Dec 9, 2024 · Updated last year
- ☆21 · Sep 5, 2023 · Updated 2 years ago
- ArcherCodeR is an open-source initiative enhancing code reasoning in large language models through scalable, rule-governed reinforcement … ☆44 · Aug 6, 2025 · Updated 6 months ago
- Free, unrestricted search interface: a web-access (networking) solution for ChatGPT mirror sites ☆16 · Aug 24, 2023 · Updated 2 years ago
- Code for "[COLM'25] RepoST: Scalable Repository-Level Coding Environment Construction with Sandbox Testing" ☆22 · Mar 18, 2025 · Updated 10 months ago
- ☆46 · Jun 11, 2025 · Updated 8 months ago
- Source code for the paper "Active Prompting with Chain-of-Thought for Large Language Models" ☆247 · May 7, 2024 · Updated last year
- Official implementation of the paper "From Complex to Simple: Enhancing Multi-Constraint Complex Instruction Following Ability of Large L… ☆53 · Jun 24, 2024 · Updated last year
- [COLM 2025] An Open Math Pre-training Dataset with 370B Tokens. ☆109 · Apr 4, 2025 · Updated 10 months ago
- Self-Supervised Alignment with Mutual Information ☆20 · May 24, 2024 · Updated last year
- [ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale ☆266 · Jul 8, 2025 · Updated 7 months ago
- Momentum Decoding: Open-ended Text Generation as Graph Exploration ☆19 · Jan 27, 2023 · Updated 3 years ago
- ☆27 · Apr 11, 2023 · Updated 2 years ago
- [NeurIPS 2024] OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI ☆107 · Mar 6, 2025 · Updated 11 months ago