facebookresearch/darling

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/facebookresearch/darling)

facebookresearch / darling

Official Implementation of the paper "Jointly Reinforcing Diversity and Quality in Language Model Generations"

☆61

Alternatives and similar repositories for darling

Users that are interested in darling are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

euiin / SMART
View on GitHub
SMART introduces a novel test-time framework where Small Language Models (SLMs) reason step-by-step, and Large Language Models (LLMs) pro…
☆12Jul 9, 2025Updated last year
intervention-training / int
View on GitHub
☆16Feb 4, 2026Updated 5 months ago
novelty-bench / novelty-bench
View on GitHub
☆34Nov 27, 2025Updated 7 months ago
RefineBench / refinebench-eval
View on GitHub
Official code and dataset for our paper: RefineBench: Evaluating Refinement Capability of Language Models via Checklists
☆17Dec 1, 2025Updated 7 months ago
ryoungj / BoLT
View on GitHub
Code for "Reasoning to Learn from Latent Thoughts"
☆134Mar 28, 2025Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
leo-liuzy / CodeUpdateArena
View on GitHub
☆17Mar 20, 2025Updated last year
ritzz-ai / PACS
View on GitHub
☆31Sep 12, 2025Updated 10 months ago
howard-yen / SLIM
View on GitHub
☆27Jun 22, 2026Updated last month
stellalisy / PrefPalette
View on GitHub
☆21Apr 3, 2026Updated 3 months ago
liuchengwucn / Safe
View on GitHub
(ACL 2025 Main) Safe: Enhancing Mathematical Reasoning in Large Language Models via Retrospective Step-aware Formal Verification - Offici…
☆21Dec 26, 2025Updated 6 months ago
chicosirius / think-or-not
View on GitHub
☆22May 23, 2025Updated last year
NJU-RL / DIVER
View on GitHub
[ICLR 2026] The Official Implementation of DIVER
☆34Mar 5, 2026Updated 4 months ago
lichengliu03 / unary-feedback
View on GitHub
☆44Mar 31, 2026Updated 3 months ago
ASTRAL-Group / LoRe
View on GitHub
When Reasoning Meets Its Laws
☆38Jan 2, 2026Updated 6 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
yayayacc / MUR
View on GitHub
☆49May 14, 2026Updated 2 months ago
ChangyuChen347 / MaskedThought
View on GitHub
[ACL 2024] Masked Thought: Simply Masking Partial Reasoning Steps Can Improve Mathematical Reasoning Learning of Language Models
☆27Jul 9, 2024Updated 2 years ago
YujunZhou / EVOL-RL
View on GitHub
Code for Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes Variation (EVOL-RL).
☆51Mar 31, 2026Updated 3 months ago
SkyworkAI / Skywork-DeepResearch
View on GitHub
☆27Aug 13, 2025Updated 11 months ago
sony / cmt
View on GitHub
☆21Mar 3, 2026Updated 4 months ago
upiterbarg / lintseq
View on GitHub
[ICLR 2025] "Training LMs on Synthetic Edit Sequences Improves Code Synthesis" (Piterbarg, Pinto, Fergus)
☆19Feb 11, 2025Updated last year
DripNowhy / Octopus
View on GitHub
[ICML 2026] Official implementation for paper: Learning Self-Correction in Vision–Language Models via Rollout Augmentation
☆16Jun 4, 2026Updated last month
multimodal-art-projection / TreePO
View on GitHub
☆65Mar 30, 2026Updated 3 months ago
lingchen0331 / UQ_ICL
View on GitHub
Uncertainty quantification for in-context learning of large language models
☆15Apr 1, 2024Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
uq-project / UQ
View on GitHub
UQ: Assessing Language Models on Unsolved Questions
☆30Aug 26, 2025Updated 10 months ago
skhemlani / mReasoner
View on GitHub
mReasoner is a unified computational implementation of the model theory of thinking and reasoning
☆16Aug 17, 2023Updated 2 years ago
zzli2022 / TLDR
View on GitHub
Code for Research Project TLDR
☆26Jul 28, 2025Updated 11 months ago
ethz-spylab / jailbreak-tax
View on GitHub
☆24Feb 17, 2026Updated 5 months ago
sail-sg / feedback-conditional-policy
View on GitHub
Code for "Language Models Can Learn from Verbal Feedback Without Scalar Rewards"
☆65Jan 5, 2026Updated 6 months ago
shkim0116 / KLASS
View on GitHub
[NeurIPS 2025 Spotlight] Implementation of "KLASS: KL-Guided Fast Inference in Masked Diffusion Models"
☆33Jan 3, 2026Updated 6 months ago
CMU-AIRe / QED-Nano
View on GitHub
Training tiny models to prove hard theorems
☆81Mar 5, 2026Updated 4 months ago
junkangwu / alpha-DPO
View on GitHub
[ICML 2025] Official code of "AlphaDPO: Adaptive Reward Margin for Direct Preference Optimization"
☆31Jan 10, 2026Updated 6 months ago
princeton-pli / LongProc
View on GitHub
LongProc: Benchmarking Long-Context Language Models on Long Procedural Generation
☆36Feb 26, 2026Updated 4 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
uservan / speculative_thinking
View on GitHub
☆34Oct 13, 2025Updated 9 months ago
tg-prplx / vellium
View on GitHub
Local-first desktop AI workbench for roleplay, multi-character chat, long-form writing, RAG, MCP tools, plugins, and local models.
☆105Updated this week
SalesforceAIResearch / swecomm
View on GitHub
☆28Jun 2, 2026Updated last month
shelowize / lvrep-rl
View on GitHub
☆12Mar 17, 2024Updated 2 years ago
princeton-pli / QRHead
View on GitHub
QRHead: Query-Focused Retrieval Heads Improve Long-Context Reasoning and Re-ranking
☆40Jan 20, 2026Updated 6 months ago
BitSecret / HyperGNet
View on GitHub
Geometric Problem Solving Integrating FormalGeo Symbolic System and Hypergraph Neural Network.
☆16Sep 23, 2025Updated 10 months ago
jins7 / LatentEvolve
View on GitHub
☆27Oct 9, 2025Updated 9 months ago