☆33Jan 7, 2025Updated last year
Alternatives and similar repositories for reasoning_generalization
Users that are interested in reasoning_generalization are comparing it to the libraries listed below
Sorting:
- Gemstones: A Model Suite for Multi-Faceted Scaling Laws (NeurIPS 2025)☆33Sep 28, 2025Updated 5 months ago
- Automatic identification of regions in the latent space of a model that correspond to unique concepts, namely to concepts with a semantic…☆14Nov 22, 2023Updated 2 years ago
- Code for Blog Post: Can Better Cold-Start Strategies Improve RL Training for LLMs?☆19Mar 9, 2025Updated 11 months ago
- [NeurIPS 2025] What Makes a Reward Model a Good Teacher? An Optimization Perspective☆42Sep 18, 2025Updated 5 months ago
- ☆19Mar 25, 2025Updated 11 months ago
- [NeurIPS 2025 Datasets & Benchmarks Track] The Illusion of Progress? A Critical Look at Test-Time Adaptation for Vision-Language Models☆33Oct 26, 2025Updated 4 months ago
- ☆18Oct 24, 2020Updated 5 years ago
- [ACL2025 Best Paper] Language Models Resist Alignment☆43Jun 11, 2025Updated 8 months ago
- FROM $f(x)$ AND $g(x)$ TO $f(g(x))$: LLMs Learn New Skills in RL by Composing Old Ones☆64Jan 26, 2026Updated last month
- ☆23Apr 2, 2024Updated last year
- PowerBiMIP is an open-source, efficient bilevel mixed-integer programming (BiMIP) solver, with a special focus on applications in power a…☆34Feb 26, 2026Updated last week
- Unlock level without hassle in Candy Crush Saga☆22Sep 5, 2017Updated 8 years ago
- Code for paper "Parameter Efficient Multi-task Model Fusion with Partial Linearization"☆24Sep 13, 2024Updated last year
- [NeurIPS'24] Official code for *🎯DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving*☆121Dec 10, 2024Updated last year
- GenRM-CoT: Data release for verification rationales☆68Oct 16, 2024Updated last year
- Latest Weight Averaging (NeurIPS HITY 2022)☆32Jun 20, 2023Updated 2 years ago
- Identifying tumor affected scans using Fast.ai and detecting them using openCV☆13Jan 18, 2021Updated 5 years ago
- Simple Scalable Discrete Diffusion for text in PyTorch☆37Sep 27, 2024Updated last year
- Semi-Supervised Offline Reinforcement Learning with Action-Free Trajectories☆42Jul 16, 2023Updated 2 years ago
- ☆38Jan 15, 2025Updated last year
- QuESt Planning is a long-term power system capacity expansion planning model that identifies cost-optimal energy storage, generation, and…☆14Feb 4, 2026Updated last month
- ☆10Feb 13, 2025Updated last year
- Simulating the spread of a rumour in a social network with Python☆10Apr 3, 2020Updated 5 years ago
- A generic Monte Carlo method based on the Gumbel-Max trick.☆32May 31, 2016Updated 9 years ago
- A bibliography and survey of the papers surrounding o1☆1,213Nov 16, 2024Updated last year
- Consensus Based Distributed Stochastic Gradient Descent☆11Jun 24, 2018Updated 7 years ago
- 在监控画质下实现对校园自行车的重识别,包含REID模型识别,向量数据库检索,UI展示☆10Feb 13, 2024Updated 2 years ago
- Example Systems using PowerDynamics.jl☆12Oct 10, 2022Updated 3 years ago
- enuSpace plugin for Tensorflow (graphical logic block, flow programming)☆11Feb 6, 2020Updated 6 years ago
- ☆11Sep 8, 2025Updated 5 months ago
- Towards Memorization-Free Diffusion Models (CVPR2024) Codebase☆12Jun 2, 2024Updated last year
- Visualize linear programming at https://lpviz.net☆33Jan 20, 2026Updated last month
- Source code for my blog DeepNotes☆15May 4, 2023Updated 2 years ago
- ☆12Mar 15, 2023Updated 2 years ago
- ☆11Oct 25, 2024Updated last year
- Repo of paper "Free Process Rewards without Process Labels"☆169Mar 14, 2025Updated 11 months ago
- Few-Shot Relation Extraction with AllenNLP☆13Jan 27, 2019Updated 7 years ago
- ☆16Nov 20, 2024Updated last year
- ☆12Nov 21, 2023Updated 2 years ago