Anjiang-Wei / CodeARCLinks
☆15Updated 3 weeks ago
Alternatives and similar repositories for CodeARC
Users that are interested in CodeARC are comparing it to the libraries listed below
Sorting:
- Code for "Towards Revealing the Mystery behind Chain of Thought: a Theoretical Perspective"☆20Updated last year
- A platform to develop CTM-motivated AI architecture.☆13Updated 2 weeks ago
- ☆17Updated 6 months ago
- ☆71Updated 7 months ago
- Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision☆121Updated 9 months ago
- Test-time-training on nearest neighbors for large language models☆41Updated last year
- ☆49Updated last year
- ☆40Updated last year
- A Sober Look at Language Model Reasoning☆74Updated last week
- This repository contains the implementation of the paper "MeteoRA: Multiple-tasks Embedded LoRA for Large Language Models".☆20Updated last month
- What Makes a Reward Model a Good Teacher? An Optimization Perspective☆32Updated this week
- Preprint: Asymmetry in Low-Rank Adapters of Foundation Models☆35Updated last year
- ☆52Updated last year
- Discriminative Constrained Optimization for Reinforcing Large Reasoning Models☆25Updated 3 weeks ago
- The official implementation of "Self-play LLM Theorem Provers with Iterative Conjecturing and Proving"☆88Updated 2 months ago
- A Framework for LLM-based Multi-Agent Reinforced Training and Inference☆140Updated 2 weeks ago
- LEAP is an end-to-end library designed to support social science research by automatically analyzing user-collected unstructured data in …☆15Updated 4 months ago
- Code for "Reasoning to Learn from Latent Thoughts"☆105Updated 2 months ago
- Lightweight Adapting for Black-Box Large Language Models☆22Updated last year
- Course information for CS598-Topics in LLM Agents(25'Spring) under the direction of Prof. Jiaxuan You ( jiaxuan@illinois.edu ).☆31Updated last month
- Co-Supervised Learning: Improving Weak-to-Strong Generalization with Hierarchical Mixture of Experts☆16Updated last year
- Code for "Echo Chamber: RL Post-training Amplifies Behaviors Learned in Pretraining"☆18Updated 2 months ago
- Code for NeurIPS 2024 paper "Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs"☆37Updated 4 months ago
- Rewarded soups official implementation☆58Updated last year
- Code for the ICML 2024 paper "Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment"☆72Updated 2 weeks ago
- ☆190Updated 3 months ago
- [ICML 2021] This is the official github repo for training L_inf dist nets with high certified accuracy.☆42Updated 3 years ago
- Official code for "Decoding-Time Language Model Alignment with Multiple Objectives".☆25Updated 7 months ago
- [ICLR 2025 Workshop] "Landscape of Thoughts: Visualizing the Reasoning Process of Large Language Models"☆25Updated last week
- Implementation for NeurIPS 2024 oral paper: Divide-and-Conquer Meets Consensus: Unleashing the Power of Functions in Code Generation☆12Updated 5 months ago