Anjiang-Wei / CodeARCLinks

☆15

Alternatives and similar repositories for CodeARC

Users that are interested in CodeARC are comparing it to the libraries listed below

Sorting:

guyuntian / CoT_benchmark
Code for "Towards Revealing the Mystery behind Chain of Thought: a Theoretical Perspective"
☆20Updated last year
consciousness-lab / ctm-ai
A platform to develop CTM-motivated AI architecture.
☆13Updated 2 weeks ago
BunsenFeng / model_swarm
☆17Updated 6 months ago
thu-wyz / inference_scaling
☆71Updated 7 months ago
Edward-Sun / easy-to-hard
Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision
☆121Updated 9 months ago
socialfoundations / tttlm
Test-time-training on nearest neighbors for large language models
☆41Updated last year
abhishekpanigrahi1996 / Skill-Localization-by-grafting
☆49Updated last year
deeplearning-wisc / args
☆40Updated last year
bethgelab / sober-reasoning
A Sober Look at Language Model Reasoning
☆74Updated last week
NJUDeepEngine / meteora
This repository contains the implementation of the paper "MeteoRA: Multiple-tasks Embedded LoRA for Large Language Models".
☆20Updated last month
princeton-pli / what-makes-good-rm
What Makes a Reward Model a Good Teacher? An Optimization Perspective
☆32Updated this week
Jiacheng-Zhu-AIML / AsymmetryLoRA
Preprint: Asymmetry in Low-Rank Adapters of Foundation Models
☆35Updated last year
Kaffaljidhmah2 / Arxiv-Recommender
☆52Updated last year
Optimization-AI / DisCO
Discriminative Constrained Optimization for Reinforcing Large Reasoning Models
☆25Updated 3 weeks ago
kfdong / STP
The official implementation of "Self-play LLM Theorem Provers with Iterative Conjecturing and Proving"
☆88Updated 2 months ago
TsinghuaC3I / MARTI
A Framework for LLM-based Multi-Agent Reinforced Training and Inference
☆140Updated 2 weeks ago
uiuc-kang-lab / leap
LEAP is an end-to-end library designed to support social science research by automatically analyzing user-collected unstructured data in …
☆15Updated 4 months ago
ryoungj / BoLT
Code for "Reasoning to Learn from Latent Thoughts"
☆105Updated 2 months ago
haotiansun14 / BBox-Adapter
Lightweight Adapting for Black-Box Large Language Models
☆22Updated last year
ulab-uiuc / CS598-Topics-in-LLM-Agents
Course information for CS598-Topics in LLM Agents(25'Spring) under the direction of Prof. Jiaxuan You ( jiaxuan@illinois.edu ).
☆31Updated last month
YuejiangLIU / csl
Co-Supervised Learning: Improving Weak-to-Strong Generalization with Hierarchical Mixture of Experts
☆16Updated last year
rosieyzh / openrlhf-pretrain
Code for "Echo Chamber: RL Post-training Amplifies Behaviors Learned in Pretraining"
☆18Updated 2 months ago
YangRui2015 / Generalizable-Reward-Model
Code for NeurIPS 2024 paper "Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs"
☆37Updated 4 months ago
alexrame / rewardedsoups
Rewarded soups official implementation
☆58Updated last year
YangRui2015 / RiC
Code for the ICML 2024 paper "Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment"
☆72Updated 2 weeks ago
kanishkg / cognitive-behaviors
☆190Updated 3 months ago
zbh2047 / L_inf-dist-net
[ICML 2021] This is the official github repo for training L_inf dist nets with high certified accuracy.
☆42Updated 3 years ago
srzer / MOD
Official code for "Decoding-Time Language Model Alignment with Multiple Objectives".
☆25Updated 7 months ago
tmlr-group / landscape-of-thoughts
[ICLR 2025 Workshop] "Landscape of Thoughts: Visualizing the Reasoning Process of Large Language Models"
☆25Updated last week
cometeme / funcoder
Implementation for NeurIPS 2024 oral paper: Divide-and-Conquer Meets Consensus: Unleashing the Power of Functions in Code Generation
☆12Updated 5 months ago