OpenCausaLab / CaLM
☆93 Updated 2 months ago
Alternatives and similar repositories for CaLM
Users who are interested in CaLM are comparing it to the libraries listed below.
- ☆140 Updated 4 months ago
- ☆29 Updated 7 months ago
- ☆60 Updated 2 weeks ago
- [NeurIPS 2024 Oral] Aligner: Efficient Alignment by Learning to Correct ☆177 Updated 4 months ago
- Awesome Agent Training ☆131 Updated this week
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey" ☆123 Updated 8 months ago
- A research repo for experiments about Reinforcement Finetuning ☆47 Updated last month
- ☆210 Updated last week
- ☆94 Updated 5 months ago
- [COLM'24] Corex: Pushing the Boundaries of Complex Reasoning through Multi-Model Collaboration ☆28 Updated 7 months ago
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs. ☆123 Updated 2 months ago
- Official codebase for "GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning". ☆73 Updated this week
- A comprehensive collection of process reward models. ☆85 Updated 2 weeks ago
- A trend starts from "Chain of Thought Prompting Elicits Reasoning in Large Language Models". ☆41 Updated last year
- [ACL'24] Chain of Thought (CoT) is significant in improving the reasoning abilities of large language models (LLMs). However, the correla… ☆46 Updated 3 weeks ago
- Implementation of the ICML 2024 paper "Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning" pr… ☆102 Updated last year
- [ACL 2025] A Neural-Symbolic Self-Training Framework ☆109 Updated 2 weeks ago
- ☆42 Updated 2 months ago
- A collection of survey papers and resources related to Large Language Models (LLMs). ☆40 Updated last year
- [ICLR 2025] Benchmarking Agentic Workflow Generation ☆94 Updated 3 months ago
- ☆57 Updated this week
- ☆135 Updated 5 months ago
- ☆193 Updated last week
- A curated reading list for large language model (LLM) alignment. Take a look at our new survey "Large Language Model Alignment: A Survey"… ☆80 Updated last year
- Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision". ☆54 Updated 6 months ago
- [ICLR'25] DataGen: Unified Synthetic Dataset Generation via Large Language Models ☆48 Updated 2 months ago
- Unleashing the Power of Cognitive Dynamics on Large Language Models ☆61 Updated 8 months ago
- The official code of paper “Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning” ☆117 Updated this week
- This is the reading list for the survey "A Survey on the Optimization of LLM-based Agents". We will keep adding papers and improving the… ☆99 Updated 3 weeks ago
- The code and data of DPA-RAG, accepted by WWW 2025 main conference. ☆61 Updated 4 months ago