OpenCausaLab / CaLM
☆93Updated last month
Alternatives and similar repositories for CaLM
Users that are interested in CaLM are comparing it to the libraries listed below
Sorting:
- ☆133Updated 3 months ago
- [NeurIPS 2024 Oral] Aligner: Efficient Alignment by Learning to Correct☆171Updated 3 months ago
- ☆55Updated 7 months ago
- Enhances Overleaf by allowing article searches and BibTeX retrieval from DBLP and Google Scholar | 通过允许从 DBLP 和 Google Scholar 进行文章搜索和获取 …☆68Updated last month
- A comprehensive collection of process reward models.☆76Updated last week
- ☆133Updated 4 months ago
- [ICLR 2025] Benchmarking Agentic Workflow Generation☆92Updated 2 months ago
- AI Alignment: A Comprehensive Survey☆133Updated last year
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"☆116Updated 7 months ago
- Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference)☆138Updated 6 months ago
- ☆102Updated 5 months ago
- [ACL'24] Chain of Thought (CoT) is significant in improving the reasoning abilities of large language models (LLMs). However, the correla…☆46Updated this week
- ☆39Updated 8 months ago
- Official codebase for "GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning".☆72Updated 2 weeks ago
- ☆27Updated 2 months ago
- [Preprint] A Neural-Symbolic Self-Training Framework☆107Updated last month
- Unleashing the Power of Cognitive Dynamics on Large Language Models☆61Updated 7 months ago
- A trend starts from "Chain of Thought Prompting Elicits Reasoning in Large Language Models".☆41Updated last year
- [COLM'24] Corex: Pushing the Boundaries of Complex Reasoning through Multi-Model Collaboration☆28Updated 6 months ago
- Official code for the paper, "Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning"☆113Updated last week
- This is the reading list for the survey "A Survey on the Optimization of LLM-based Agents ". We will keep adding papers and improving the…☆93Updated 3 weeks ago
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.☆119Updated last month
- Awesome Agent Training☆106Updated this week
- ☆153Updated last month
- Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision".☆52Updated 5 months ago
- ☆55Updated 2 months ago
- MPO: Boosting LLM Agents with Meta Plan Optimization☆51Updated 2 months ago
- Repo of "Large Language Model-based Human-Agent Collaboration for Complex Task Solving(EMNLP2024 Findings)"☆32Updated 7 months ago
- A collection of survey papers and resources related to Large Language Models (LLMs).☆40Updated last year
- A research repo for experiments about Reinforcement Finetuning☆46Updated last month