AgentForceTeamOfficial / Baby-AIGS
Official Implementation of the Baby-AIGS system
☆19 Updated 3 weeks ago
Alternatives and similar repositories for Baby-AIGS:
Users who are interested in Baby-AIGS are comparing it to the repositories listed below.
- Official Implementation of UA^{2}-Agent and other baseline algorithms of "Towards Unified Alignment Between Agents, Humans, and Environme… ☆13 Updated last month
- Flow of Reasoning: Training LLMs for Divergent Problem Solving with Minimal Examples ☆46 Updated 2 weeks ago
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator" ☆52 Updated 9 months ago
- Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference) ☆104 Updated last month
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs. ☆73 Updated 2 months ago
- [ACL 2024] <Large Language Models for Automated Open-domain Scientific Hypotheses Discovery>. It has also received the best poster award … ☆36 Updated last month
- [NeurIPS 2024] Agent Planning with World Knowledge Model ☆74 Updated this week
- Code for paper "Optima: Optimizing Effectiveness and Efficiency for LLM-Based Multi-Agent System" ☆41 Updated last month
- Repository for paper Tools Are Instrumental for Language Agents in Complex Environments ☆33 Updated 2 months ago
- Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision ☆98 Updated 3 months ago
- Are LLMs Capable of Data-based Statistical and Causal Reasoning? Benchmarking Advanced Quantitative Reasoning with Data ☆31 Updated 4 months ago
- Official Implementation for EMNLP 2024 (main) "AgentReview: Exploring Academic Peer Review with LLM Agent." ☆39 Updated last month
- Interpretable Contrastive Monte Carlo Tree Search Reasoning ☆35 Updated last month
- [NeurIPS 2024 D&B Track] GTA: A Benchmark for General Tool Agents ☆48 Updated last month
- ICML 2024 - Official Repository for EXO: Towards Efficient Exact Optimization of Language Model Alignment ☆46 Updated 6 months ago
- Evaluate the Quality of Critique ☆35 Updated 6 months ago
- Structured Chemistry Reasoning with Large Language Models ☆31 Updated 7 months ago
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms" ☆76 Updated last month
- ☆36 Updated last month
- Code for Paper: Autonomous Evaluation and Refinement of Digital Agents [COLM 2024] ☆100 Updated 3 weeks ago
- Natural Language Reinforcement Learning ☆54 Updated this week
- [NeurIPS'24] Official code for *🎯DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving* ☆85 Updated last week
- Code accompanying the paper "Noise Contrastive Alignment of Language Models with Explicit Rewards" (NeurIPS 2024) ☆37 Updated last month
- On The Planning Abilities of OpenAI's o1 Models: Feasibility, Optimality, and Generalizability ☆26 Updated last month
- Repo of paper "Free Process Rewards without Process Labels" ☆26 Updated last week
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models… ☆31 Updated 10 months ago
- Implementation of the ICML 2024 paper "Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning" pr… ☆75 Updated 10 months ago
- Implementation of the Quiet-STAR paper (https://arxiv.org/pdf/2403.09629.pdf) ☆44 Updated 4 months ago
- Search, Verify and Feedback: Towards Next Generation Post-training Paradigm of Foundation Models via Verifier Engineering ☆45 Updated last week
- [ACL 2024] The project of Symbol-LLM ☆43 Updated 5 months ago