tsukushiAI / self-organized-agentLinks
A LLM Multi-Agent Framework toward Ultra Large-Scale Code Generation and Optimization
☆15Updated 9 months ago
Alternatives and similar repositories for self-organized-agent
Users that are interested in self-organized-agent are comparing it to the libraries listed below
Sorting:
- ☆20Updated 10 months ago
- ☆28Updated 11 months ago
- CodeUltraFeedback: aligning large language models to coding preferences (TOSEM 2025)☆72Updated last year
- Code and Data for "MIRAI: Evaluating LLM Agents for Event Forecasting"☆76Updated last year
- ☆10Updated 2 years ago
- DataSciBench: An LLM Agent Benchmark for Data Science☆33Updated 3 weeks ago
- This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Updated last year
- The official repository of "SmartAgent: Chain-of-User-Thought for Embodied Personalized Agent in Cyber World".☆29Updated last month
- StepCoder: Improve Code Generation with Reinforcement Learning from Compiler Feedback☆68Updated last year
- ☆28Updated 8 months ago
- ☆19Updated 11 months ago
- SCoRe: Training Language Models to Self-Correct via Reinforcement Learning☆12Updated 7 months ago
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…☆36Updated last year
- The open source implementation of "Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers"☆19Updated last year
- ☆16Updated last year
- Tree prompting: easy-to-use scikit-learn interface for improved prompting.☆41Updated last year
- Code repo for MathAgent☆17Updated last year
- ☆17Updated 2 months ago
- Source code for GreaTer ICLR 2025 - Gradient Over Reasoning makes Smaller Language Models Strong Prompt Optimizers☆32Updated 5 months ago
- AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories☆37Updated last month
- ☆19Updated 3 months ago
- ☆18Updated last year
- ☆78Updated last year
- Official Implementation of UA^{2}-Agent and other baseline algorithms of "Towards Unified Alignment Between Agents, Humans, and Environme…☆19Updated 10 months ago
- Leveraging Outputs of Large Language Model as Feedback for Dynamic Reranking in Retrieval-Augmented Generation☆36Updated 2 months ago
- ReasonFlux-Coder: Open-Source LLM Coders with Co-Evolving Reinforcement Learning☆115Updated 3 weeks ago
- Code for "Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from a Parametric Perspective"☆32Updated last year
- Implementation and datasets for "Training Language Models to Generate Quality Code with Program Analysis Feedback"☆30Updated 2 months ago
- [EMNLP'24 (Main)] DRPO(Dynamic Rewarding with Prompt Optimization) is a tuning-free approach for self-alignment. DRPO leverages a search-…☆24Updated 10 months ago
- [NAACL 2024] Enhancing Chain-of-Thoughts Prompting with Iterative Bootstrapping in Large Language Models☆86Updated last year