beeevita / EvoPrompt
Official implementation of the paper Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers
☆129Updated 9 months ago
Alternatives and similar repositories for EvoPrompt:
Users that are interested in EvoPrompt are comparing it to the libraries listed below
- Official Implementation of Dynamic LLM-Agent Network: An LLM-agent Collaboration Framework with Agent Team Optimization☆132Updated 10 months ago
- [ACL2024] Planning, Creation, Usage: Benchmarking LLMs for Comprehensive Tool Utilization in Real-World Complex Scenarios☆53Updated 11 months ago
- ☆101Updated last month
- ☆101Updated 3 months ago
- [ACL 2024] AutoAct: Automatic Agent Learning from Scratch for QA via Self-Planning☆213Updated 2 months ago
- Official github repo for AutoDetect, an automated weakness detection framework for LLMs.☆41Updated 8 months ago
- Repo for paper "Tell Me More! Towards Implicit User Intention Understanding of Language Model Driven Agents"☆49Updated last year
- FireAct: Toward Language Agent Fine-tuning☆271Updated last year
- augmented LLM with self reflection☆115Updated last year
- [NeurIPS 2024 Oral] Aligner: Efficient Alignment by Learning to Correct☆163Updated 2 months ago
- A banchmark list for evaluation of large language models.☆84Updated this week
- Implementation of the ICML 2024 paper "Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning" pr…☆92Updated last year
- Generative Judge for Evaluating Alignment☆230Updated last year
- ☆42Updated 2 months ago
- Code implementation of synthetic continued pretraining☆93Updated 2 months ago
- ☆50Updated 5 months ago
- ☆78Updated this week
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.☆99Updated 4 months ago
- [ICLR 2024] MetaTool Benchmark for Large Language Models: Deciding Whether to Use Tools and Which to Use☆81Updated 11 months ago
- [COLING 2025] ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios☆66Updated 3 months ago
- A new tool learning benchmark aiming at well-balanced stability and reality, based on ToolBench.☆132Updated last week
- ☆176Updated last month
- ☆103Updated last month
- ☆79Updated 5 months ago
- Reformatted Alignment☆114Updated 5 months ago
- What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective☆63Updated 2 weeks ago
- Interpretable Contrastive Monte Carlo Tree Search Reasoning☆45Updated 4 months ago
- Search, Verify and Feedback: Towards Next Generation Post-training Paradigm of Foundation Models via Verifier Engineering☆57Updated 3 months ago