algopapi / EvoPrompting_Reinforcement_learningLinks
☆24Updated 2 years ago
Alternatives and similar repositories for EvoPrompting_Reinforcement_learning
Users that are interested in EvoPrompting_Reinforcement_learning are comparing it to the libraries listed below
Sorting:
- Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models☆97Updated last year
- ☆46Updated last year
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…☆36Updated last year
- Minimal implementation of the Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models paper (ArXiv 20232401.01335)☆29Updated last year
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"☆120Updated 9 months ago
- Interactive coding assistant for data scientists and machine learning developers, empowered by large language models.☆95Updated 10 months ago
- Small and Efficient Mathematical Reasoning LLMs☆71Updated last year
- The open source implementation of "Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers"☆19Updated last year
- An Implementation of "Orca: Progressive Learning from Complex Explanation Traces of GPT-4"☆43Updated 9 months ago
- ☆88Updated last year
- ☆66Updated 4 months ago
- Codes for the EMNLP 2023 Findings paper "Self-Polish: Enhance Reasoning in Large Language Models via Problem Refining" by Zhiheng Xi, Sen…☆30Updated 2 years ago
- ☆32Updated last year
- Neuro-Symbolic Integration Brings Causal and Reliable Reasoning Proofs☆38Updated last year
- Code repo for MathAgent☆17Updated last year
- ☆24Updated 10 months ago
- Scripts for generating synthetic finetuning data for reducing sycophancy.☆113Updated last year
- Implementation of Toolformer: Language Models Can Teach Themselves to Use Tools☆140Updated 2 years ago
- My implementation of "Algorithm of Thoughts: Enhancing Exploration of Ideas in Large Language Models"☆97Updated last year
- [ICML 2025] Flow of Reasoning: Training LLMs for Divergent Reasoning with Minimal Examples☆104Updated 2 weeks ago
- ☆134Updated last year
- Official implementation of paper "On the Diagram of Thought" (https://arxiv.org/abs/2409.10038)☆184Updated 4 months ago
- ☆84Updated last year
- ☆49Updated last year
- Advanced Reasoning Benchmark Dataset for LLMs☆47Updated last year
- ☆59Updated 8 months ago
- [ICML 2023] "Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation", Wenqing Zheng, S P Sharan, Ajay Kumar Jaiswal, …☆40Updated last year
- Open Implementations of LLM Analyses☆105Updated 10 months ago
- Formal-LLM: Integrating Formal Language and Natural Language for Controllable LLM-based Agents☆125Updated last year
- Understanding the correlation between different LLM benchmarks☆29Updated last year