algopapi / EvoPrompting_Reinforcement_learning
☆23Updated last year
Alternatives and similar repositories for EvoPrompting_Reinforcement_learning:
Users that are interested in EvoPrompting_Reinforcement_learning are comparing it to the libraries listed below
- The open source implementation of "Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers"☆20Updated last year
- Codes for the EMNLP 2023 Findings paper "Self-Polish: Enhance Reasoning in Large Language Models via Problem Refining" by Zhiheng Xi, Sen…☆30Updated last year
- A repository for transformer critique learning and generation☆89Updated last year
- Evaluation of neuro-symbolic engines☆35Updated 8 months ago
- Code repo for MathAgent☆15Updated last year
- Minimal implementation of the Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models paper (ArXiv 20232401.01335)☆29Updated last year
- Repository for the paper: 500xCompressor: Generalized Prompt Compression for Large Language Models☆33Updated 8 months ago
- Interactive coding assistant for data scientists and machine learning developers, empowered by large language models.☆92Updated 6 months ago
- "Enhancing LLM Factual Accuracy with RAG to Counter Hallucinations: A Case Study on Domain-Specific Queries in Private Knowledge-Bases" b…☆43Updated last year
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆73Updated 5 months ago
- A GPT-based generative LM for combined text and math formulas, leveraging tree-based formula encoding.☆35Updated last year
- ☆60Updated 11 months ago
- Neuro-Symbolic Integration Brings Causal and Reliable Reasoning Proofs☆34Updated last year
- Fun project to run your own LLM chat bot using llama.cpp☆11Updated last year
- ☆24Updated 6 months ago
- Small and Efficient Mathematical Reasoning LLMs☆71Updated last year
- ☆87Updated last year
- "Improving Mathematical Reasoning with Process Supervision" by OPENAI☆108Updated last week
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models☆41Updated 10 months ago
- Generate High Quality textual or multi-modal datasets with Agents☆18Updated last year
- Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models☆96Updated last year
- Toy implementation of Strawberry☆31Updated 6 months ago
- ☆81Updated last year
- Official repository for ODQA experiments from Decomposed Prompting: A Modular Approach for Solving Complex Tasks, ICLR23☆10Updated last year
- LLMs as Collaboratively Edited Knowledge Bases☆45Updated last year
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples'☆76Updated last year
- FuseAI Project☆85Updated 2 months ago
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…☆34Updated last year
- ☆32Updated last year
- Code for ICLR 2024 paper "CRAFT: Customizing LLMs by Creating and Retrieving from Specialized Toolsets"☆53Updated 10 months ago