microsoft / Evoke
Everything for the Paper: 'Evoke: Evoking Critical Thinking Abilities in LLMs via Reviewer-Author Prompt Editing'
☆19 Updated 2 years ago
Alternatives and similar repositories for Evoke
Users who are interested in Evoke are comparing it to the repositories listed below
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset ☆25 Updated last year
- Verifiers for LLM Reinforcement Learning ☆80 Updated 9 months ago
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?" ☆69 Updated last year
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models… ☆40 Updated last year
- ☆32 Updated last year
- ☆55 Updated last year
- Code and Dataset for Learning to Solve Complex Tasks by Talking to Agents ☆24 Updated 3 years ago
- Track the progress of LLM context utilisation ☆55 Updated 9 months ago
- ☆28 Updated 9 months ago
- EMNLP 2024 "Re-reading improves reasoning in large language models". Simply repeating the question to get bidirectional understanding for… ☆28 Updated last year
- ☆67 Updated 10 months ago
- Code for the paper: CodeTree: Agent-guided Tree Search for Code Generation with Large Language Models ☆30 Updated 10 months ago
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training" ☆26 Updated last week
- A framework for pitting LLMs against each other in an evolving library of games ⚔ ☆34 Updated 9 months ago
- ReBase: Training Task Experts through Retrieval Based Distillation ☆29 Updated 11 months ago
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format ☆27 Updated 2 years ago
- ☆20 Updated last month
- Advanced Reasoning Benchmark Dataset for LLMs ☆47 Updated 2 years ago
- Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" in PyTorch and Zeta ☆13 Updated last year
- Data preparation code for CrystalCoder 7B LLM ☆45 Updated last year
- ☆23 Updated last year
- ☆54 Updated 2 weeks ago
- Generate high-quality textual or multi-modal datasets with Agents ☆18 Updated 2 years ago
- ☆29 Updated last month
- 🔔🧠 Easily experiment with popular language agents across diverse reasoning/decision-making benchmarks! ☆53 Updated 6 months ago
- Pre-training code for CrystalCoder 7B LLM ☆57 Updated last year
- Gentopia Agent Zoo and Agent Benchmark ☆31 Updated 2 years ago
- Aioli: A unified optimization framework for language model data mixing ☆32 Updated last year
- ☆20 Updated 9 months ago
- Using multiple LLMs for ensemble Forecasting ☆16 Updated 2 years ago