yecchen / MIRAI
Code and Data for "MIRAI: Evaluating LLM Agents for Event Forecasting"
☆55 · Updated 4 months ago
Related projects
Alternatives and complementary repositories for MIRAI
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models… ☆30 · Updated 9 months ago
- [KDD 2024] A project for training explicit graph reasoning large language models. ☆39 · Updated 6 months ago
- Code/data for MARG (multi-agent review generation) ☆33 · Updated last week
- Flow of Reasoning: Training LLMs for Divergent Problem Solving with Minimal Examples ☆39 · Updated last month
- Generating diverse counterfactual data for Natural Language Understanding tasks using Large Language Models (LLMs). The generator support… ☆35 · Updated last year
- [EMNLP 2023] Knowledge Rumination for Pre-trained Language Models ☆17 · Updated last year
- Code implementation of synthetic continued pretraining ☆60 · Updated last month
- [NeurIPS 2024] Official implementation for paper "Can Graph Learning Improve Planning in LLM-based Agents?" ☆85 · Updated last week
- Tree prompting: easy-to-use scikit-learn interface for improved prompting. ☆33 · Updated last year
- [ACL 2024] The project of Symbol-LLM ☆42 · Updated 4 months ago
- Code for "Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from a Parametric Perspective" ☆31 · Updated 6 months ago
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator" ☆49 · Updated 9 months ago
- Code repo for MathAgent ☆13 · Updated 11 months ago
- InstructRAG: Instructing Retrieval-Augmented Generation via Self-Synthesized Rationales ☆54 · Updated last week
- ☆14 · Updated last month
- Repository for paper Tools Are Instrumental for Language Agents in Complex Environments ☆33 · Updated last month
- ☆41 · Updated last month
- [NAACL 2024] Enhancing Chain-of-Thoughts Prompting with Iterative Bootstrapping in Large Language Models ☆82 · Updated 8 months ago
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs. ☆63 · Updated last month
- Accompanying code for "Boosted Prompt Ensembles for Large Language Models" ☆28 · Updated last year
- Aligning with Human Judgement: The Role of Pairwise Preference in Large Language Model Evaluators (Liu et al.; arXiv preprint arXiv:2403.… ☆37 · Updated 4 months ago
- Repository for NPHardEval, a quantified-dynamic benchmark of LLMs ☆48 · Updated 7 months ago
- [ACL 2024] Exploring Collaboration Mechanisms for LLM Agents: A Social Psychology View ☆102 · Updated 6 months ago
- The code for "Can Large Language Model Agents Simulate Human Trust Behaviors?" ☆41 · Updated 2 weeks ago
- [ACL'24] Chain of Thought (CoT) is significant in improving the reasoning abilities of large language models (LLMs). However, the correla… ☆31 · Updated 2 months ago
- ☆103 · Updated 4 months ago
- The official repository of "Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint" ☆33 · Updated 10 months ago
- Public code repo for paper "SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales" ☆96 · Updated last month
- Personalized Soups: Personalized Large Language Model Alignment via Post-hoc Parameter Merging ☆98 · Updated last year
- [EMNLP 2024] Ask-before-Plan: Proactive Language Agents for Real-World Planning ☆13 · Updated 3 weeks ago