seacowx / OpenToM
The official repository of the OpenToM dataset
☆19 Updated 11 months ago
Alternatives and similar repositories for OpenToM:
Users who are interested in OpenToM are comparing it to the libraries listed below:
- ☆81 Updated last year
- Just a bunch of benchmark logs for different LLMs ☆117 Updated 6 months ago
- A set of utilities for running few-shot prompting experiments on large-language models ☆116 Updated last year
- [NeurIPS 2023] PyTorch code for Can Language Models Teach? Teacher Explanations Improve Student Performance via Theory of Mind ☆67 Updated last year
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments ☆69 Updated 3 months ago
- Codebase accompanying the Summary of a Haystack paper. ☆74 Updated 4 months ago
- Official code for the paper "ADaPT: As-Needed Decomposition and Planning with Language Models" ☆74 Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute… ☆48 Updated 6 months ago
- Camel-Coder: Collaborative task completion with multiple agents. Role-based prompts, intervention mechanism, and thoughtful suggestions ☆33 Updated last year
- ☆48 Updated 2 months ago
- ☆20 Updated last year
- The first dense retrieval model that can be prompted like an LM ☆64 Updated 4 months ago
- clean up your LLM datasets ☆113 Updated last year
- ☆65 Updated 10 months ago
- Track the progress of LLM context utilisation ☆53 Updated 6 months ago
- Minimal implementation of the Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models paper (arXiv:2401.01335) ☆29 Updated 10 months ago
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?" ☆47 Updated last month
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment ☆53 Updated 5 months ago
- Official code for ACL 2023 (short, findings) paper "Recursion of Thought: A Divide and Conquer Approach to Multi-Context Reasoning with L… ☆42 Updated last year
- ☆80 Updated 3 weeks ago
- Evaluating LLMs with CommonGen-Lite ☆88 Updated 10 months ago
- Code for the ICLR 2024 paper "How to catch an AI liar: Lie detection in black-box LLMs by asking unrelated questions" ☆64 Updated 7 months ago
- Reinforcement Learning with Heuristic Imperatives - Finetuning LLMs for Post-Conventional Moral Intuition ☆64 Updated last year
- Source code for our paper: "SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals". ☆66 Updated 7 months ago
- ☆47 Updated 2 months ago
- Source code for our paper: "Put Your Money Where Your Mouth Is: Evaluating Strategic Planning and Execution of LLM Agents in an Auction A… ☆43 Updated last year
- ☆48 Updated last year
- ☆24 Updated last year
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms" ☆79 Updated 3 months ago
- Based on the tree of thoughts paper ☆46 Updated last year