Source code for our paper: "SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals".
☆69Jun 29, 2024Updated last year
Alternatives and similar repositories for SELFGOAL
Users that are interested in SELFGOAL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Source code for our paper: "LoGU: Long-form Generation with Uncertainty Expressions".☆17May 27, 2025Updated 9 months ago
- Source code for our paper: "ARIA: Training Language Agents with Intention-Driven Reward Aggregation".☆28Aug 9, 2025Updated 7 months ago
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"☆156Oct 19, 2024Updated last year
- Source code for our paper: "Put Your Money Where Your Mouth Is: Evaluating Strategic Planning and Execution of LLM Agents in an Auction A…☆49Jan 28, 2024Updated 2 years ago
- Evaluating and improving the faithfulness of the interpretations offered by Neural Module Networks☆13Jun 12, 2023Updated 2 years ago
- The repository of the ACCV 2024 paper "FG-CXR: A Radiologist-Aligned Gaze Dataset for Enhancing Interpretability in Chest X-Ray Report Ge…☆11Jul 28, 2025Updated 7 months ago
- ☆13Aug 29, 2025Updated 6 months ago
- [COLM'24] How Easily do Irrelevant Inputs Skew the Responses of Large Language Models?☆22Oct 13, 2024Updated last year
- Code for our EMNLP 2022 paper: Generative Entity Typing with Curriculum Learning.☆13Aug 19, 2023Updated 2 years ago
- A plugin to use a language model to fill in parts of notes.☆16Feb 20, 2024Updated 2 years ago
- MetricEval: A framework that conceptualizes and operationalizes four main components of metric evaluation, in terms of reliability and va…☆12Nov 6, 2023Updated 2 years ago
- ☆11Jun 21, 2025Updated 9 months ago
- ☆15Oct 28, 2024Updated last year
- Localized questions for VQA☆11May 6, 2025Updated 10 months ago
- Blend through colors as you scroll down the page.☆10Feb 11, 2022Updated 4 years ago
- Experimental tl;dr summaries for datasets on the Hugging Face Hub!☆10Apr 4, 2024Updated last year
- ☆34Jul 13, 2023Updated 2 years ago
- A RAG that can scale 🧑🏻💻☆11May 28, 2024Updated last year
- ☆13Jan 14, 2026Updated 2 months ago
- ☆41Jan 28, 2026Updated last month
- An introduction to theorem proving in Lean for the impatient.☆19Apr 6, 2025Updated 11 months ago
- ☆37Nov 14, 2025Updated 4 months ago
- Public code repo for paper "SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales"☆112Sep 28, 2024Updated last year
- [ACL 2025] RADAR: Enhancing Radiology Report Generation with Supplementary Knowledge Injection☆34Jul 23, 2025Updated 8 months ago
- NLPBench: Evaluating NLP-Related Problem-solving Ability in Large Language Models☆10Oct 27, 2023Updated 2 years ago
- Official code for the paper "ADaPT: As-Needed Decomposition and Planning with Language Models"☆90Jan 3, 2024Updated 2 years ago
- Evaluating Large Language Models with Grid-Based Game Competitions: An Extensible LLM Benchmark and Leaderboard☆25Dec 14, 2024Updated last year
- ☆26May 15, 2024Updated last year
- AloePlayer: a cross-platform local media player.☆17Jan 24, 2026Updated 2 months ago
- 📄 Evidence Retrieval and Claim Verification for the FEVER shared task using Transformer Networks☆12Feb 21, 2020Updated 6 years ago
- Benchmark LLM reasoning capability by solving chess puzzles.☆90Apr 26, 2025Updated 10 months ago
- Code and datasets for the paper "Can Pre-trained Language Models Interpret Similes as Smart as Human?" (ACL 2022)☆14Jan 4, 2023Updated 3 years ago
- Fast and memory-efficient exact attention☆21Mar 13, 2026Updated last week
- Build an AI bot in Discord to serve user's personalized reports on what's up in tech☆28Sep 14, 2025Updated 6 months ago
- ☆13Jan 14, 2022Updated 4 years ago
- Repository of paper Consistency-preserving Visual Question Answering in Medical Imaging (MICCAI2022)☆25Mar 28, 2023Updated 2 years ago
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification☆11Aug 12, 2023Updated 2 years ago
- Sequential planner for large text based environments☆12Dec 13, 2023Updated 2 years ago
- Code and data associated with the AmbiEnt dataset in "We're Afraid Language Models Aren't Modeling Ambiguity" (Liu et al., 2023)☆65Feb 5, 2024Updated 2 years ago