Source code for our paper: "SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals".
☆70Jun 29, 2024Updated 2 years ago
Alternatives and similar repositories for SELFGOAL
Users that are interested in SELFGOAL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Source code for our paper: "LoGU: Long-form Generation with Uncertainty Expressions".☆19May 27, 2025Updated last year
- Resources for our ACL 2023 paper: Distilling Script Knowledge from Large Language Models for Constrained Language Planning☆36Aug 19, 2023Updated 2 years ago
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"☆167Oct 19, 2024Updated last year
- Source code for our paper: "Put Your Money Where Your Mouth Is: Evaluating Strategic Planning and Execution of LLM Agents in an Auction A…☆49Jan 28, 2024Updated 2 years ago
- Evaluating and improving the faithfulness of the interpretations offered by Neural Module Networks☆13Jun 12, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- The repository of the ACCV 2024 paper "FG-CXR: A Radiologist-Aligned Gaze Dataset for Enhancing Interpretability in Chest X-Ray Report Ge…☆11Jul 28, 2025Updated 11 months ago
- ☆13Aug 29, 2025Updated 10 months ago
- [COLM'24] How Easily do Irrelevant Inputs Skew the Responses of Large Language Models?☆23Oct 13, 2024Updated last year
- ☆14Aug 18, 2022Updated 3 years ago
- Code for our EMNLP 2022 paper: Generative Entity Typing with Curriculum Learning.☆13Aug 19, 2023Updated 2 years ago
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"☆54Feb 23, 2024Updated 2 years ago
- MetricEval: A framework that conceptualizes and operationalizes four main components of metric evaluation, in terms of reliability and va…☆12Nov 6, 2023Updated 2 years ago
- Localized questions for VQA☆12May 6, 2025Updated last year
- ☆36May 24, 2026Updated last month
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A RAG that can scale 🧑🏻💻☆11May 28, 2024Updated 2 years ago
- ☆19Apr 18, 2023Updated 3 years ago
- [NAACL'25] "Revealing the Barriers of Language Agents in Planning"☆13Jun 22, 2025Updated last year
- (ACL2025 Findings) Official code for the paper "STeCa: Step-level Trajectory Calibration for LLM Agent Learning"☆29Mar 2, 2026Updated 4 months ago
- ☆46Jan 28, 2026Updated 5 months ago
- Side-channel Analysis☆20May 17, 2022Updated 4 years ago
- Public code repo for paper "SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales"☆113Sep 28, 2024Updated last year
- NLPBench: Evaluating NLP-Related Problem-solving Ability in Large Language Models☆10Oct 27, 2023Updated 2 years ago
- [ACL 2025] RADAR: Enhancing Radiology Report Generation with Supplementary Knowledge Injection☆34Jul 23, 2025Updated 11 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Official code for the paper "ADaPT: As-Needed Decomposition and Planning with Language Models"☆91Jan 3, 2024Updated 2 years ago
- [NAACL 2024 Outstanding Paper] Source code for the NAACL 2024 paper entitled "R-Tuning: Instructing Large Language Models to Say 'I Don't…☆137Jul 10, 2024Updated last year
- The official repo for DARG: Dynamic Evaluation of Large Language Models via Adaptive Reasoning Graph☆18Oct 13, 2024Updated last year
- ☆20Jan 3, 2025Updated last year
- Microsoft Complex Tasks Dataset☆17Jun 12, 2023Updated 3 years ago
- Bootstrap (Linear) Thompson Sampling☆13Jun 30, 2016Updated 10 years ago
- Fast and memory-efficient exact attention☆22Jun 26, 2026Updated last week
- Build an AI bot in Discord to serve user's personalized reports on what's up in tech☆28Sep 14, 2025Updated 9 months ago
- ☆13Jan 14, 2022Updated 4 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Sequential planner for large text based environments☆12Dec 13, 2023Updated 2 years ago
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification☆11Aug 12, 2023Updated 2 years ago
- Benchmarking Social Intelligence of Language Agents through Interactive Scenarios☆13Jan 4, 2025Updated last year
- Repository of paper Consistency-preserving Visual Question Answering in Medical Imaging (MICCAI2022)☆26Mar 28, 2023Updated 3 years ago
- EarthKAM Explorer. Web-based 3D exploration of satellite images taken by middle school students through the ISS EarthKAM program. Devel…☆18Jun 14, 2017Updated 9 years ago
- [EMNLP 2024] This is the code for our paper "BMRetriever: Tuning Large Language Models as Better Biomedical Text Retrievers".☆26Sep 19, 2024Updated last year
- A Python tool for visualizing satellite positions using TLE (Two Line Element) data☆12May 1, 2022Updated 4 years ago