Source code for our paper: "SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals".
☆70 · Jun 29, 2024 · Updated last year
Alternatives and similar repositories for SELFGOAL
Users that are interested in SELFGOAL are comparing it to the libraries listed below.
- Source code for our paper: "ARIA: Training Language Agents with Intention-Driven Reward Aggregation". ☆30 · Aug 9, 2025 · Updated 10 months ago
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms" ☆166 · Oct 19, 2024 · Updated last year
- Evaluating and improving the faithfulness of the interpretations offered by Neural Module Networks ☆13 · Jun 12, 2023 · Updated 3 years ago
- Code for equipping pretrained language models (BART, GPT-2, XLNet) with commonsense knowledge for generating implicit knowledge statement… ☆15 · Jul 27, 2021 · Updated 4 years ago
- A plugin to use a language model to fill in parts of notes. ☆16 · Feb 20, 2024 · Updated 2 years ago
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator" ☆54 · Feb 23, 2024 · Updated 2 years ago
- MetricEval: A framework that conceptualizes and operationalizes four main components of metric evaluation, in terms of reliability and va… ☆12 · Nov 6, 2023 · Updated 2 years ago
- ☆15 · Oct 28, 2024 · Updated last year
- ☆11 · Jun 21, 2025 · Updated 11 months ago
- Localized questions for VQA ☆12 · May 6, 2025 · Updated last year
- A RAG that can scale 🧑🏻‍💻 ☆11 · May 28, 2024 · Updated 2 years ago
- ☆19 · Apr 18, 2023 · Updated 3 years ago
- ☆13 · Jan 14, 2026 · Updated 5 months ago
- [NAACL'25] "Revealing the Barriers of Language Agents in Planning" ☆13 · Jun 22, 2025 · Updated 11 months ago
- (ACL2025 Findings) Official code for the paper "STeCa: Step-level Trajectory Calibration for LLM Agent Learning" ☆28 · Mar 2, 2026 · Updated 3 months ago
- ☆36 · Nov 14, 2025 · Updated 7 months ago
- Side-channel Analysis ☆20 · May 17, 2022 · Updated 4 years ago
- Public code repo for paper "SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales" ☆113 · Sep 28, 2024 · Updated last year
- NLPBench: Evaluating NLP-Related Problem-solving Ability in Large Language Models ☆10 · Oct 27, 2023 · Updated 2 years ago
- [ACL 2025] RADAR: Enhancing Radiology Report Generation with Supplementary Knowledge Injection ☆34 · Jul 23, 2025 · Updated 10 months ago
- Official code for the paper "ADaPT: As-Needed Decomposition and Planning with Language Models" ☆91 · Jan 3, 2024 · Updated 2 years ago
- [NAACL 2024 Outstanding Paper] Source code for the NAACL 2024 paper entitled "R-Tuning: Instructing Large Language Models to Say 'I Don't… ☆136 · Jul 10, 2024 · Updated last year
- Evaluating Large Language Models with Grid-Based Game Competitions: An Extensible LLM Benchmark and Leaderboard ☆25 · Dec 14, 2024 · Updated last year
- The official repo for DARG: Dynamic Evaluation of Large Language Models via Adaptive Reasoning Graph ☆18 · Oct 13, 2024 · Updated last year
- ☆28 · May 15, 2024 · Updated 2 years ago
- AloePlayer: a cross-platform local media player. ☆17 · Jan 24, 2026 · Updated 4 months ago
- 📄 Evidence Retrieval and Claim Verification for the FEVER shared task using Transformer Networks ☆12 · Feb 21, 2020 · Updated 6 years ago
- A collection of neuro-symbolic systems, papers and videos ☆41 · Jan 12, 2026 · Updated 5 months ago
- Benchmark LLM reasoning capability by solving chess puzzles. ☆91 · Apr 26, 2025 · Updated last year
- Code for our paper LLaMAR: LM-based Long-Horizon Planner for Multi-Agent Robotics ☆35 · Feb 10, 2025 · Updated last year
- Build an AI bot in Discord to serve user's personalized reports on what's up in tech ☆28 · Sep 14, 2025 · Updated 9 months ago
- ☆13 · Jan 14, 2022 · Updated 4 years ago
- Benchmarking Social Intelligence of Language Agents through Interactive Scenarios ☆13 · Jan 4, 2025 · Updated last year
- Repository of paper Consistency-preserving Visual Question Answering in Medical Imaging (MICCAI2022) ☆26 · Mar 28, 2023 · Updated 3 years ago
- [EMNLP 2024] This is the code for our paper "BMRetriever: Tuning Large Language Models as Better Biomedical Text Retrievers". ☆26 · Sep 19, 2024 · Updated last year
- A Python tool for visualizing satellite positions using TLE (Two Line Element) data ☆12 · May 1, 2022 · Updated 4 years ago
- A simple Streamlit application to visualize document chunks and queries in embedding space 🗺️🔍 ☆14 · Apr 15, 2025 · Updated last year
- LLaVa Version of RaDialog ☆26 · May 27, 2025 · Updated last year
- train AI agents to master Free-style Gomoku(五子棋) ☆24 · Mar 2, 2024 · Updated 2 years ago