β57Sep 16, 2024Updated last year
Alternatives and similar repositories for natural-plan
Users that are interested in natural-plan are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [NAACL'25] "Revealing the Barriers of Language Agents in Planning"β13Jun 22, 2025Updated 11 months ago
- π€ Code for our EMNLP 2022 paper: "BotsTalk: Machine-sourced Framework for Automatic Curation of Large-scale Multi-skill Dialogue Datasetβ¦β16Oct 7, 2024Updated last year
- [NeurIPS 2025@FoRLM] R1-Compress: Long Chain-of-Thought Compression via Chunk Compression and Searchβ17Jan 24, 2026Updated 4 months ago
- [ICML'24 Spotlight] "TravelPlanner: A Benchmark for Real-World Planning with Language Agents"β520May 24, 2026Updated 3 weeks ago
- β11Sep 19, 2025Updated 8 months ago
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"β54Feb 23, 2024Updated 2 years ago
- Benchmarking Generalization to New Tasks from Natural Language Instructionsβ28Jul 2, 2021Updated 4 years ago
- OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, β¦β19Jun 25, 2024Updated last year
- β10Oct 14, 2023Updated 2 years ago
- The official repository of NeurIPS'25 paper "Ada-R1: From Long-Cot to Hybrid-CoT via Bi-Level Adaptive Reasoning Optimization"β24May 6, 2026Updated last month
- Code for paper "Factual Confidence of LLMs: on Reliability and Robustness of Current Estimators"β16Dec 4, 2024Updated last year
- β14Jul 17, 2025Updated 10 months ago
- β18Aug 19, 2024Updated last year
- β29Feb 17, 2024Updated 2 years ago
- Proton VPN Special Offer - Get 70% off β’ AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Sys2Bench is a benchmarking suite designed to evaluate reasoning and planning capabilities of large language models across algorithmic, lβ¦β31Mar 5, 2025Updated last year
- β37Jan 25, 2024Updated 2 years ago
- This repository houses the code for the paper - "The Neglected of VLMs"β30Dec 31, 2025Updated 5 months ago
- [ACL 2023] Gradient Ascent Post-training Enhances Language Model Generalizationβ29Sep 12, 2024Updated last year
- A package for interfacing with Slay the Spire through Communication Mod, plus a simple AIβ25Jan 13, 2026Updated 5 months ago
- [EACL 2023] CoTEVer: Chain of Thought Prompting Annotation Toolkit for Explanation Verificationβ42Apr 29, 2023Updated 3 years ago
- Resources and paper list for 'Scaling Environments for Agents'. This repository accompanies our survey on how environments contribute to β¦β70Jan 28, 2026Updated 4 months ago
- Repository for Skill Set Optimizationβ14Jul 26, 2024Updated last year
- Codebase for the EMNLP 2021 paper "HittER: Hierarchical Transformers for Knowledge Graph Embeddings".β12Nov 1, 2021Updated 4 years ago
- AI Agents on DigitalOcean Gradient AI Platform β’ AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- β33Nov 30, 2025Updated 6 months ago
- β12Oct 20, 2023Updated 2 years ago
- [EMNLP 2024 Tutorial] Language Agents: Foundations, Prospects, and Risksβ10Nov 27, 2024Updated last year
- official code repo for paper "Merging Models on the Fly Without Retraining: A Sequential Approach to Scalable Continual Model Merging"β25Oct 11, 2025Updated 8 months ago
- Predicting brain activity from word embeddings during natural language comprehensionβ24Feb 20, 2024Updated 2 years ago
- Generate Python docstrings automatically with LLM and syntax treesβ20Jun 13, 2025Updated last year
- This is a niche collection of research papers which are proven to be gradients pushing the field of Natural Language Processing, Deep Leaβ¦β25Nov 19, 2024Updated last year
- A paper list for box embeddingsβ17Jun 9, 2021Updated 5 years ago
- [KDD 2020] This is the code repository for our KDD'20 paper STEAM: Self-Supervised Taxonomy Expansion with Mini-Paths.β18Jul 22, 2020Updated 5 years ago
- Deploy on Railway without the complexity - Free Credits Offer β’ AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Code for the paper "Knowledge-driven Data Construction for Zero-shot Evaluation in Commonsense Question Answering" (AAAI 2021)β30Feb 19, 2021Updated 5 years ago
- β19Dec 25, 2021Updated 4 years ago
- [ICLR2026] NewtonBench: Benchmarking Generalizable Scientific Law Discovery in LLM Agentsβ151Feb 27, 2026Updated 3 months ago
- Source code for our paper: "Put Your Money Where Your Mouth Is: Evaluating Strategic Planning and Execution of LLM Agents in an Auction Aβ¦β49Jan 28, 2024Updated 2 years ago
- Implementation of VQ-VAE with a GPT-style sampler in the JAX and Haiku ecosystem.β11Nov 23, 2023Updated 2 years ago
- BERT score for text generationβ12Jan 15, 2025Updated last year
- β11Jun 11, 2024Updated 2 years ago