agiresearch / Formal-LLMLinks
Formal-LLM: Integrating Formal Language and Natural Language for Controllable LLM-based Agents
☆133Updated last year
Alternatives and similar repositories for Formal-LLM
Users that are interested in Formal-LLM are comparing it to the libraries listed below
Sorting:
- A library for benchmarking the Long Term Memory and Continual learning capabilities of LLM based agents. With all the tests and code you…☆82Updated last year
- Official implementation of paper "On the Diagram of Thought" (https://arxiv.org/abs/2409.10038)☆189Updated 3 months ago
- ☆105Updated last year
- ☆128Updated 6 months ago
- 🔧 Compare how Agent systems perform on several benchmarks. 📊🚀☆102Updated 4 months ago
- My implementation of "Algorithm of Thoughts: Enhancing Exploration of Ideas in Large Language Models"☆99Updated 2 years ago
- ☆86Updated 2 years ago
- ☆159Updated last year
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments☆94Updated 2 months ago
- Mixing Language Models with Self-Verification and Meta-Verification☆110Updated last year
- Official code for the paper "ADaPT: As-Needed Decomposition and Planning with Language Models"☆90Updated last year
- ☆126Updated last year
- ☆186Updated 10 months ago
- Beating the GAIA benchmark with Transformers Agents. 🚀☆140Updated 10 months ago
- An implemtation of Everyting of Thoughts (XoT).☆156Updated last year
- Multi-Granularity LLM Debugger [ICSE2026]☆94Updated 5 months ago
- LLM reads a paper and produce a working prototype☆60Updated 8 months ago
- A codebase for "Language Models can Solve Computer Tasks"☆238Updated last year
- Complex question answering in LLMs with enhanced reasoning and information-seeking capabilities.☆204Updated 2 years ago
- Source code for our paper: "SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals".☆69Updated last year
- ☆41Updated last year
- WebLINX is a benchmark for building web navigation agents with conversational capabilities☆156Updated 10 months ago
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.☆190Updated 9 months ago
- Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners"☆118Updated 2 months ago
- Harness used to benchmark aider against SWE Bench benchmarks☆78Updated last year
- AWM: Agent Workflow Memory☆372Updated 10 months ago
- ☆63Updated 5 months ago
- ☆144Updated last year
- ☆122Updated last year
- LILO: Library Induction with Language Observations☆89Updated last year