MurtyShikhar / NNetnavLinks
Interaction-first method for generating demonstrations for web-agents on any website
☆51Updated 7 months ago
Alternatives and similar repositories for NNetnav
Users that are interested in NNetnav are comparing it to the libraries listed below
Sorting:
- ☆136Updated 8 months ago
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?"☆66Updated 11 months ago
- Official Repo for CRMArena and CRMArena-Pro☆126Updated 3 weeks ago
- ☆41Updated last year
- ☆62Updated 5 months ago
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments☆93Updated last month
- WorkArena: How Capable are Web Agents at Solving Common Knowledge Work Tasks?☆220Updated this week
- WebLINX is a benchmark for building web navigation agents with conversational capabilities☆156Updated 9 months ago
- Framework and toolkits for building and evaluating collaborative agents that can work together with humans.☆110Updated last month
- ☆84Updated last year
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆91Updated 10 months ago
- ☆190Updated last week
- Formal-LLM: Integrating Formal Language and Natural Language for Controllable LLM-based Agents☆131Updated last year
- Repository for the paper Stream of Search: Learning to Search in Language☆151Updated 9 months ago
- A library for benchmarking the Long Term Memory and Continual learning capabilities of LLM based agents. With all the tests and code you…☆80Updated 11 months ago
- Dynamic Cheatsheet: Test-Time Learning with Adaptive Memory☆200Updated 6 months ago
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.☆173Updated 10 months ago
- TapeAgents is a framework that facilitates all stages of the LLM Agent development lifecycle☆298Updated 3 weeks ago
- AgentLab: An open-source framework for developing, testing, and benchmarking web agents on diverse tasks, designed for scalability and re…☆479Updated this week
- Automating enterprise workflows with multimodal agents☆112Updated last year
- ☆35Updated 6 months ago
- ☆86Updated last year
- Code for the paper 🌳 Tree Search for Language Model Agents☆216Updated last year
- Code for our paper PAPILLON: PrivAcy Preservation from Internet-based and Local Language MOdel ENsembles☆60Updated 6 months ago
- Systematic evaluation framework that automatically rates overthinking behavior in large language models.☆94Updated 6 months ago
- accompanying material for sleep-time compute paper☆117Updated 7 months ago
- 🔧 Compare how Agent systems perform on several benchmarks. 📊🚀☆102Updated 3 months ago
- Run SWE-bench evaluations remotely☆44Updated 3 months ago
- Official Repo for The Paper "Talk Structurally, Act Hierarchically: A Collaborative Framework for LLM Multi-Agent Systems"☆58Updated 9 months ago
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖☆76Updated 11 months ago