plastic-labs / dspy-opentom
Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset
☆23 · Updated last year
Alternatives and similar repositories for dspy-opentom
Users that are interested in dspy-opentom are comparing it to the libraries listed below
- Explore the use of DSPy for extracting features from PDFs ☆49 · Updated last year
- A framework for pitting LLMs against each other in an evolving library of games ☆34 · Updated 7 months ago
- DSPy program/pipeline inspector widget for Jupyter/VSCode Notebooks. ☆42 · Updated last year
- Writing Blog Posts with Generative Feedback Loops! ☆50 · Updated last year
- Based on the tree of thoughts paper ☆48 · Updated 2 years ago
- ☆15 · Updated 7 months ago
- Very minimal (and stateless) agent framework ☆45 · Updated 10 months ago
- ☆14 · Updated last year
- never forget anything again! combine AI and intelligent tooling for a local knowledge base to track catalogue, annotate, and plan for you… ☆38 · Updated last year
- ☆40 · Updated 11 months ago
- Easily experiment with popular language agents across diverse reasoning/decision-making benchmarks! ☆54 · Updated 4 months ago
- AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an… ☆16 · Updated last month
- Simple Graph Memory for AI applications ☆89 · Updated 6 months ago
- ☆25 · Updated 6 months ago
- ☆55 · Updated last year
- ☆88 · Updated 3 weeks ago
- SCREWS: A Modular Framework for Reasoning with Revisions ☆27 · Updated 2 years ago
- ☆25 · Updated 6 months ago
- ☆51 · Updated last year
- Code for our paper PAPILLON: PrivAcy Preservation from Internet-based and Local Language MOdel ENsembles ☆60 · Updated 6 months ago
- Official Repo for The Paper "Talk Structurally, Act Hierarchically: A Collaborative Framework for LLM Multi-Agent Systems" ☆58 · Updated 9 months ago
- A forest of autonomous agents. ☆19 · Updated 10 months ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025) ☆91 · Updated 10 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna ☆59 · Updated last month
- Deploy your autonomous agents to production grade environments with 99% Uptime Guarantee, Infinite Scalability, and self-healing. ☆48 · Updated last month
- ☆11 · Updated last year
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer ☆44 · Updated last year
- ☆28 · Updated 8 months ago
- ☆67 · Updated 8 months ago
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?" ☆66 · Updated 11 months ago