☆25May 28, 2025Updated 9 months ago
Alternatives and similar repositories for agent-evals
Users that are interested in agent-evals are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code to accompany the paper "Mismatched No More: Joint Model-Policy Optimization for Model-Based RL"☆21Oct 6, 2021Updated 4 years ago
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset☆27Mar 6, 2024Updated 2 years ago
- Developer showcase of projects built on Cartesia☆20Aug 28, 2024Updated last year
- Benchmarking of 1D pattern classification networks☆10Jul 19, 2023Updated 2 years ago
- Symbolic Regression from Scratch with Python☆13Dec 6, 2022Updated 3 years ago
- ☆26May 15, 2024Updated last year
- A Rust implementation of Yolo for object detection and tracking.☆10Nov 17, 2022Updated 3 years ago
- Code for Paper: Autonomous Evaluation and Refinement of Digital Agents [COLM 2024]☆148Nov 26, 2024Updated last year
- Example code using the DSPy framework.☆20May 30, 2024Updated last year
- Generate Python docstrings automatically with LLM and syntax trees☆20Jun 13, 2025Updated 9 months ago
- Creating Generative AI Apps which work☆17Apr 14, 2025Updated 11 months ago
- A Declarative Language for Expressing Partial World Knowledge to Reinforcement Learning Agents☆17Jan 19, 2024Updated 2 years ago
- Spatial Spectral Machine Learning☆14Oct 15, 2025Updated 5 months ago
- [EMNLP'21] Plan-then-Generate: Controlled Data-to-Text Generation via Planning☆76Jun 15, 2022Updated 3 years ago
- This is the repository for paper EscapeBench: Pushing Language Models to Think Outside the Box☆18Dec 19, 2024Updated last year
- LMQL implementation of tree of thoughts☆36Jan 31, 2024Updated 2 years ago
- Scripts to create the MLB dataset introduced in the paper Data-to-text Generation with Entity Modeling☆14Feb 9, 2021Updated 5 years ago
- Fine-grained attention in hierarchical transformers for tabular time-series.☆12Dec 24, 2024Updated last year
- LOCATE: Localize and Transfer Object Parts for Weakly Supervised Affordance Grounding (CVPR 2023)☆47Apr 28, 2023Updated 2 years ago
- [COLING2020] A challenge dataset for Person SenTiment analysis in news domain.☆11May 2, 2022Updated 3 years ago
- Official repository of the paper MPMQA: Multimodal Question Answering on Product Manuals (AAAI 2023)☆19Nov 28, 2022Updated 3 years ago
- Official code for ICML 2024 paper "An Unsupervised Approach for Periodic Source Detection in Time Series"☆13Feb 21, 2025Updated last year
- ☆18Jul 15, 2019Updated 6 years ago
- Vast-ai public repository for open sourced tools, plugins, etc.☆16Nov 4, 2024Updated last year
- ☆15Mar 26, 2024Updated last year
- Code for TACL 2022 paper on Data-to-text Generation with Variational Sequential Planning☆21Apr 25, 2022Updated 3 years ago
- Programming by Demonstration for Fetch☆16Aug 8, 2017Updated 8 years ago
- my slides for an advanced algorithms course☆14Apr 26, 2025Updated 10 months ago
- ☆16Mar 2, 2019Updated 7 years ago
- Simple IoT project using Azure IoT Hub and showing a device running node to send telemetry data and that is analyzed by Azure IoT service…☆10Jul 13, 2017Updated 8 years ago
- ☆10Oct 24, 2024Updated last year
- Repository containing starters templates to be used within Kodu☆15Sep 26, 2024Updated last year
- This service integrates Python node invocation with TypeScript and litegraph.js, offering easy setup and ComfyUI compatibility. It simpli…☆12Jan 20, 2024Updated 2 years ago
- [IEEE-TITS] Official implementation of paper "A Survey on the Application of Large Language Models in Scenario-Based Testing of Automated…☆27Jan 23, 2026Updated 2 months ago
- LLM-driven automated knowledge graph construction from text using DSPy and Neo4j☆18Aug 19, 2024Updated last year
- GPT for FACodec☆13Mar 25, 2024Updated last year
- ☆29Oct 24, 2025Updated 5 months ago
- ☆18Dec 17, 2022Updated 3 years ago
- ☆17Jun 7, 2024Updated last year