☆51Jul 31, 2025Updated 7 months ago
Alternatives and similar repositories for agentic-benchmarks
Users that are interested in agentic-benchmarks are comparing it to the libraries listed below
Sorting:
- ☆12Jan 2, 2024Updated 2 years ago
- ☆19Mar 14, 2024Updated last year
- ☆22Aug 1, 2021Updated 4 years ago
- Repo for Llatrieval☆31Aug 21, 2024Updated last year
- Code for Paper (Policy Optimization in RLHF: The Impact of Out-of-preference Data)☆28Dec 19, 2023Updated 2 years ago
- ☆17Oct 30, 2025Updated 4 months ago
- A collection of Claude skills I have concocted to optimise my Ops/Product workflows.☆49Feb 12, 2026Updated 3 weeks ago
- Leveraging Base Language Models for Few-Shot Synthetic Data Generation☆40Oct 18, 2025Updated 4 months ago
- ☆57Feb 2, 2026Updated last month
- Beyond Vibe Coding. Code, Planning, Documentation and Product Management agents.☆70Feb 20, 2026Updated 2 weeks ago
- (ICML 2021) Implementation for S2SD - Simultaneous Similarity-based Self-Distillation for Deep Metric Learning. Paper Link: https://arxiv…☆44Sep 18, 2020Updated 5 years ago
- AI agent skill for building modern, composable, and accessible React UI components following the components.build specification☆42Jan 28, 2026Updated last month
- A simple script to plot the Roofline model for given HW platforms and applications☆10Aug 22, 2024Updated last year
- ValTown MCP Server - Execute ValTown functions from AI assistants☆15Aug 12, 2025Updated 6 months ago
- Residual Quantization Autoencoder, used for interpreting LLMs☆14Jan 1, 2025Updated last year
- A Rust-based agent orchestrator enabling a swarm of Claude Code instances building software.☆27Feb 8, 2026Updated 3 weeks ago
- Spark, Cassandra, Tessellation and ArcGIS☆10Jan 18, 2015Updated 11 years ago
- ☆12Aug 15, 2023Updated 2 years ago
- Official repository for "DYPLOC: Dynamic Planning of Content Using Mixed Language Models for Opinion Text Generation"☆10May 20, 2022Updated 3 years ago
- MTalk-Bench: Evaluating Speech-to-Speech Models in Multi-Turn Dialogues via Arena-style and Rubrics Protocols☆17Nov 19, 2025Updated 3 months ago
- TOD-Flow: Modeling the Structure of Task-Oriented Dialogues☆13Feb 7, 2024Updated 2 years ago
- ☆35May 15, 2022Updated 3 years ago
- ☆40May 2, 2021Updated 4 years ago
- This repository is the summary of all of our works for the XLA.☆11Jan 14, 2018Updated 8 years ago
- A simple mobile/native monorepo template w/ a sync engine.☆17Feb 3, 2026Updated last month
- a Video Quality Analysis Toolkit☆13May 16, 2025Updated 9 months ago
- structured attention encoder☆13Jun 6, 2018Updated 7 years ago
- ☆11Sep 8, 2017Updated 8 years ago
- Neural embeddings with negative sampling in Keras☆11Jun 11, 2017Updated 8 years ago
- Public code release for the paper "Reawakening knowledge: Anticipatory recovery from catastrophic interference via structured training"☆11Oct 27, 2025Updated 4 months ago
- Official implementation for “SafeMVDrive: Multi-view Safety-Critical Driving Video Synthesis in the Real World Domain”☆21Dec 11, 2025Updated 2 months ago
- Memory and Context Orchestration for Coding Agents☆28Updated this week
- Simple setup for personal dotfiles☆11Nov 29, 2025Updated 3 months ago
- ☆18Updated this week
- VexFS is a Linux kernel-native file system with built-in vector search and semantic memory. Designed for AI agents, RAG, and LLM workload…☆24Oct 19, 2025Updated 4 months ago
- ☆13Feb 21, 2026Updated 2 weeks ago
- An implementation of Compositional Attention: Disentangling Search and Retrieval by MILA☆14Jun 1, 2022Updated 3 years ago
- [ICLR 2023] Code for our paper "Selective Annotation Makes Language Models Better Few-Shot Learners"☆109Jul 15, 2023Updated 2 years ago
- FamilyTool benchmark☆12Sep 10, 2025Updated 5 months ago