The official GitHub repository of the paper "Recent advances in large language model benchmarks against data contamination: From static to dynamic evaluation"
☆505Mar 3, 2026Updated 2 months ago
Alternatives and similar repositories for Static-to-Dynamic-LLMEval
Users that are interested in Static-to-Dynamic-LLMEval are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official repository of the ICML2025 paper “Dynamic Benchmarking of Reasoning Capabilities in Code Large Language Models Under Data Contam…☆235Dec 23, 2025Updated 4 months ago
- RealMirror, a comprehensive, open-source embodied AI VLA platform.☆789Apr 16, 2026Updated 3 weeks ago
- Converge AI is an autonomous CLI tool designed to solve "rebase hell" for enterprise teams maintaining long-lived, customized forks of op…☆484Apr 6, 2026Updated last month
- 🎮 TypeScript game numeric engine for RPG & strategy games. Zero dependencies, type-safe formula parsing, battle system simulation, and e…☆676Dec 30, 2025Updated 4 months ago
- 高性能数字人桌面应用框架,开箱即用,集成了AI对话与动态壁纸,即使在较低性能的设备上也能流畅运行数字人☆181Dec 22, 2025Updated 4 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆244Oct 26, 2025Updated 6 months ago
- Agents daemon that hides nothing☆552Apr 29, 2026Updated last week
- Making ANY Software Skill-Native -- Auto-generate production-ready AI Agent Skills for Claude Code, OpenClaw, Codex, and more.☆386Apr 6, 2026Updated last month
- 云原生成熟度评估☆345Apr 27, 2026Updated last week
- Claude Code skill for improving website AEO (AI Engine Optimization) and GEO (Generative Engine Optimization) scores — 16 foundational ch…☆930Apr 24, 2026Updated last week
- AI-powered programmatic advertising infrastructure for U.S. SMBs☆478Apr 11, 2026Updated 3 weeks ago
- 通晓AI中台-后端能力☆458Feb 28, 2026Updated 2 months ago
- Skill Compose is an open-source agent builder and runtime platform for skill-powered agents. No workflow graphs. No CLI.☆1,108Mar 4, 2026Updated 2 months ago
- Agentic Generative Engine Optimizaiton☆373Feb 24, 2026Updated 2 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆89Oct 6, 2023Updated 2 years ago
- Official Implementation of 'OmniCustom: Sync Audio-Video Customization Via Joint Audio-Video Generation Model'☆422Mar 31, 2026Updated last month
- Give your AI Agent a cloud-native life. Deploy once, converse everywhere.☆272Feb 5, 2026Updated 3 months ago
- ☆107Feb 5, 2026Updated 3 months ago
- Dynamic MBTI Personality Simulation for LLM Agents via Carl Jung's Theory. A framework that enables LLM agents' MBTI personalities to nat…☆927Mar 17, 2026Updated last month
- Life Agent RU YEE — An AI-powered life management agent that autonomously handles daily routines including meal planning, grocery …☆890Mar 27, 2026Updated last month
- 由BitSoul出品的A股市场全能Skill,自带免费历史数据,内置100+行业主流因子,完整的回测框架,基于MOE架构的股票筛选与买卖判断,更提供因子挖矿等趣味接口,欢迎安装试用,也欢共同开发交流!☆524Mar 21, 2026Updated last month
- [ICCV 2025] Official Implementation of "ProLearn: Alleviating Textual Reliance in Medical Language-guided Segmentation via Prototype-driv…☆55Sep 12, 2025Updated 7 months ago
- Open-source trust layer for AI agents — cryptographic agent identity (Ed25519), instance-scoped execution tokens, SHA-256 hash-chained au…☆483Mar 26, 2026Updated last month
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- HY-Embodied: Embodied Foundation Models for Real-World Agents☆720Apr 14, 2026Updated 3 weeks ago
- 🚀 OpenClaw 一键安装部署脚本 | Zero-config installer for OpenClaw - Single command to get started☆100Apr 11, 2026Updated 3 weeks ago
- ☆241Apr 3, 2026Updated last month
- Scaling Autonomous Research in Medical Image Segmentation☆335Apr 14, 2026Updated 3 weeks ago
- HarmonyOS Next innovative capabilities case repo.☆157Dec 18, 2025Updated 4 months ago
- 咨询AI Agent Skill - 为任何企业自动生成 AI 应用场景地图报告 | Auto-generate AI scenario map reports for any enterprise☆574Apr 1, 2026Updated last month
- claude code simplified (~2000 Lines)☆342Apr 2, 2026Updated last month
- A distributed framework for LLM agents☆439Apr 27, 2026Updated last week
- A collection of open-source Agent Skills for content creation — images, audio, and video.☆819Apr 9, 2026Updated 3 weeks ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- C++ speech recognition inference engine using GGML — CPU/CUDA GPU, real-time microphone streaming, single GGUF model file, no Python dep…☆110Apr 30, 2026Updated last week
- Prismer Cloud☆1,209Apr 26, 2026Updated last week
- ☆352Apr 1, 2026Updated last month
- MCP server implementation for Google's Gemini API☆144Apr 21, 2026Updated 2 weeks ago
- Core abstractions for your agentic workflow☆110Mar 11, 2026Updated last month
- Test file, please do not call.☆163Apr 26, 2026Updated last week
- Watchdog for Grasshopper Prevent Rhino & Grasshopper from freezing due to accidental massive computations or cascading calculation chains…☆141Jan 30, 2026Updated 3 months ago