The official GitHub repository of the paper "Recent advances in large language model benchmarks against data contamination: From static to dynamic evaluation"
☆498Mar 3, 2026Updated 2 months ago
Alternatives and similar repositories for Static-to-Dynamic-LLMEval
Users that are interested in Static-to-Dynamic-LLMEval are comparing it to the libraries listed below.
- Official repository of the ICML2025 paper “Dynamic Benchmarking of Reasoning Capabilities in Code Large Language Models Under Data Contam…☆225Dec 23, 2025Updated 5 months ago
- RealMirror, a comprehensive, open-source embodied AI VLA platform.☆783May 18, 2026Updated last week
- Converge AI is an autonomous CLI tool designed to solve "rebase hell" for enterprise teams maintaining long-lived, customized forks of op…☆465Apr 6, 2026Updated last month
- 🎮 TypeScript game numeric engine for RPG & strategy games. Zero dependencies, type-safe formula parsing, battle system simulation, and e…☆642Dec 30, 2025Updated 4 months ago
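- A high-performance desktop application framework for digital humans that works out of the box, integrating AI chat and live wallpapers; digital humans run smoothly even on lower-performance devices.☆183Dec 22, 2025Updated 5 months ago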
- ☆238Updated this week
- Cloud-native maturity assessment☆345May 4, 2026Updated 3 weeks ago
- Making ANY Software Skill-Native -- Auto-generate production-ready AI Agent Skills for Claude Code, OpenClaw, Codex, and more.☆435Apr 6, 2026Updated last month
- Claude Code skill for improving website AEO (AI Engine Optimization) and GEO (Generative Engine Optimization) scores — 16 foundational ch…☆968May 18, 2026Updated last week
- Agents daemon that hides nothing☆716Updated this week
- Skill Compose is an open-source agent builder and runtime platform for skill-powered agents. No workflow graphs. No CLI.☆1,104Mar 4, 2026Updated 2 months ago
- AI-powered programmatic advertising infrastructure for U.S. SMBs☆476May 8, 2026Updated 2 weeks ago
- Agentic Generative Engine Optimization☆379Feb 24, 2026Updated 3 months ago
- Open-source benchmark for browser AI agents on 153 everyday online tasks across 144 live websites. 5-layer recording + DOM-match + LLM ju…☆320Updated this week
- ☆89Oct 6, 2023Updated 2 years ago
- Official Implementation of 'OmniCustom: Sync Audio-Video Customization Via Joint Audio-Video Generation Model'☆422Mar 31, 2026Updated last month
- Give your AI Agent a cloud-native life. Deploy once, converse everywhere.☆272Feb 5, 2026Updated 3 months ago
- ☆107May 11, 2026Updated 2 weeks ago
- Dynamic MBTI Personality Simulation for LLM Agents via Carl Jung's Theory. A framework that enables LLM agents' MBTI personalities to nat…☆852Mar 17, 2026Updated 2 months ago
- [ICCV 2025] Official Implementation of "ProLearn: Alleviating Textual Reliance in Medical Language-guided Segmentation via Prototype-driv…☆55Sep 12, 2025Updated 8 months ago
- Life Agent RU YEE — An AI-powered life management agent that autonomously handles daily routines including meal planning, grocery …☆835Mar 27, 2026Updated last month
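- An all-in-one Skill for the A-share market from BitSoul, with free historical data included, 100+ built-in industry-mainstream factors, a complete backtesting framework, and stock screening and buy/sell decisions based on an MOE architecture, plus fun interfaces such as factor mining. You are welcome to install and try it, and to join in development and discussion!☆544Mar 21, 2026Updated 2 months ago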
- HY-Embodied: Embodied Foundation Models for Real-World Agents☆726Apr 14, 2026Updated last month
- 🚀 One-click install and deployment script for OpenClaw | Zero-config installer for OpenClaw - Single command to get started☆104Apr 11, 2026Updated last month
- ☆241Apr 3, 2026Updated last month
- HarmonyOS Next innovative capabilities case repo.☆157Dec 18, 2025Updated 5 months ago
- Scaling Autonomous Research in Medical Image Segmentation☆346Apr 14, 2026Updated last month
- Agent4FaceForgery: Multi-Agent LLM Framework for Realistic Face Forgery☆206Apr 22, 2026Updated last month
- Consulting AI Agent Skill - Auto-generate AI application scenario map reports for any enterprise☆613Apr 1, 2026Updated last month
- claude code simplified (~2000 Lines)☆348Apr 2, 2026Updated last month
- A distributed framework for LLM agents☆437Updated this week
- C++ speech recognition inference engine using GGML — CPU/CUDA GPU, real-time microphone streaming, single GGUF model file, no Python dep…☆113Apr 30, 2026Updated 3 weeks ago
- ☆352Apr 1, 2026Updated last month
- A collection of open-source Agent Skills for content creation — images, audio, and video.☆728Apr 9, 2026Updated last month
- Core abstractions for your agentic workflow☆111Mar 11, 2026Updated 2 months ago
- Test file, please do not call.☆163Apr 26, 2026Updated last month
- Watchdog for Grasshopper: prevent Rhino & Grasshopper from freezing due to accidental massive computations or cascading calculation chains…☆141Jan 30, 2026Updated 3 months ago
- MCP server implementation for Google's Gemini API☆144Apr 21, 2026Updated last month
- ☆161Apr 14, 2025Updated last year