The official GitHub repository of the paper "Recent advances in large language model benchmarks against data contamination: From static to dynamic evaluation"
☆497Mar 3, 2026Updated 3 months ago
Alternatives and similar repositories for Static-to-Dynamic-LLMEval
Users that are interested in Static-to-Dynamic-LLMEval are comparing it to the libraries listed below.
- Official repository of the ICML2025 paper “Dynamic Benchmarking of Reasoning Capabilities in Code Large Language Models Under Data Contam…☆226Dec 23, 2025Updated 5 months ago
- RealMirror, a comprehensive, open-source embodied AI VLA platform.☆799May 18, 2026Updated 3 weeks ago
- Converge AI is an autonomous CLI tool designed to solve "rebase hell" for enterprise teams maintaining long-lived, customized forks of op…☆464Apr 6, 2026Updated 2 months ago
- 🎮 TypeScript game numeric engine for RPG & strategy games. Zero dependencies, type-safe formula parsing, battle system simulation, and e…☆651Dec 30, 2025Updated 5 months ago
- A high-performance desktop application framework for digital humans that works out of the box, integrates AI chat and live wallpapers, and runs digital humans smoothly even on lower-performance devices.☆182Dec 22, 2025Updated 5 months ago
- ☆241May 20, 2026Updated 3 weeks ago
- Cloud-native maturity assessment☆350Updated this week
- Making ANY Software Skill-Native -- Auto-generate production-ready AI Agent Skills for Claude Code, OpenClaw, Codex, and more.☆441Apr 6, 2026Updated 2 months ago
- Skill Compose is an open-source agent builder and runtime platform for skill-powered agents. No workflow graphs. No CLI.☆1,103Mar 4, 2026Updated 3 months ago
- Agents daemon that hides nothing☆717May 26, 2026Updated 3 weeks ago
- Claude Code skill for improving website AEO (AI Engine Optimization) and GEO (Generative Engine Optimization) scores — 16 foundational ch…☆1,228Jun 7, 2026Updated last week
- AI-powered programmatic advertising infrastructure for U.S. SMBs☆475May 8, 2026Updated last month
- Agentic Generative Engine Optimization☆378Feb 24, 2026Updated 3 months ago
- ☆89Oct 6, 2023Updated 2 years ago
- Open-source benchmark for browser AI agents on daily tasks.☆391Updated this week
- Official Implementation of 'OmniCustom: Sync Audio-Video Customization Via Joint Audio-Video Generation Model'☆423May 28, 2026Updated 2 weeks ago
- Give your AI Agent a cloud-native life. Deploy once, converse everywhere.☆272Feb 5, 2026Updated 4 months ago
- ☆107May 11, 2026Updated last month
- [ICCV 2025] Official Implementation of "ProLearn: Alleviating Textual Reliance in Medical Language-guided Segmentation via Prototype-driv…☆55Sep 12, 2025Updated 9 months ago
- Dynamic MBTI Personality Simulation for LLM Agents via Carl Jung's Theory. A framework that enables LLM agents' MBTI personalities to nat…☆1,218Mar 17, 2026Updated 2 months ago
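- An all-in-one Skill for the A-share market from BitSoul, with free historical data included, 100+ mainstream industry factors built in, a complete backtesting framework, MOE-based stock screening and buy/sell decisions, plus fun interfaces such as factor mining. You are welcome to install and try it, and to join in development and discussion!☆555Mar 21, 2026Updated 2 months ago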
- Life Agent RU YEE — An AI-powered life management agent that autonomously handles daily routines including meal planning, grocery …☆830Mar 27, 2026Updated 2 months ago
- HY-Embodied: Embodied Foundation Models for Real-World Agents☆735Updated this week
- 🚀 One-click install and deployment script for OpenClaw | Zero-config installer for OpenClaw - Single command to get started☆104Apr 11, 2026Updated 2 months ago
- ☆241Apr 3, 2026Updated 2 months ago
- HarmonyOS Next innovative capabilities case repo.☆157Dec 18, 2025Updated 5 months ago
- Scaling Autonomous Research in Medical Image Segmentation☆351Apr 14, 2026Updated 2 months ago
- Agent4FaceForgery: Multi-Agent LLM Framework for Realistic Face Forgery☆206Apr 22, 2026Updated last month
- Consulting AI Agent Skill - Auto-generate AI application scenario map reports for any enterprise☆619Apr 1, 2026Updated 2 months ago
- claude code simplified (~2000 Lines)☆351Apr 2, 2026Updated 2 months ago
- A distributed framework for LLM agents☆517Updated this week
- C++ speech recognition inference engine using GGML — CPU/CUDA GPU, real-time microphone streaming, single GGUF model file, no Python dep…☆116Apr 30, 2026Updated last month
- ☆357May 26, 2026Updated 3 weeks ago
- A collection of open-source Agent Skills for content creation — images, audio, and video.☆736Updated this week
- Core abstractions for your agentic workflow☆111Mar 11, 2026Updated 3 months ago
- Test file, please do not call.☆162Apr 26, 2026Updated last month
- ☆161Jun 3, 2026Updated last week
- Watchdog for Grasshopper: prevents Rhino & Grasshopper from freezing due to accidental massive computations or cascading calculation chains…☆140Jan 30, 2026Updated 4 months ago
- MCP server implementation for Google's Gemini API☆143Apr 21, 2026Updated last month