The official GitHub repository of the paper "Recent advances in large language model benchmarks against data contamination: From static to dynamic evaluation"
☆509Mar 3, 2026Updated last month
Alternatives and similar repositories for Static-to-Dynamic-LLMEval
Users that are interested in Static-to-Dynamic-LLMEval are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official repository of the ICML2025 paper “Dynamic Benchmarking of Reasoning Capabilities in Code Large Language Models Under Data Contam…☆236Dec 23, 2025Updated 3 months ago
- RealMirror, a comprehensive, open-source embodied AI VLA platform.☆771Updated this week
- Converge AI is an autonomous CLI tool designed to solve "rebase hell" for enterprise teams maintaining long-lived, customized forks of op…☆471Apr 6, 2026Updated last week
- Claude Code skill for improving website AEO (AI Engine Optimization) and GEO (Generative Engine Optimization) scores — 16 foundational ch…☆700Apr 2, 2026Updated 2 weeks ago
- 🎮 TypeScript game numeric engine for RPG & strategy games. Zero dependencies, type-safe formula parsing, battle system simulation, and e…☆641Dec 30, 2025Updated 3 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- 高性能数字人桌面应用框架,开箱即用,集成了AI对话与动态壁纸,即使在较低性能的设备上也能流畅运行数字人☆180Dec 22, 2025Updated 3 months ago
- 🚀2026年波场TRX靓号地址生成器,USDT钱包靓号生成器,利用 gpu 进行加速,代码开源,安全可靠。TRON vanity address generator, use GPU, opensource, safety, enjoy.☆206Apr 3, 2026Updated last week
- AI-powered programmatic advertising infrastructure for U.S. SMBs☆328Mar 25, 2026Updated 3 weeks ago
- ☆244Oct 26, 2025Updated 5 months ago
- 云原生成熟度评估☆345Apr 3, 2026Updated last week
- Skill Compose is an open-source agent builder and runtime platform for skill-powered agents. No workflow graphs. No CLI.☆1,108Mar 4, 2026Updated last month
- Agents daemon that hides nothing☆461Updated this week
- Agentic Generative Engine Optimizaiton☆372Feb 24, 2026Updated last month
- Life Agent RU YEE — An AI-powered life management agent that autonomously handles daily routines including meal planning, grocery …☆711Mar 27, 2026Updated 2 weeks ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆89Oct 6, 2023Updated 2 years ago
- Official Implementation of 'OmniCustom: Sync Audio-Video Customization Via Joint Audio-Video Generation Model'☆420Mar 31, 2026Updated 2 weeks ago
- Give your AI Agent a cloud-native life. Deploy once, converse everywhere.☆271Feb 5, 2026Updated 2 months ago
- Dynamic MBTI Personality Simulation for LLM Agents via Carl Jung's Theory. A framework that enables LLM agents' MBTI personalities to nat…☆925Mar 17, 2026Updated 3 weeks ago
- ☆107Feb 5, 2026Updated 2 months ago
- [ICCV 2025] Official Implementation of "ProLearn: Alleviating Textual Reliance in Medical Language-guided Segmentation via Prototype-driv…☆55Sep 12, 2025Updated 7 months ago
- 🚀 OpenClaw 一键安装部署脚本 | Zero-config installer for OpenClaw - Single command to get started☆93Updated this week
- HarmonyOS Next innovative capabilities case repo.☆157Dec 18, 2025Updated 3 months ago
- 由BitSoul出品的A股市场全能Skill,自带免费历史数据,内置100+行业主流因子,完整的回测框架,基于MOE架构的股票筛选与买卖判断,更提供因子挖矿等趣味接口,欢迎安装试用,也欢共同开发交流!☆298Mar 21, 2026Updated 3 weeks ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Prismer Cloud☆665Updated this week
- A distributed framework for LLM agents☆439Feb 28, 2026Updated last month
- ☆99Apr 3, 2026Updated last week
- ☆351Apr 1, 2026Updated 2 weeks ago
- MCP server implementation for Google's Gemini API☆143Mar 29, 2026Updated 2 weeks ago
- Core abstractions for your agentic workflow☆110Mar 11, 2026Updated last month
- ☆161Apr 14, 2025Updated last year
- Test file, please do not call.☆163Mar 24, 2026Updated 3 weeks ago
- Watchdog for Grasshopper Prevent Rhino & Grasshopper from freezing due to accidental massive computations or cascading calculation chains…☆141Jan 30, 2026Updated 2 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- drqa code repository☆31Oct 10, 2025Updated 6 months ago
- Self-evolving cognitive memory and context engine for AI agents in Java. Empowering 24/7 proactive agents like OpenClaw with understandin…☆382Updated this week
- The open marketplace for AI agent skills. Discover, publish, and compose skills for AI agents.☆164Apr 7, 2026Updated last week
- DataCompare is a tool designed to compare database data. Currently, the databases it supports stably include: PostgreSQL, Oracle, and MyS…☆329Mar 2, 2026Updated last month
- ☆39Feb 27, 2025Updated last year
- 集成数据分析、流程图生成、浏览器自动化、视频内容总结、服务器监控、智能图表六大智能体,支持自然语言交互和智能路由。☆116Jan 30, 2026Updated 2 months ago
- A general‑purpose AI‑powered examination platform for schools, training providers, enterprises, and online programs. It delivers multi‑di…☆274Sep 28, 2025Updated 6 months ago