SeekingDream/Static-to-Dynamic-LLMEval

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/SeekingDream/Static-to-Dynamic-LLMEval)

SeekingDream / Static-to-Dynamic-LLMEval

The official GitHub repository of the paper "Recent advances in large language model benchmarks against data contamination: From static to dynamic evaluation"

☆500

Alternatives and similar repositories for Static-to-Dynamic-LLMEval

Users that are interested in Static-to-Dynamic-LLMEval are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

sdfa66065-lang / convergeai
View on GitHub
Converge AI is an autonomous CLI tool designed to solve "rebase hell" for enterprise teams maintaining long-lived, customized forks of op…
☆463Updated this week
BitSoulTech / BitSoulStockSkill
View on GitHub
由BitSoul出品的A股市场全能Skill，自带免费历史数据，内置100+行业主流因子，完整的回测框架，基于MOE架构的股票筛选与买卖判断，更提供因子挖矿等趣味接口，欢迎安装试用，也欢共同开发交流！
☆568Mar 21, 2026Updated 4 months ago
chaosblade-io / cloud-native-maturity-evaluate
View on GitHub
云原生成熟度评估
☆351Jun 10, 2026Updated last month
spring-cloud-alibaba-group / monolithic-to-microservice
View on GitHub
☆358May 26, 2026Updated 2 months ago
omnuron / omniclaw
View on GitHub
The first agentic payment network: policy-controlled, gasless, and real money-ready. OmniClaw CLI + Financial Policy Engine let autonomo…
☆576May 29, 2026Updated last month
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
haoyangfeng2024 / smb-adtech-platform
View on GitHub
AI-powered programmatic advertising infrastructure for U.S. SMBs
☆474May 8, 2026Updated 2 months ago
MetaInFLow / Enterprise-ai-scenario-map-skill
View on GitHub
咨询AI Agent Skill - 为任何企业自动生成 AI 应用场景地图报告 | Auto-generate AI scenario map reports for any enterprise
☆625Apr 1, 2026Updated 3 months ago
dox012 / nano-claude
View on GitHub
claude code simplified (~2000 Lines)
☆351Apr 2, 2026Updated 3 months ago
Palaiologos1453 / OpenInterview
View on GitHub
☆249Jul 3, 2026Updated 3 weeks ago
TIGER-AI-Lab / ClawBench
View on GitHub
Open-source benchmark for browser AI agents on daily tasks.
☆525Updated this week
crabtalk / crabtalk
View on GitHub
Agents daemon that hides nothing
☆719Updated this week
AgentSkillOS / SkillAnything
View on GitHub
Making ANY Software Skill-Native -- Auto-generate production-ready AI Agent Skills for Claude Code, OpenClaw, Codex, and more.
☆458Apr 6, 2026Updated 3 months ago
terminators2025 / RealMirror
View on GitHub
RealMirror, a comprehensive, open-source embodied AI VLA platform.
☆793May 18, 2026Updated 2 months ago
SeekingDream / DyCodeEval
View on GitHub
Official repository of the ICML2025 paper “Dynamic Benchmarking of Reasoning Capabilities in Code Large Language Models Under Data Contam…
☆226Dec 23, 2025Updated 7 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
SeekingDream / DLCompilerAttack
View on GitHub
☆243May 20, 2026Updated 2 months ago
OmniCustom-project / OmniCustom
View on GitHub
Official Implementation of 'OmniCustom: Sync Audio-Video Customization Via Joint Audio-Video Generation Model'
☆426Updated this week
pexoai / pexo-skills
View on GitHub
A collection of open-source Agent Skills for content creation — images, audio, and video.
☆757Updated this week
Mexcauth / trading
View on GitHub
☆240Apr 3, 2026Updated 3 months ago
ru-yee / Life-Agent-RU-YEE
View on GitHub
Life Agent RU YEE — An AI-powered life management agent that autonomously handles daily routines including meal planning, grocery …
☆825Mar 27, 2026Updated 3 months ago
thomasxm / CrowdSentinels-AI-MCP
View on GitHub
AI-powered threat hunting and incident response MCP server for Elasticsearch/OpenSearch
☆203Jul 19, 2026Updated last week
thesongzhu / Friday
View on GitHub
Private control plane for AI agents
☆901Updated this week
shepaw / shepaw
View on GitHub
☆258Updated this week
laiyingxin2 / Agent4FaceForgery
View on GitHub
Agent4FaceForgery: Multi-Agent LLM Framework for Realistic Face Forgery
☆209Apr 22, 2026Updated 3 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
w3nq14 / SecUnion
View on GitHub
网络安全博客大全
☆173Mar 31, 2026Updated 3 months ago
raids-lab / crater
View on GitHub
Crater is a cloud-native AI training & inference platform.
☆543Updated this week
yifangao112 / Camyla
View on GitHub
Scaling Autonomous Research in Medical Image Segmentation
☆360Apr 14, 2026Updated 3 months ago
voicecomm-ai / ai-vocsagex-backend
View on GitHub
通晓AI中台-后端能力
☆956Feb 28, 2026Updated 4 months ago
warp-context / rightStage
View on GitHub
Sync AI context across every terminal window. 3 seconds to know what to work on next.
☆352May 7, 2026Updated 2 months ago
openperf / openclaw-cloud
View on GitHub
Give your AI Agent a cloud-native life. Deploy once, converse everywhere.
☆272Feb 5, 2026Updated 5 months ago
dp-archive / archive
View on GitHub
Skill Compose is an open-source agent builder and runtime platform for skill-powered agents. No workflow graphs. No CLI.
☆1,105Mar 4, 2026Updated 4 months ago
Tencent-Hunyuan / HY-Embodied
View on GitHub
HY-Embodied: Embodied Foundation Models for Real-World Agents
☆831Jul 15, 2026Updated last week
Moxxkidd / Garden
View on GitHub
☆295Updated this week
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
ZhangJinHaHaHa / AgentLens
View on GitHub
Agentlens is a trusted agent trading platform. Here, you can quickly find the Agent that meets your needs, and you can also publish your…
☆1,042Jul 14, 2026Updated last week
ChengHua926 / rBTC
View on GitHub
☆161Jun 3, 2026Updated last month
Sunefei / PatchNet
View on GitHub
Implementation of "Handling Feature Heterogeneity with Learnable Graph Patches"
☆506Apr 26, 2026Updated 3 months ago
PolyX-Research / Response-G1
View on GitHub
[ACL 2026] Official Implementation of Response-G1: Explicit Scene Graph Modeling for Proactive Streaming Video Understanding
☆236May 26, 2026Updated 2 months ago
zhu-zhu666 / S800-Vehicle-Network-Security-Testing-Framework
View on GitHub
Test file, please do not call.
☆162Apr 26, 2026Updated 3 months ago
VexDB-THU / VexDB-Lite
View on GitHub
A cross-platform vector database, which can be integrated into existing databases as a plugin.
☆1,366Updated this week
Nathan-code-development / AIApplication
View on GitHub
AI model square and chat with AI.
☆986May 11, 2026Updated 2 months ago