The official GitHub repository of the paper "Recent advances in large language model benchmarks against data contamination: From static to dynamic evaluation"
☆546Mar 3, 2026Updated 3 weeks ago
Alternatives and similar repositories for Static-to-Dynamic-LLMEval
Users that are interested in Static-to-Dynamic-LLMEval are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official repository of the ICML2025 paper “Dynamic Benchmarking of Reasoning Capabilities in Code Large Language Models Under Data Contam…☆254Dec 23, 2025Updated 3 months ago
- Claude Code skill for improving website AEO (AI Engine Optimization) and GEO (Generative Engine Optimization) scores — 16 foundational ch…☆388Updated this week
- RealMirror, a comprehensive, open-source embodied AI VLA platform.☆804Feb 2, 2026Updated last month
- 🎮 TypeScript game numeric engine for RPG & strategy games. Zero dependencies, type-safe formula parsing, battle system simulation, and e…☆780Dec 30, 2025Updated 2 months ago
- Life Agent RU YEE — An AI-powered life management agent that autonomously handles daily routines including meal planning, grocery …☆371Mar 16, 2026Updated last week
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- 高性能数字人桌面应用框架,开箱即用,集成了AI对话与动态壁纸,即使在较低性能的设备上也能流畅运行数字人☆180Dec 22, 2025Updated 3 months ago
- 🚀2026年波场TRX靓号地址生成器,USDT钱包靓号生成器,利用 gpu 进行加速,代码开源,安全可靠。TRON vanity address generator, use GPU, opensource, safety, enjoy.☆202Jan 24, 2026Updated 2 months ago
- ☆250Oct 26, 2025Updated 5 months ago
- AI-powered programmatic advertising infrastructure for U.S. SMBs☆328Updated this week
- The composable agent runtime.☆307Updated this week
- Skill Compose is an open-source agent builder and runtime platform for skill-powered agents. No workflow graphs. No CLI.☆1,113Mar 4, 2026Updated 3 weeks ago
- Official Implementation of 'OmniCustom: Sync Audio-Video Customization Via Joint Audio-Video Generation Model'☆369Feb 13, 2026Updated last month
- Agentic Generative Engine Optimizaiton☆371Feb 24, 2026Updated last month
- Give your AI Agent a cloud-native life. Deploy once, converse everywhere.☆269Feb 5, 2026Updated last month
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Dynamic MBTI Personality Simulation for LLM Agents via Carl Jung's Theory. A framework that enables LLM agents' MBTI personalities to nat…☆765Mar 17, 2026Updated last week
- ☆107Feb 5, 2026Updated last month
- DataCompare is a Java-based tool designed to verify the consistency of data after replication or migration operations are completed betwe…☆199Mar 2, 2026Updated 3 weeks ago
- [ICCV 2025] Official Implementation of "ProLearn: Alleviating Textual Reliance in Medical Language-guided Segmentation via Prototype-driv…☆54Sep 12, 2025Updated 6 months ago
- 🚀 OpenClaw 一键安装部署脚本 | Zero-config installer for OpenClaw - Single command to get started☆87Mar 17, 2026Updated last week
- HarmonyOS Next innovative capabilities case repo.☆157Dec 18, 2025Updated 3 months ago
- A distributed framework for LLM agents☆439Feb 28, 2026Updated 3 weeks ago
- Core abstractions for your agentic workflow☆110Mar 11, 2026Updated 2 weeks ago
- MCP server implementation for Google's Gemini API☆143Mar 9, 2026Updated 2 weeks ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- ☆161Apr 14, 2025Updated 11 months ago
- Watchdog for Grasshopper Prevent Rhino & Grasshopper from freezing due to accidental massive computations or cascading calculation chains…☆140Jan 30, 2026Updated last month
- drqa code repository☆31Oct 10, 2025Updated 5 months ago
- ☆39Feb 27, 2025Updated last year
- 🛡️AutoControl Arena: Synthesizing Executable Test Environments for Frontier AI Risk Evaluation☆39Mar 19, 2026Updated last week
- 集成数据分析、流程图生成、浏览器自动化、视频内容总结、服务器监控、智能图表六 大智能体,支持自然语言交互和智能路由。☆115Jan 30, 2026Updated last month
- A general‑purpose AI‑powered examination platform for schools, training providers, enterprises, and online programs. It delivers multi‑di…☆273Sep 28, 2025Updated 5 months ago
- 基于函数式编程和 dio 封装的类似 ahooks 的 useRequest 网络请求库☆81Mar 5, 2026Updated 3 weeks ago
- apply-bot.com☆125Dec 16, 2025Updated 3 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- A high-performance personal fund tracker focused on providing real-time net value estimations. It features deep stock penetration, smart …☆29Updated this week
- ☆82Jan 25, 2026Updated 2 months ago
- ☆67Updated this week
- ☆51Feb 13, 2026Updated last month
- Oinone is an AI‑Powered low‑code framework that unifies AI and developers around a shared metadata model to build maintainable, evolvable…☆2,095Mar 18, 2026Updated last week
- This is the repo for the paper TerminalTraj: Large-Scale Terminal Agentic Trajectory Generation from Dockerized Environments☆119Mar 13, 2026Updated 2 weeks ago
- ☆101Feb 12, 2026Updated last month