The official GitHub repository of the paper "Recent advances in large langauge model benchmarks against data contamination: From static to dynamic evaluation"
☆545Sep 13, 2025Updated 5 months ago
Alternatives and similar repositories for Static-to-Dynamic-LLMEval
Users that are interested in Static-to-Dynamic-LLMEval are comparing it to the libraries listed below
- Official repository of the ICML2025 paper “Dynamic Benchmarking of Reasoning Capabilities in Code Large Language Models Under Data Contam…☆256Dec 23, 2025Updated 2 months ago
- 🎮 TypeScript game numeric engine for RPG & strategy games. Zero dependencies, type-safe formula parsing, battle system simulation, and e…☆773Dec 30, 2025Updated 2 months ago
- RealMirror, a comprehensive, open-source embodied AI VLA platform.☆711Feb 2, 2026Updated last month
- 🚀 2026 TRON (TRX) vanity address generator and USDT wallet vanity address generator. GPU-accelerated, open source, safe and reliable.☆200Jan 24, 2026Updated last month
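- High-performance digital human desktop application framework, ready to use out of the box, with integrated AI chat and live wallpapers; runs digital humans smoothly even on lower-performance devices.☆177Dec 22, 2025Updated 2 months ago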
- ☆253Oct 26, 2025Updated 4 months ago
- Official Implementation of 'OmniCustom: Sync Audio-Video Customization Via Joint Audio-Video Generation Model'☆310Feb 13, 2026Updated 3 weeks ago
- Skill Compose is an open-source agent builder and runtime platform for skill-powered agents. No workflow graphs. No CLI.☆1,061Feb 27, 2026Updated last week
- Agentic Generative Engine Optimization☆362Feb 24, 2026Updated last week
- [ICCV 2025] Official Implementation of "ProLearn: Alleviating Textual Reliance in Medical Language-guided Segmentation via Prototype-driv…☆54Sep 12, 2025Updated 5 months ago
- Give your AI Agent a cloud-native life. Deploy once, converse everywhere.☆264Feb 5, 2026Updated last month
- ☆108Feb 5, 2026Updated last month
- HarmonyOS Next innovative capabilities case repo.☆157Dec 18, 2025Updated 2 months ago
- DataCompare is a Java-based tool designed to verify the consistency of data after replication or migration operations are completed betwe…☆169Updated this week
- Watchdog for Grasshopper: prevents Rhino & Grasshopper from freezing due to accidental massive computations or cascading calculation chains…☆140Jan 30, 2026Updated last month
- A distributed framework for LLM agents☆439Updated this week
- MCP server implementation for Google's Gemini API☆143Jan 28, 2026Updated last month
- drqa code repository☆31Oct 10, 2025Updated 4 months ago
- ☆38Feb 27, 2025Updated last year
- This is the repo for the paper TerminalTraj: Large-Scale Terminal Agentic Trajectory Generation from Dockerized Environments☆116Feb 11, 2026Updated 3 weeks ago
- apply-bot.com☆122Dec 16, 2025Updated 2 months ago
- Context Axial Reverse Attention Network for Small Medical Objects Segmentation☆81Sep 29, 2025Updated 5 months ago
- ☆102Feb 12, 2026Updated 3 weeks ago
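- Integrates six agents (data analysis, flowchart generation, browser automation, video content summarization, server monitoring, and smart charts), with support for natural-language interaction and intelligent routing.☆114Jan 30, 2026Updated last month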
- ☆82Jan 25, 2026Updated last month
- A general‑purpose AI‑powered examination platform for schools, training providers, enterprises, and online programs. It delivers multi‑di…☆272Sep 28, 2025Updated 5 months ago
- Oinone is an AI-powered enterprise-grade productization engine, serving as an integrated R&D framework for developing standard products a…☆1,722Updated this week
- ☆50Feb 13, 2026Updated 3 weeks ago
- AI for JDK1.8☆36Updated this week
- ☆121Jan 18, 2026Updated last month
- ☆21Feb 27, 2026Updated last week
- ☆55Apr 14, 2025Updated 10 months ago
- This is an enterprise-level AI image generation platform based on ComfyUI, focusing on photorealistic human image generation. It advanced …☆253Oct 3, 2025Updated 5 months ago
- Random forest classification model☆1,010Feb 1, 2026Updated last month
- ☆192Mar 14, 2025Updated 11 months ago
- Paomian's cryptography toolbox☆75Dec 21, 2025Updated 2 months ago
- Official Implementation of "CLI-Gym: Scalable CLI Task Generation via Agentic Environment Inversion"☆119Updated this week
- Qurio is a high-velocity AI knowledge workspace built for teams that demand more than basic chat. It supports generic providers. Highligh…☆41Feb 27, 2026Updated last week
- ☆132Dec 15, 2025Updated 2 months ago