SeekingDream / Static-to-Dynamic-LLMEvalLinks
The official GitHub repository of the paper "Recent advances in large langauge model benchmarks against data contamination: From static to dynamic evaluation"
☆384Updated 4 months ago
Alternatives and similar repositories for Static-to-Dynamic-LLMEval
Users that are interested in Static-to-Dynamic-LLMEval are comparing it to the libraries listed below
Sorting:
- Official repository of the ICML2025 paper “Dynamic Benchmarking of Reasoning Capabilities in Code Large Language Models Under Data Contam…☆183Updated last month
- ☆38Updated 10 months ago
- ☆141Updated 2 months ago
- ☆192Updated 10 months ago
- This is a enterprise-level AI image generation platform based on ComfyUI, focusing on photorealistic human image generation. It advanced …☆253Updated 3 months ago
- ☆121Updated last week
- 🚀2026年波场TRX靓号生成器,USDT钱包靓号生成器,利用 gpu 进行加速,代码开源,安全可靠。TRON vanity address generator, use GPU, opensource, safety, enjoy.☆48Updated this week
- MCP server implementation for Google's Gemini API☆146Updated 3 weeks ago
- ☆42Updated 3 months ago
- drqa code repository☆31Updated 3 months ago
- Dynamic MBTI Personality Simulation for LLM Agents via Carl Jung's Theory. A framework that enables LLM agents' MBTI personalities to nat…☆155Updated last week
- A distributed framework for LLM agents☆441Updated last week
- 集成数据分析、流程图生成、浏览器自动化、视频内容总结、服务器监控、智能图表六大智能体,支持自然语言交互和智能路由。☆37Updated last week
- 高性能数字人桌面应用框架,开箱即用,集成了AI对话与动态壁纸,即使在较低性能的设备上也能流畅运行数字人☆175Updated last month
- A KMP (Kotlin Multiplatform) logging library with Android-style API. Write once, log everywhere — composable, lazy, and zero-boilerplate.☆27Updated 2 weeks ago
- ☆23Updated 4 months ago
- A collection of paper and code for chain of thought finetuning (CoT-Finetuning)☆118Updated last month
- Browser Pilot built on an openJiuwen framework.☆82Updated this week
- ☆33Updated last month
- 泡面的密码工具箱☆77Updated last month
- 🤖 基于深度学习的AI量化投资系统 | Vision-Based Quantitative Trading System with Deep Learning☆105Updated last week
- HarmonyOS Next innovative capabilities case repo.☆159Updated last month
- 统一应用菜单权限管理☆71Updated 3 weeks ago
- apply-bot.com☆113Updated last month
- A general‑purpose AI‑powered examination platform for schools, training providers, enterprises, and online programs. It delivers multi‑di…☆272Updated 3 months ago
- [NeurIPS 2025 Main] SWE-SQL: Illuminating LLM Pathways to Solve User SQL Issues in Real-World Applications☆1,082Updated 2 weeks ago
- Text2GraphRAG Disease Assistant builds a disease-focused retrieval-augmented generation workflow. It ingests structured Markdown (demo: o…☆41Updated 2 months ago
- a free frontend-only online tools set, publicly available at www.atools.live☆21Updated last week
- 🎮 TypeScript game numeric engine for RPG & strategy games. Zero dependencies, type-safe formula parsing, battle system simulation, and e…☆757Updated 3 weeks ago
- A curated list of Model Context Protocol (MCP) servers☆1,016Updated last week