ElevenLiy / MATEvalLinks
MATEval is the first multi-agent framework simulating human collaborative discussion for open-ended text evaluation.
☆28Updated 3 months ago
Alternatives and similar repositories for MATEval
Users that are interested in MATEval are comparing it to the libraries listed below
Sorting:
- ☆175Updated last month
- LLM-FuzzX is a user-friendly fuzz testing tool for Large Language Models (e.g., GPT, Claude, LLaMA), featuring advanced task-aware mutati…☆113Updated 4 months ago
- The 1st dynamic phishing kit dataset☆201Updated 7 months ago
- [ACL 2025 Oral] QAEncoder: Towards Aligned Representation Learning in Question Answering Systems☆175Updated 2 months ago
- django vue3 ts admin vben fastapi langchain 寻找远程/全职 Python 岗位机会 WX JUN765462425☆626Updated this week
- AIGC Creative Suite☆202Updated 4 months ago
- ☆123Updated 6 months ago
- A graph-based python framework for fitness landscape analysis☆160Updated last month
- ☆213Updated 4 months ago
- A powerful multi-format file parsing, data cleaning, and AI annotation toolkit.☆139Updated last week
- ☆84Updated 6 months ago
- Integrated Plant Single- Cell Database☆168Updated last month
- A L4 innovative AGI System Empowering miRNA Drug Discovery☆330Updated 2 months ago
- 职星学院企业培训系统是一套基于点播、直播、考试、培训、面授等功能完善的在线教育系统,开源版是基于商业版精简实现的一个企业员工培训系统,致力于打造一个各行业都适用的在线培训系统、企业培训平台、员工培训系统、企业内部培训系统。☆365Updated 3 months ago
- ☆201Updated 2 months ago
- We introduce temporal working memory (TWM), which aims to enhance the temporal modeling capabilities of Multimodal foundation models (MFM…☆309Updated 7 months ago
- ☆136Updated 2 months ago
- Launching the "Agent Creation Toolkit", providing developers with an intuitive and efficient Development Environment, supporting the rapi…☆202Updated 5 months ago
- An MCP service that automates data analysis through IPython sessions.☆160Updated last month
- Enhanced Benchmark Creation Tool: Automates dataset profiling, model benchmarking, and performance visualization for streamlined evaluati…☆110Updated 4 months ago
- ☆160Updated 2 months ago
- ☆100Updated 7 months ago
- Repo for paper *Measuring and Augmenting Large Language Models for Solving Capture-the-Flag Challenges*☆255Updated 2 months ago
- Revolutionizing Cancer Treatment with AI & Robotics☆65Updated 6 months ago
- (LLM) A Sparse Activation Architecture for Green Artificial Intelligence: The Energy Efficiency Optimization Language Model AliceSkyGarde…☆165Updated 2 months ago
- Repo-level benchmark for real-world Code Agents: from repo understanding → env setup → incremental dev/bug-fixing → task delivery, with c…☆140Updated this week
- ☆162Updated last year
- A Easily Extensible labeling annotation template web tool (Flask + Vue 3) for annotation [易扩展的标注网页模板]☆24Updated 4 months ago
- F²-Gen - A open source Financial Fraud Detection Data Generator Web Application☆362Updated last month
- a framework for server with golang☆163Updated 3 months ago