Multi-Agent LLM Evaluation Docs: https://maseval.readthedocs.io/
☆35May 31, 2026Updated 3 weeks ago
Alternatives and similar repositories for MASEval
Users that are interested in MASEval are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Source code for the paper "Do Deep Neural Network Solutions form a Star Domain?"☆12May 26, 2024Updated 2 years ago
- ☆12Oct 4, 2023Updated 2 years ago
- This is an implementation of the paper "Are We Done with Object-Centric Learning?"☆13Jun 21, 2026Updated last week
- public packages published with @arno/*☆15May 27, 2026Updated last month
- MindJam - A collaborative mindmap app built on React, React-Konva and Socket.io☆12Nov 14, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- [ICLR 2026 🔥] Dr.LLM: Dynamic Layer Routing in LLMs☆53Apr 24, 2026Updated 2 months ago
- A collaborative real-time local-first global canvas☆17Sep 28, 2023Updated 2 years ago
- Enhanced Unsupervised Object Discoveries through Exhaustive Self-Supervised Transformers☆15Jun 25, 2024Updated 2 years ago
- ☆39Oct 21, 2022Updated 3 years ago
- ☆30May 15, 2025Updated last year
- A real time collaborative drawing whiteboard with canvas html 5, fabric js and socket.io.☆13Feb 23, 2026Updated 4 months ago
- Extend Konva's functionality to export stages as SVG. Enhance the quality of exported images with SVG format.☆24Jan 11, 2025Updated last year
- Code for the paper "Studying Large Language Model Behaviors Under Context-Memory Conflicts With Real Documentss"☆14Oct 8, 2024Updated last year
- ☆21May 3, 2022Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Websocket connector for Yjs (Browser/Node client)☆21Feb 28, 2018Updated 8 years ago
- SSH tunneling daemon☆22Jan 19, 2025Updated last year
- High-performance event store for Dynamic Consistency Boundaries☆78Updated this week
- Utility functions to apply and invert patches generated by Automerge document changes.☆24Mar 31, 2026Updated 2 months ago
- Edit an input image with an input prompt using Google's new Gemini 2.5 Flash (Nano Banana) model, Cloudflare Workers, and Cloudflare R2!☆26Aug 27, 2025Updated 10 months ago
- ☆18Jul 24, 2023Updated 2 years ago
- ☆20Jun 19, 2024Updated 2 years ago
- Federated reconnaissance mini-ImageNet benchmark and baseline models☆13Sep 2, 2021Updated 4 years ago
- Building the Bi-LSTM & the CNN-GAN models to compose Classical Music in different eras☆12Aug 2, 2021Updated 4 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Recifal aquarium monitoring with arduino, alerts and settings by SMS and webApp, and statistics database☆11Jan 11, 2026Updated 5 months ago
- A library that merges Elmish with React, offering an external store for efficient and selective component rendering.☆24May 4, 2026Updated last month
- Source code of "Leaky Thoughts: Large Reasoning Models Are Not Private Thinkers" EMNLP 2025☆17Jan 12, 2026Updated 5 months ago
- Lean package for "How To Prove It with Lean", a companion to the book "How To Prove It"☆39Jun 10, 2026Updated 2 weeks ago
- Adaptive Deep Learning Model Selection On Embedded Systems☆11May 6, 2018Updated 8 years ago
- [ICLR 2024] "Data Distillation Can Be Like Vodka: Distilling More Times For Better Quality" by Xuxi Chen*, Yu Yang*, Zhangyang Wang, Baha…☆15May 18, 2024Updated 2 years ago
- Sample, estimate, aggregate: A recipe for causal discovery foundation models☆17Jun 21, 2024Updated 2 years ago
- ☆11Nov 4, 2024Updated last year
- An SDK for building applications on top of FLock V1☆14Apr 9, 2024Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- 📚📚📚📚📚📚📚📚📚 Reading everything☆16Mar 11, 2026Updated 3 months ago
- Run and save the code in ChatGPT. Supports upto 70+ languages.☆13May 26, 2023Updated 3 years ago
- Active and Sample-Efficient Model Evaluation☆27May 22, 2025Updated last year
- Source code of NAACL 2025 Findings "Scaling Up Membership Inference: When and How Attacks Succeed on Large Language Models"☆16Dec 16, 2025Updated 6 months ago
- graphs from Draw.io☆14Sep 26, 2024Updated last year
- Testing things..☆37Aug 24, 2025Updated 10 months ago
- AI-powered semantic search engine for emojis in 50+ languages, developed in Python☆41Aug 28, 2024Updated last year