Multi-Agent LLM Evaluation Docs: https://maseval.readthedocs.io/
☆31Apr 16, 2026Updated last week
Alternatives and similar repositories for MASEval
Users that are interested in MASEval are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Source code for the paper "Do Deep Neural Network Solutions form a Star Domain?"☆12May 26, 2024Updated last year
- ☆12Oct 4, 2023Updated 2 years ago
- This is an implementation of the paper "Are We Done with Object-Centric Learning?"☆12Apr 12, 2026Updated 2 weeks ago
- public packages published with @arno/*☆15Mar 2, 2026Updated last month
- MindJam - A collaborative mindmap app built on React, React-Konva and Socket.io☆12Nov 14, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [ICLR 2026 🔥] Dr.LLM: Dynamic Layer Routing in LLMs☆45Apr 21, 2026Updated last week
- Enhanced Unsupervised Object Discoveries through Exhaustive Self-Supervised Transformers☆15Jun 25, 2024Updated last year
- A collaborative real-time local-first global canvas☆16Sep 28, 2023Updated 2 years ago
- ☆28May 15, 2025Updated 11 months ago
- ☆39Oct 21, 2022Updated 3 years ago
- A real time collaborative drawing whiteboard with canvas html 5, fabric js and socket.io.☆13Feb 23, 2026Updated 2 months ago
- Extend Konva's functionality to export stages as SVG. Enhance the quality of exported images with SVG format.☆24Jan 11, 2025Updated last year
- Code for the paper "Studying Large Language Model Behaviors Under Context-Memory Conflicts With Real Documentss"☆15Oct 8, 2024Updated last year
- ☆21May 3, 2022Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Websocket connector for Yjs (Browser/Node client)☆21Feb 28, 2018Updated 8 years ago
- SSH tunneling daemon☆22Jan 19, 2025Updated last year
- High-performance event store for Dynamic Consistency Boundaries☆65Updated this week
- Utility functions to apply and invert patches generated by Automerge document changes.☆24Mar 31, 2026Updated 3 weeks ago
- Edit an input image with an input prompt using Google's new Gemini 2.5 Flash (Nano Banana) model, Cloudflare Workers, and Cloudflare R2!☆26Aug 27, 2025Updated 8 months ago
- ☆18Jul 24, 2023Updated 2 years ago
- Federated reconnaissance mini-ImageNet benchmark and baseline models☆13Sep 2, 2021Updated 4 years ago
- ☆20Jun 19, 2024Updated last year
- Building the Bi-LSTM & the CNN-GAN models to compose Classical Music in different eras☆11Aug 2, 2021Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Recifal aquarium monitoring with arduino, alerts and settings by SMS and webApp, and statistics database☆11Jan 11, 2026Updated 3 months ago
- A library that merges Elmish with React, offering an external store for efficient and selective component rendering.☆24Mar 25, 2026Updated last month
- Source code of "Leaky Thoughts: Large Reasoning Models Are Not Private Thinkers" EMNLP 2025☆17Jan 12, 2026Updated 3 months ago
- Lean package for "How To Prove It with Lean", a companion to the book "How To Prove It"☆39Dec 20, 2025Updated 4 months ago
- Adaptive Deep Learning Model Selection On Embedded Systems☆11May 6, 2018Updated 7 years ago
- [ICLR 2024] "Data Distillation Can Be Like Vodka: Distilling More Times For Better Quality" by Xuxi Chen*, Yu Yang*, Zhangyang Wang, Baha…☆15May 18, 2024Updated last year
- Sample, estimate, aggregate: A recipe for causal discovery foundation models☆17Jun 21, 2024Updated last year
- ☆10Nov 4, 2024Updated last year
- An SDK for building applications on top of FLock V1☆14Apr 9, 2024Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- 📚📚📚📚📚📚📚📚📚 Reading everything☆15Mar 11, 2026Updated last month
- Run and save the code in ChatGPT. Supports upto 70+ languages.☆13May 26, 2023Updated 2 years ago
- Active and Sample-Efficient Model Evaluation☆27May 22, 2025Updated 11 months ago
- Source code of NAACL 2025 Findings "Scaling Up Membership Inference: When and How Attacks Succeed on Large Language Models"☆15Dec 16, 2025Updated 4 months ago
- graphs from Draw.io☆14Sep 26, 2024Updated last year
- Testing things..☆37Aug 24, 2025Updated 8 months ago
- AI-powered semantic search engine for emojis in 50+ languages, developed in Python☆41Aug 28, 2024Updated last year