Multi-Agent LLM Evaluation Docs: https://maseval.readthedocs.io/
☆27Mar 29, 2026Updated last week
Alternatives and similar repositories for MASEval
Users that are interested in MASEval are comparing it to the libraries listed below.
- Source code for the paper "Do Deep Neural Network Solutions form a Star Domain?"☆12May 26, 2024Updated last year
- ☆12Oct 4, 2023Updated 2 years ago
- This is an implementation of the paper "Are We Done with Object-Centric Learning?"☆12Sep 11, 2025Updated 6 months ago
- [ICLR 2026 🔥] Dr.LLM: Dynamic Layer Routing in LLMs☆44Oct 15, 2025Updated 5 months ago
- Enhanced Unsupervised Object Discoveries through Exhaustive Self-Supervised Transformers☆15Jun 25, 2024Updated last year
- ☆28May 15, 2025Updated 10 months ago
- ☆39Oct 21, 2022Updated 3 years ago
- Code for the paper "Studying Large Language Model Behaviors Under Context-Memory Conflicts With Real Documents"☆15Oct 8, 2024Updated last year
- ☆21May 3, 2022Updated 3 years ago
- SSH tunneling daemon☆22Jan 19, 2025Updated last year
- Edit an input image with an input prompt using Google's new Gemini 2.5 Flash (Nano Banana) model, Cloudflare Workers, and Cloudflare R2!☆26Aug 27, 2025Updated 7 months ago
- ☆18Jul 24, 2023Updated 2 years ago
- ☆20Jun 19, 2024Updated last year
- Building the Bi-LSTM & the CNN-GAN models to compose Classical Music in different eras☆11Aug 2, 2021Updated 4 years ago
- Federated reconnaissance mini-ImageNet benchmark and baseline models☆13Sep 2, 2021Updated 4 years ago
- Reef aquarium monitoring with Arduino, alerts and settings by SMS and web app, and statistics database☆11Jan 11, 2026Updated 2 months ago
- A library that merges Elmish with React, offering an external store for efficient and selective component rendering.☆24Mar 25, 2026Updated last week
- Lean package for "How To Prove It with Lean", a companion to the book "How To Prove It"☆38Dec 20, 2025Updated 3 months ago
- Source code of "Leaky Thoughts: Large Reasoning Models Are Not Private Thinkers" EMNLP 2025☆17Jan 12, 2026Updated 2 months ago
- Adaptive Deep Learning Model Selection On Embedded Systems☆11May 6, 2018Updated 7 years ago
- [ICLR 2024] "Data Distillation Can Be Like Vodka: Distilling More Times For Better Quality" by Xuxi Chen*, Yu Yang*, Zhangyang Wang, Baha…☆15May 18, 2024Updated last year
- Sample, estimate, aggregate: A recipe for causal discovery foundation models☆17Jun 21, 2024Updated last year
- ☆10Nov 4, 2024Updated last year
- An SDK for building applications on top of FLock V1☆14Apr 9, 2024Updated last year
- 📚📚📚📚📚📚📚📚📚 Reading everything☆15Mar 11, 2026Updated 3 weeks ago
- Run and save the code in ChatGPT. Supports up to 70+ languages.☆13May 26, 2023Updated 2 years ago
- Active and Sample-Efficient Model Evaluation☆27May 22, 2025Updated 10 months ago
- Source code of NAACL 2025 Findings "Scaling Up Membership Inference: When and How Attacks Succeed on Large Language Models"☆15Dec 16, 2025Updated 3 months ago
- graphs from Draw.io☆14Sep 26, 2024Updated last year
- Testing things..☆36Aug 24, 2025Updated 7 months ago
- ☆41Sep 30, 2025Updated 6 months ago
- AI-powered semantic search engine for emojis in 50+ languages, developed in Python☆41Aug 28, 2024Updated last year
- [ICCV-2023] Heterogeneous Forgetting Compensation for Class-Incremental Learning☆12Dec 4, 2023Updated 2 years ago
- A simple Telegram Bot that updates a Notion database based on user input. This bot supports various property types and allows users to sp…☆22Jun 30, 2023Updated 2 years ago
- A curated list of Text-to-Video Generation papers and BibTeX entries☆21Feb 21, 2024Updated 2 years ago
- ☆19Jul 24, 2023Updated 2 years ago
- Code for NeurIPS'23 paper "A Bayesian Approach To Analysing Training Data Attribution In Deep Learning"☆17Jan 12, 2024Updated 2 years ago
- ☆13Oct 4, 2023Updated 2 years ago
- Source code of "TRAP: Targeted Random Adversarial Prompt Honeypot for Black-Box Identification", ACL2024 (findings)☆14Nov 20, 2024Updated last year