LLM Benchmark
☆44May 24, 2025Updated last year
Alternatives and similar repositories for PERSONA
Users that are interested in PERSONA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆16Aug 30, 2025Updated 10 months ago
- [ACL Findings 2025] A benchmark for anomaly detection using large language models. It supports zero-shot detection, data augmentation, an…☆45Oct 9, 2025Updated 8 months ago
- Official Implement of "ADGym: Design Choices for Deep Anomaly Detection", NeurIPS 2023☆34Aug 23, 2023Updated 2 years ago
- A multi-agent framework to fully automate anomaly detection in different modalities, tabular, graph, time series, and more (work in progr…☆100Apr 24, 2026Updated 2 months ago
- [ICLR 2025] Permute-and-Flip: An optimally robust and watermarkable decoder for LLMs☆19Mar 20, 2025Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- GenoCraft: A Comprehensive, User-Friendly Web Platform for High-Throughput Omics Data Analysis and Visualization (https://arxiv.org/pdf/2…☆19May 28, 2025Updated last year
- Visualize expert firing frequencies across sentences in the Mixtral MoE model☆18Dec 22, 2023Updated 2 years ago
- Open-sourced evaluation suite from the Monitoring Monitorability paper☆84Jun 11, 2026Updated 3 weeks ago
- https://arxiv.org/abs/2404.10917☆14Mar 18, 2025Updated last year
- A collection of scripts for the Stack Exchange network☆17Aug 14, 2023Updated 2 years ago
- Unofficial Pytorch implementation of the paper 'Categorical Reparameterization with Gumbel-Softmax' and 'The Concrete Distribution: A Con…☆11Apr 27, 2021Updated 5 years ago
- Word acquisition in neural language models (TACL 2022).☆21Jan 30, 2025Updated last year
- This is a repository for the ACL 2020 paper: "Let Me Choose: From Verbal Context to Font Selection"☆12Nov 21, 2022Updated 3 years ago
- ☆31May 15, 2026Updated last month
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Learning from Indirect Observations☆11Jul 16, 2021Updated 4 years ago
- [ICLR 2025] Official implementation for "StringLLM: Understanding the String Processing Capability of Large Language Models"☆22Jan 23, 2025Updated last year
- Official repository for FLAME-MoE: A Transparent End-to-End Research Platform for Mixture-of-Experts Language Models☆42Sep 19, 2025Updated 9 months ago
- [ICLR'26, NAACL'25 Demo] Toolkit & Benchmark for evaluating the trustworthiness of generative foundation models.☆131Aug 22, 2025Updated 10 months ago
- [NeurIPS 2024] A task generation and model evaluation system for multimodal language models.☆73Nov 27, 2024Updated last year
- AdaRFT: Efficient Reinforcement Finetuning via Adaptive Curriculum Learning☆56Jun 13, 2025Updated last year
- [NeurIPS 2024, spotlight] Scaling Out-of-Distribution Detection for Multiple Modalities☆70Dec 3, 2025Updated 7 months ago
- Official demo repository for our ACL 2019 long paper "Generating Question-Answer Hierarchies".☆20Feb 13, 2026Updated 4 months ago
- ☆22Mar 19, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [ICLR 2026] The official code for "Doxing via the Lens: Revealing Location-related Privacy Leakage on Multi-modal Large Reasoning Models"☆28Feb 7, 2026Updated 4 months ago
- Python Pushshift.io API Wrapper (for comment/submission search)☆14Apr 29, 2021Updated 5 years ago
- ☆20Apr 5, 2024Updated 2 years ago
- A new algorithm that formulates jailbreaking as a reasoning problem.☆26Jul 2, 2025Updated last year
- Enhances Overleaf by allowing article searches and BibTeX retrieval from DBLP and Google Scholar | 通过允许从 DBLP 和 Google Scholar 进行文章搜索和获取 …☆127Feb 3, 2026Updated 5 months ago
- Code for ICML 2023 paper "When and How Does Known Class Help Discover Unknown Ones? Provable Understandings Through Spectral Analysis"☆14Jun 24, 2023Updated 3 years ago
- An automated data pipeline scaling RL to pretraining levels☆76Jun 2, 2026Updated last month
- SVIP: Towards Verifiable Inference of Open-Source Large Language Models☆15Jun 3, 2025Updated last year
- [COLM 2025] JailDAM: Jailbreak Detection with Adaptive Memory for Vision-Language Model☆26Nov 25, 2025Updated 7 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆20Apr 8, 2025Updated last year
- Code and data for paper "Can Watermarked LLMs be Identified by Users via Crafted Prompts?" Accepted by ICLR 2025 (Spotlight)☆28Dec 28, 2024Updated last year
- [ECCV2022] Motion Sensitive Contrastive Learning for Self-supervised Video Representation☆17Aug 12, 2022Updated 3 years ago
- An unofficial PyTorch implementation of MixMatch - A Holistic Approach to Semi-Supervised Learning☆14Aug 10, 2021Updated 4 years ago
- Röttger et al. (2025): "MSTS: A Multimodal Safety Test Suite for Vision-Language Models"☆20Mar 31, 2025Updated last year
- ☆15Oct 9, 2022Updated 3 years ago
- Bridging Immutable and Mutable Abstractions for Distributed Data Analytics☆12May 15, 2019Updated 7 years ago