LLM Benchmark
☆43May 24, 2025Updated last year
Alternatives and similar repositories for PERSONA
Users that are interested in PERSONA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ACL Findings 2025] A benchmark for anomaly detection using large language models. It supports zero-shot detection, data augmentation, an…☆44Oct 9, 2025Updated 7 months ago
- Arabic collocations library and data for Python☆10Nov 14, 2021Updated 4 years ago
- NLPBench: Evaluating NLP-Related Problem-solving Ability in Large Language Models☆10Oct 27, 2023Updated 2 years ago
- [ICLR 2025] Permute-and-Flip: An optimally robust and watermarkable decoder for LLMs☆19Mar 20, 2025Updated last year
- Visualize expert firing frequencies across sentences in the Mixtral MoE model☆18Dec 22, 2023Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆13Apr 30, 2026Updated 3 weeks ago
- Open-sourced evaluation suite from the Monitoring Monitorability paper☆75Apr 22, 2026Updated last month
- https://arxiv.org/abs/2404.10917☆14Mar 18, 2025Updated last year
- LLM Program Watermarking☆18Apr 19, 2024Updated 2 years ago
- Word acquisition in neural language models (TACL 2022).☆21Jan 30, 2025Updated last year
- Official codebase for ICLR oral paper Unsupervised Vision-Language Grammar Induction with Shared Structure Modeling☆36Apr 14, 2022Updated 4 years ago
- Toolkit for foundation models in causal inference☆33Jan 14, 2026Updated 4 months ago
- ☆31May 15, 2026Updated last week
- ☆24May 21, 2025Updated last year
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Learning from Indirect Observations☆11Jul 16, 2021Updated 4 years ago
- ☆13Oct 12, 2020Updated 5 years ago
- [ICLR 2025] Official implementation for "StringLLM: Understanding the String Processing Capability of Large Language Models"☆22Jan 23, 2025Updated last year
- Summarize. is a Streamlit application that performs automatic text summarization using both extractive and abstractive models.☆16Sep 22, 2021Updated 4 years ago
- Klimatkollen's data pipeline and API for processing company sustainability reports☆23Updated this week
- [ICLR'26, NAACL'25 Demo] Toolkit & Benchmark for evaluating the trustworthiness of generative foundation models.☆130Aug 22, 2025Updated 9 months ago
- ☆10Dec 28, 2023Updated 2 years ago
- [NeurIPS 2024] A task generation and model evaluation system for multimodal language models.☆72Nov 27, 2024Updated last year
- A framework for steering MoE models by detecting and controlling behavior-linked experts.☆35Sep 12, 2025Updated 8 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [ICLR 2026] The official code for "Doxing via the Lens: Revealing Location-related Privacy Leakage on Multi-modal Large Reasoning Models"☆26Feb 7, 2026Updated 3 months ago
- code for DOMI☆12Mar 24, 2023Updated 3 years ago
- [ACL 2025 Main] Code and data for paper "Can LLM Watermarks Robustly Prevent Unauthorized Knowledge Distillation?"☆22Jun 18, 2025Updated 11 months ago
- ☆21Apr 5, 2024Updated 2 years ago
- A new algorithm that formulates jailbreaking as a reasoning problem.☆26Jul 2, 2025Updated 10 months ago
- Code for ICML 2023 paper "When and How Does Known Class Help Discover Unknown Ones? Provable Understandings Through Spectral Analysis"☆14Jun 24, 2023Updated 2 years ago
- doi2bibtex: Resolve DOIs and arXiv identifiers to formatted BibTeX entries☆23Aug 20, 2024Updated last year
- An automated data pipeline scaling RL to pretraining levels☆77Oct 11, 2025Updated 7 months ago
- A lightweight library for large laguage model (LLM) jailbreaking defense.☆61Sep 11, 2025Updated 8 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- SpectraGuru - A Spectra Analysis Application☆35May 15, 2026Updated last week
- [ICML 2024] Watermarks in the Sand: Impossibility of Strong Watermarking for Generative Models☆24Sep 12, 2024Updated last year
- Code for paper "AutoAudit: Mining Accounting and Time-Evolving Graphs" (Big Data 2020)☆19Aug 23, 2023Updated 2 years ago
- An unofficial PyTorch implementation of MixMatch - A Holistic Approach to Semi-Supervised Learning☆14Aug 10, 2021Updated 4 years ago
- [ECCV2022] Motion Sensitive Contrastive Learning for Self-supervised Video Representation☆17Aug 12, 2022Updated 3 years ago
- Röttger et al. (2025): "MSTS: A Multimodal Safety Test Suite for Vision-Language Models"☆18Mar 31, 2025Updated last year
- ☆15Mar 7, 2020Updated 6 years ago