Moonshot - A simple and modular tool to evaluate and red-team any LLM application.
☆322 · Feb 5, 2026 · Updated 3 months ago
Alternatives and similar repositories for moonshot
Users who are interested in moonshot are comparing it to the libraries listed below.
- This repository stems from our paper, “Cataloguing LLM Evaluations”, and serves as a living, collaborative catalogue of LLM evaluation fr… ☆20 · Nov 16, 2023 · Updated 2 years ago
- Papers about red teaming LLMs and Multimodal models. ☆164 · May 28, 2025 · Updated 11 months ago
- ☆10 · Jan 14, 2025 · Updated last year
- A minimal yet unstoppable blueprint for multi-agent AI—anchored by the rare, far-reaching “Multi-Agent AI DAO” (2017 Prior Art)—empowerin… ☆36 · Jan 11, 2025 · Updated last year
- Finetune Code for OpenThaiGPT 0.1.0-beta ☆12 · Nov 11, 2023 · Updated 2 years ago
- ☆13 · Jun 15, 2024 · Updated last year
- 🤯 AI Security EXPOSED! Live Demos Showing Hidden Risks of 🤖 Agentic AI Flows: 💉Prompt Injection, ☣️ Data Poisoning. Watch the recorded… ☆22 · Jul 5, 2024 · Updated last year
- AI Security Research ☆16 · Jun 21, 2023 · Updated 2 years ago
- Code for ICLR 2025 Failures to Find Transferable Image Jailbreaks Between Vision-Language Models ☆36 · Jun 1, 2025 · Updated 11 months ago
- Generative AI Governance for Enterprises ☆16 · Dec 29, 2024 · Updated last year
- Recursive Abstractive Processing for Tree-Organized Retrieval ☆10 · May 30, 2024 · Updated last year
- Independent robustness evaluation of Improving Alignment and Robustness with Short Circuiting ☆17 · Apr 15, 2025 · Updated last year
- S-Eval: Towards Automated and Comprehensive Safety Evaluation for Large Language Models ☆115 · Feb 13, 2026 · Updated 2 months ago
- ☆42 · May 4, 2024 · Updated 2 years ago
- 🤖🛡️🔍🔒🔑 Tiny package designed to support red teams and penetration testers in exploiting large language model AI solutions. ☆26 · May 16, 2024 · Updated last year
- Inspect: A framework for large language model evaluations ☆2,000 · Updated this week
- Adding guardrails to large language models. ☆6,818 · Updated this week
- Run safety benchmarks against AI models and view detailed reports showing how well they performed. ☆124 · May 1, 2026 · Updated last week
- datasets from the paper "Towards Understanding Sycophancy in Language Models" ☆122 · Oct 25, 2023 · Updated 2 years ago
- ☆31 · Jul 14, 2023 · Updated 2 years ago
- LobotoMl is a set of scripts and tools to assess production deployments of ML services ☆10 · May 16, 2022 · Updated 3 years ago
- this is based on the paper Chain-of-Retrieval Augmented Generation ☆14 · Mar 29, 2025 · Updated last year
- This repository includes the code to reproduce our paper [Explainable deepfake and spoofing detection: an attack analysis using SHapley A… ☆12 · Jan 24, 2024 · Updated 2 years ago
- An open source deep research clone. AI Agent (Local LLM or Gemini) that reasons large amounts of web data extracted with SwiftSoup. ☆13 · Feb 10, 2025 · Updated last year
- HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal ☆940 · Aug 16, 2024 · Updated last year
- Perl tools to transform account / transaction data from DBS Bank into proper CSV ☆27 · Jul 21, 2022 · Updated 3 years ago
- Spring to Quarkus exercise ☆26 · Feb 2, 2026 · Updated 3 months ago
- Effective sampling methods within TensorFlow input functions. ☆10 · Mar 24, 2023 · Updated 3 years ago
- ☆18 · Nov 12, 2024 · Updated last year
- The Dataset and Official Implementation for <Discursive Socratic Questioning: Evaluating the Faithfulness of Language Models’ Understandi… ☆18 · Aug 7, 2024 · Updated last year
- source code for the offsecml framework ☆45 · Jun 6, 2024 · Updated last year
- WangchanX Fine-tuning Pipeline ☆46 · Oct 4, 2024 · Updated last year
- Automate your pipelines, streamline your workflows. ☆21 · Updated this week
- A repository that holds templates, examples, and tests to help external parties submit tasks to AISI that conform with the Autonomous Sys… ☆11 · Jan 16, 2026 · Updated 3 months ago
- Official PyTorch implementation of RACRO (https://www.arxiv.org/abs/2506.04559) ☆19 · Jul 1, 2025 · Updated 10 months ago
- Official Repository for Can Language Models be Instructed to Protect Personal Information? ☆13 · Oct 8, 2023 · Updated 2 years ago
- ☆16 · Apr 27, 2024 · Updated 2 years ago
- OpenThaiRAG is an open-source Retrieval-Augmented Generation (RAG) framework designed specifically for Thai language processing. This pro… ☆49 · Dec 13, 2024 · Updated last year
- A step-by-step tutorial for publishing data and an ontology as Linked Data on your machine. ☆14 · May 9, 2023 · Updated 3 years ago