Moonshot - A simple and modular tool to evaluate and red-team any LLM application.
☆314Feb 5, 2026Updated last month
Alternatives and similar repositories for moonshot
Users that are interested in moonshot are comparing it to the libraries listed below
Sorting:
- This repository stems from our paper, “Cataloguing LLM Evaluations”, and serves as a living, collaborative catalogue of LLM evaluation fr…☆19Nov 16, 2023Updated 2 years ago
- Parallel Universal Dependencies.☆15Nov 12, 2025Updated 3 months ago
- LLM evaluation.☆16Nov 7, 2023Updated 2 years ago
- ☆25Nov 27, 2023Updated 2 years ago
- Code for paper: "Executing Arithmetic: Fine-Tuning Large Language Models as Turing Machines"☆11Oct 11, 2024Updated last year
- this is based on the paper Chain-of-Retrieval Augmented Generation☆14Mar 29, 2025Updated 11 months ago
- An isomorphic Javascript client for Supabase.☆10Oct 24, 2022Updated 3 years ago
- LLM red teaming datasets from the paper 'Student-Teacher Prompting for Red Teaming to Improve Guardrails' for the ART of Safety Workshop …☆22Oct 12, 2023Updated 2 years ago
- A minimal yet unstoppable blueprint for multi-agent AI—anchored by the rare, far-reaching “Multi-Agent AI DAO” (2017 Prior Art)—empowerin…☆32Jan 11, 2025Updated last year
- ☆16Feb 4, 2026Updated last month
- ☆13Jun 15, 2024Updated last year
- Shan Natural Language Processing tools inspired by PythaiNLP☆14Updated this week
- A free and open source template to host your links and socials. Built with Astro, Tailwind CSS, and Keystatic CMS by Cosmic Themes.☆16May 9, 2025Updated 9 months ago
- A lightweight record-replay reverse proxy for testing☆21Feb 20, 2026Updated 2 weeks ago
- AI Security Research☆15Jun 21, 2023Updated 2 years ago
- A step-by-step tutorial for publishing data and an ontology as Linked Data on your machine.☆14May 9, 2023Updated 2 years ago
- Recursive Abstractive Processing for Tree-Organized Retrieval☆10May 30, 2024Updated last year
- Bundle of security analysis scripts for keras tensorflow models☆16Apr 15, 2024Updated last year
- Panda Guard is designed for researching jailbreak attacks, defenses, and evaluation algorithms for large language models (LLMs).☆64Jan 19, 2026Updated last month
- A basic ls replacement, written in rust, using cursor ai and Geoffrey Huntley's techniques☆29Mar 3, 2025Updated last year
- ☆18Mar 25, 2025Updated 11 months ago
- ☆20Nov 20, 2024Updated last year
- 🧾 | Use these AI prompts to refine your searches, improve accuracy, and get detailed, context-driven responses that precisely match your…☆21May 11, 2025Updated 9 months ago
- exploiting and defending neural networks(神经网络攻防专栏)☆15Mar 2, 2021Updated 5 years ago
- Generative AI Governance for Enterprises☆16Dec 29, 2024Updated last year
- Resources for paper "DialSummEval: Revisiting summarization evaluation for dialogues"☆15Jul 22, 2025Updated 7 months ago
- Watch LLMs duke it out on a simulated CPU space.☆17Mar 7, 2025Updated last year
- Inspect: A framework for large language model evaluations☆1,800Updated this week
- A security-first linter for code that shouldn't need linting☆18Sep 12, 2023Updated 2 years ago
- ☆40May 4, 2024Updated last year
- Independent robustness evaluation of Improving Alignment and Robustness with Short Circuiting☆18Apr 15, 2025Updated 10 months ago
- [NAACL 2025] Representing Rule-based Chatbots with Transformers☆23Feb 9, 2025Updated last year
- The Security Toolkit for LLM Interactions☆2,620Dec 15, 2025Updated 2 months ago
- OpenThaiRAG is an open-source Retrieval-Augmented Generation (RAG) framework designed specifically for Thai language processing. This pro…☆48Dec 13, 2024Updated last year
- Java library to tokenize Thai text into a list of TCCs☆19May 30, 2017Updated 8 years ago
- ☆18Jun 4, 2025Updated 9 months ago
- ☆50Aug 3, 2024Updated last year
- Adding guardrails to large language models.☆6,492Updated this week
- ☆28Updated this week