☆19Mar 25, 2024Updated 2 years ago
Alternatives and similar repositories for SimpleSafetyTests
Users that are interested in SimpleSafetyTests are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆14Mar 26, 2025Updated last year
- Code and data for the paper "Steering Conversational Large Language Models for Long Emotional Support Conversations" along with a UI to v…☆15Apr 14, 2025Updated last year
- ☆17May 14, 2025Updated 11 months ago
- ☆22Mar 6, 2024Updated 2 years ago
- Reduction Server in Rust☆14Apr 9, 2024Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Example of using Epochraft to train HuggingFace transformers models with PyTorch FSDP☆11Jan 29, 2024Updated 2 years ago
- Parallel NDJSON Reader for Python☆17Dec 4, 2019Updated 6 years ago
- autoredteam: code for training models that automatically red team other language models☆14Aug 9, 2023Updated 2 years ago
- ICLR2024 Paper. Showing properties of safety tuning and exaggerated safety.☆93May 9, 2024Updated last year
- Official implementation of our LREC-COLING 2024 paper "Generative Multimodal Entity Linking".☆37Feb 27, 2025Updated last year
- Tools for robustness evaluation in interpretability methods☆10Jun 25, 2021Updated 4 years ago
- ☆10Nov 28, 2023Updated 2 years ago
- Codes and data for CIKM 2022 paper "RuDi: Explaining Behavior Sequence Models by Automatic Statistics Generation and Rule Distillation"☆12Aug 16, 2022Updated 3 years ago
- From Accuracy to Robustness: A Study of Rule- and Model-based Verifiers in Mathematical Reasoning.☆25Oct 7, 2025Updated 6 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- CVPR 2019 paper "Disentangling Adversarial Robustness and Generalization".☆14Oct 28, 2019Updated 6 years ago
- [WSDM 2026] LookAhead Tuning: Safer Language Models via Partial Answer Previews☆17Dec 14, 2025Updated 4 months ago
- helper functions for processing and integrating visual language information with Qwen-VL Series Model☆18Aug 30, 2024Updated last year
- ☆15Oct 23, 2023Updated 2 years ago
- Click this --> https://zsdonghao.github.io☆10Apr 14, 2026Updated 2 weeks ago
- Netflix for XBMC☆61Nov 13, 2012Updated 13 years ago
- ☆16May 16, 2025Updated 11 months ago
- ☆16Mar 22, 2025Updated last year
- [ICLR 2025] Code&Data for the paper "Super(ficial)-alignment: Strong Models May Deceive Weak Models in Weak-to-Strong Generalization"☆15Jun 21, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆11Jan 19, 2025Updated last year
- Thesis project about Visual Anomaly Detection based on Self Supervised Learning. The model identifies anomalies from information acquired…☆10Apr 14, 2023Updated 3 years ago
- Multilingual safety benchmark for Large Language Models☆53Sep 1, 2024Updated last year
- Tasks for describing differences between text distributions.☆17Aug 9, 2024Updated last year
- ☆21Jan 11, 2023Updated 3 years ago
- 免费的AI视频生成nonebot插件,支持文生视频和图文生视频☆10May 7, 2025Updated 11 months ago
- ☆13Sep 12, 2024Updated last year
- ☆13Jun 17, 2024Updated last year
- Checkpointable dataset utilities for foundation model training☆32Jan 29, 2024Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Creating the tools and data sets necessary to evaluate vulnerabilities in LLMs.☆27Mar 14, 2025Updated last year
- Notebooks for Scaling Deep Learning Interpretability by Visualizing Activation and Attribution Summarizations☆15Oct 3, 2019Updated 6 years ago
- ☆17Mar 22, 2024Updated 2 years ago
- [ICML 2023] "Data Efficient Neural Scaling Law via Model Reusing" by Peihao Wang, Rameswar Panda, Zhangyang Wang☆14Jan 4, 2024Updated 2 years ago
- ☆20Nov 15, 2024Updated last year
- ☆12Nov 14, 2024Updated last year
- The implementation for our paper, "Improving Simultaneous Machine Translation with Monolingual Data," accepted to AAAI 2023. 🎉☆12Jul 19, 2023Updated 2 years ago