☆18Mar 25, 2024Updated 2 years ago
Alternatives and similar repositories for SimpleSafetyTests
Users who are interested in SimpleSafetyTests are comparing it to the libraries listed below.
- Code and data for the paper "Steering Conversational Large Language Models for Long Emotional Support Conversations" along with a UI to v…☆15Apr 14, 2025Updated 11 months ago
- Data synthesis code for "AGENT: A Benchmark for Core Psychological Reasoning"☆24Mar 3, 2022Updated 4 years ago
- [EMNLP 2024] "ESC-Eval: Evaluating Emotion Support Conversations in Large Language Models"☆26Jun 24, 2024Updated last year
- Parallel NDJSON Reader for Python☆17Dec 4, 2019Updated 6 years ago
- ICLR2024 Paper. Showing properties of safety tuning and exaggerated safety.☆93May 9, 2024Updated last year
- Official implementation of our LREC-COLING 2024 paper "Generative Multimodal Entity Linking".☆37Feb 27, 2025Updated last year
- ☆10Nov 28, 2023Updated 2 years ago
- Your finetuned model's back to its original safety standards faster than you can say "SafetyLock"!☆11Oct 16, 2024Updated last year
- Codes and data for CIKM 2022 paper "RuDi: Explaining Behavior Sequence Models by Automatic Statistics Generation and Rule Distillation"☆12Aug 16, 2022Updated 3 years ago
- From Accuracy to Robustness: A Study of Rule- and Model-based Verifiers in Mathematical Reasoning.☆25Oct 7, 2025Updated 6 months ago
- [WSDM 2026] LookAhead Tuning: Safer Language Models via Partial Answer Previews☆17Dec 14, 2025Updated 3 months ago
- helper functions for processing and integrating visual language information with Qwen-VL Series Model☆18Aug 30, 2024Updated last year
- ☆20Jul 24, 2024Updated last year
- ☆15Oct 23, 2023Updated 2 years ago
- Click this --> https://zsdonghao.github.io☆10Updated this week
- Netflix for XBMC☆61Nov 13, 2012Updated 13 years ago
- ☆18Apr 7, 2025Updated last year
- ☆16May 16, 2025Updated 10 months ago
- ☆16Mar 22, 2025Updated last year
- [ICLR 2025] Code&Data for the paper "Super(ficial)-alignment: Strong Models May Deceive Weak Models in Weak-to-Strong Generalization"☆14Jun 21, 2024Updated last year
- ☆11Jan 19, 2025Updated last year
- A toolkit for testing and improving named entity recognition [ESEC/FSE'23]☆11Aug 31, 2023Updated 2 years ago
- Thesis project about Visual Anomaly Detection based on Self Supervised Learning. The model identifies anomalies from information acquired…☆10Apr 14, 2023Updated 2 years ago
- Tasks for describing differences between text distributions.☆17Aug 9, 2024Updated last year
- Koishi's Day 2024 Paper (NeurIPS 2024): An advanced persona-driven role-playing system with global faithfulness quantification and optimi…☆11Oct 19, 2025Updated 5 months ago
- ☆21Jan 11, 2023Updated 3 years ago
- Free AI video generation nonebot plugin, supporting text-to-video and image-and-text-to-video☆10May 7, 2025Updated 11 months ago
- ☆13Sep 12, 2024Updated last year
- ☆13Jun 17, 2024Updated last year
- Creating the tools and data sets necessary to evaluate vulnerabilities in LLMs.☆27Mar 14, 2025Updated last year
- Implements DP/TP/PP using torch.distributed☆13Dec 28, 2023Updated 2 years ago
- ☆16Mar 22, 2024Updated 2 years ago
- [ICML 2023] "Data Efficient Neural Scaling Law via Model Reusing" by Peihao Wang, Rameswar Panda, Zhangyang Wang☆14Jan 4, 2024Updated 2 years ago
- ☆20Nov 15, 2024Updated last year
- ☆11Nov 14, 2024Updated last year
- The implementation for our paper, "Improving Simultaneous Machine Translation with Monolingual Data," accepted to AAAI 2023. 🎉☆12Jul 19, 2023Updated 2 years ago
- Open-source evaluation toolkit of large vision-language models (LVLMs), support ~100 VLMs, 30+ benchmarks☆15Feb 17, 2025Updated last year
- ☆43Jul 10, 2024Updated last year
- 🐴🐘 Data on Members of the 116th U.S. Congress☆10Dec 11, 2019Updated 6 years ago