Learning Safety Constraints for Large Language Models (ICML 2025)
☆32Aug 4, 2025Updated 7 months ago
Alternatives and similar repositories for SafetyPolytope
Users that are interested in SafetyPolytope are comparing it to the libraries listed below.
- ☆16Feb 25, 2026Updated 3 weeks ago
- A simple weather display with a cute interactive desktop pet (❛◡❛✿)☆14May 24, 2022Updated 3 years ago
- ☆30Aug 21, 2025Updated 7 months ago
- Scaling safe exploration to vision control☆14Feb 19, 2025Updated last year
- The interface between probabilistic model checking and data-driven policy learning.☆16Mar 11, 2026Updated last week
- Convergent Policy Optimization for Safe Reinforcement Learning☆11Oct 26, 2019Updated 6 years ago
- ☆19Dec 25, 2024Updated last year
- This is a repository of PyTorch implementations of DQN and its variants, based on the original paper.☆13Nov 18, 2019Updated 6 years ago
- Official implementation for ICLR 2025 paper "Distilling Reinforcement Learning Algorithms for In-Context Model-Based Planning"☆20Mar 5, 2025Updated last year
- A scalable Dreamer implementation in JAX☆10May 22, 2022Updated 3 years ago
- ☆19Feb 8, 2024Updated 2 years ago
- LLM - Detect AI Generated Text || Identify which essay was written by a large language model☆17Jan 17, 2024Updated 2 years ago
- ☆21Nov 12, 2019Updated 6 years ago
- Flow RL is a high-performance RL library with flow and diffusion models.☆28Updated this week
- Chinese-Handwriting-Tool☆13Nov 11, 2023Updated 2 years ago
- OODRobustBench: a Benchmark and Large-Scale Analysis of Adversarial Robustness under Distribution Shift. ICML 2024 and ICLRW-DMLR 2024☆23Jul 25, 2024Updated last year
- ☆22Apr 3, 2025Updated 11 months ago
- This repository contains PyTorch implementations of reinforcement learning algorithms. Its purpose is to provide straightforward and easi…☆18Nov 10, 2023Updated 2 years ago
- ☆34May 1, 2025Updated 10 months ago
- Code for AAAI 2023 paper "Hypernetworks for Zero-shot Transfer in Reinforcement Learning"☆22Apr 26, 2023Updated 2 years ago
- Anomalous versions of OpenAI Gym and PyBullet3 environments☆15Oct 24, 2021Updated 4 years ago
- ☆27Sep 15, 2024Updated last year
- Robust and safe deep reinforcement learning algorithms☆16Mar 27, 2024Updated last year
- Reinforcement Learning framework to make synthetic experiments in the financial domain☆23Jul 18, 2023Updated 2 years ago
- ☆25Sep 23, 2024Updated last year
- Implementation of paper 'Defending Large Language Models against Jailbreak Attacks via Semantic Smoothing'☆23Jun 9, 2024Updated last year
- ☆37Aug 21, 2025Updated 7 months ago
- Project exploring 3D volumetric rendering of NEXRAD radar data.☆11Oct 23, 2023Updated 2 years ago
- A collection of meta-learning algorithms in Jax☆24Sep 3, 2022Updated 3 years ago
- Neural Fixed-Point Acceleration for Convex Optimization☆29Oct 6, 2022Updated 3 years ago
- A gym interface for AI safety gridworlds created in pycolab.☆18May 12, 2022Updated 3 years ago
- Comprehensive Assessment of Trustworthiness in Multimodal Foundation Models☆27Mar 15, 2025Updated last year
- PyTorch Implementation of FractalNet☆28Dec 15, 2018Updated 7 years ago
- Code for Fine-grained Uncertainty Quantification for LLMs from Semantic Similarities (NeurIPS'24)☆36Dec 17, 2024Updated last year
- ✒️ A gallery of experiments with Scalable Vector Graphics (SVG) and interactive visualizations.☆13Jan 6, 2023Updated 3 years ago
- Hypercorn is an ASGI and WSGI Server based on Hyper libraries and inspired by Gunicorn.☆14Jan 12, 2026Updated 2 months ago
- ☆19Sep 22, 2025Updated 6 months ago
- ☆27Oct 6, 2024Updated last year
- ☆33Jan 13, 2022Updated 4 years ago