Official GitHub repository for SafeDialBench, a comprehensive multi-turn dialogue benchmark for evaluating the safety of LLMs.
☆46 · May 12, 2025 · Updated 10 months ago
Alternatives and similar repositories for SafeDialBench-Dataset
Users that are interested in SafeDialBench-Dataset are comparing it to the libraries listed below.
- Official implementation of the paper “Reconsidering Overthinking: Penalizing Internal and External Redundancy in CoT Reasoning” ☆20 · Aug 20, 2025 · Updated 7 months ago
- Implementation of the paper "Multi-Agent Exploration via Self-Learning and Social Learning" ☆20 · Dec 7, 2024 · Updated last year
- Implementation of the paper "WToE: Learning When to Explore in Multi-Agent Reinforcement Learning" ☆21 · Aug 17, 2024 · Updated last year
- ManifoldAlignmentStyleTransfer ☆45 · Feb 24, 2022 · Updated 4 years ago
- Implementation of the paper "Decentralized Counterfactual Value with Threat Detection for Multi-Agent Reinforcement Learning in Mixed Coo… ☆17 · Dec 7, 2024 · Updated last year
- ☆11 · Nov 12, 2024 · Updated last year
- Unpaired Caricature Generation with Multiple Exaggerations (TMM 2021) ☆40 · Jul 14, 2021 · Updated 4 years ago
- Platform for training generalizable deep reinforcement learning agents ☆13 · Mar 4, 2026 · Updated 3 weeks ago
- Exploiting Inter-sample and Inter-feature Relations in Dataset Distillation (CVPR24) ☆11 · Jun 16, 2024 · Updated last year
- Code for "LifeLong Incremental Reinforcement Learning (LLIRL)" ☆21 · Jan 28, 2021 · Updated 5 years ago
- Embodied-Planner-R1: Unleashing Embodied Task Planning Ability in LLMs via Reinforcement Learning ☆25 · Jan 5, 2026 · Updated 2 months ago
- ☆11 · Oct 9, 2022 · Updated 3 years ago
- ☆12 · Jan 14, 2026 · Updated 2 months ago
- Bird’s Eye: Probing for Linguistic Graph Structures with a Simple Information-Theoretic Approach ☆11 · Aug 1, 2021 · Updated 4 years ago
- A unified evaluation toolkit and leaderboard for rigorously assessing the scientific intelligence of large language and vision–language m… ☆76 · Feb 27, 2026 · Updated last month
- Official PyTorch code for "Sample Efficient Offline-to-Online Reinforcement Learning" in TKDE'23. ☆16 · Aug 14, 2023 · Updated 2 years ago
- knowledge distillation for few-shot learning ☆15 · Dec 27, 2023 · Updated 2 years ago
- ☆22 · Jan 5, 2025 · Updated last year
- This repository contains a collection of the most influential papers, and benchmarks related to Large Language Models (LLMs) based Agent … ☆49 · Jul 7, 2025 · Updated 8 months ago
- Survey on Robust Weakly Supervised Learning ☆13 · Dec 23, 2021 · Updated 4 years ago
- 📚 List of Top-tier Conference Papers on Reinforcement Learning (RL), including: NeurIPS, AAAI, IJCAI, ICML, AAMAS, ICLR, ICRA, etc. | (AI… ☆11 · Aug 20, 2023 · Updated 2 years ago
- FSMIS via GMRD ☆19 · Dec 30, 2024 · Updated last year
- ☆25 · Apr 22, 2025 · Updated 11 months ago
- OmniGAIA: Towards Native Omni-Modal AI Agents ☆82 · Mar 16, 2026 · Updated last week
- egocentric humanoid manipulation benchmark ☆60 · Dec 4, 2025 · Updated 3 months ago
- Official codebase for "STAIR: Improving Safety Alignment with Introspective Reasoning" ☆88 · Feb 26, 2025 · Updated last year
- [ICCV 2021 Oral] Mining Latent Classes for Few-shot Segmentation ☆75 · Sep 25, 2021 · Updated 4 years ago
- NJU OS lab 2023 ☆12 · Apr 26, 2023 · Updated 2 years ago
- AdaptiveStep: Automatically Dividing Reasoning Step through Model Confidence ☆10 · Mar 2, 2025 · Updated last year
- An efficient Python toolkit for Abductive Learning (ABL), a novel paradigm that integrates machine learning and logical reasoning in a un… ☆88 · Mar 18, 2026 · Updated last week
- Code for Interpretable Counting for Visual Question Answering for ICLR 2018 reproducibility challenge. ☆20 · Jun 28, 2018 · Updated 7 years ago
- This is the official repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data" ☆17 · Feb 22, 2024 · Updated 2 years ago
- DreamSmooth: Improving Model-Based RL with Reward Smoothing (ICLR 2024) ☆12 · May 6, 2024 · Updated last year
- [NeurIPS 2023] Official pytorch implementation of "Domain Re-Modulation for Few-Shot Generative Domain Adaption" ☆13 · Aug 2, 2024 · Updated last year
- PICABench: How Far Are We from Physically Realistic Image Editing? ☆36 · Nov 5, 2025 · Updated 4 months ago
- PyTorch code for: Frustratingly Simple Domain Generalization via Image Stylization ☆23 · Jun 25, 2020 · Updated 5 years ago
- [ACL'24 Findings] Official code for "TLCR: Token-Level Continuous Reward for Fine-grained Reinforcement Learning from Human Feedback" ☆12 · Dec 6, 2024 · Updated last year
- ☆46 · Mar 4, 2025 · Updated last year
- Learning globally stable dynamical systems policies through imitation. A modification of the original work, focussing on waypoint-based i… ☆12 · Oct 12, 2024 · Updated last year