Code for the paper "Self-Detoxifying Language Models via Toxification Reversal" (EMNLP 2023)
☆18Oct 17, 2023Updated 2 years ago
Alternatives and similar repositories for ToxificationReversal
Users that are interested in ToxificationReversal are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Codes for Mitigating Unhelpfulness in Emotional Support Conversations with Multifaceted AI Feedback (ACL 2024 Findings)☆16Jul 2, 2024Updated last year
- ☆21Aug 9, 2024Updated last year
- [ACL 2024] Instruct Once, Chat Consistently in Multiple Rounds: An Efficient Tuning Framework for Dialogue☆26Oct 18, 2025Updated 7 months ago
- PyTorch implementation of CARE☆16Oct 6, 2023Updated 2 years ago
- PyTorch implementation of experiments in the paper Aligning Language Models with Human Preferences via a Bayesian Approach☆32Nov 6, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆11Nov 8, 2022Updated 3 years ago
- Dialogue Planning via Brownian Bridge Stochastic Process for Goal-directed Proactive Dialogue (ACL Findings 2023)☆21Nov 10, 2025Updated 7 months ago
- Official Implementation for the paper "Integrative Decoding: Improving Factuality via Implicit Self-consistency"☆33Apr 12, 2025Updated last year
- Official repository of S-Agents: Self-organizing Agents in Open-ended Environment☆26Mar 15, 2024Updated 2 years ago
- [ACL 2023] ORGAN: Observation-Guided Radiology Report Generation via Tree Reasoning☆55Oct 3, 2024Updated last year
- [EMNLP 2024 Findings] ICON: Improving Inter-Report Consistency in Radiology Report Generation via Lesion-aware Mixup Augmentation☆19Dec 11, 2024Updated last year
- ☆19Mar 10, 2025Updated last year
- [ACL 2025 Findings] Training Turn-by-Turn Verifiers for Dialogue Tutoring Agents: The Curious Case of LLMs as Your Coding Tutors☆88Jun 2, 2025Updated last year
- ☆18Jun 24, 2025Updated 11 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [ACL 2025] RADAR: Enhancing Radiology Report Generation with Supplementary Knowledge Injection☆34Jul 23, 2025Updated 10 months ago
- mPLM-Sim: Better Cross-Lingual Similarity and Transfer in Multilingual Pretrained Language Models☆11Jan 19, 2024Updated 2 years ago
- This repository provides the data and the codes used in the AAAI'24 paper, COOPER: Coordinating Specialized Agents towards a Complex Dial…☆28Mar 1, 2024Updated 2 years ago
- [TOIS 2024] Target-constrained Bidirectional Planning for Generation of Target-oriented Proactive Dialogue☆13Oct 18, 2025Updated 7 months ago
- Developing a Korean LLM model : Hate Speech Filtering, Improving conversational skills, Finetuning with the RLHF method☆19May 27, 2025Updated last year
- Codes for our paper "Enhancing Continual Relation Extraction via Classifier Decomposition" (Findings of ACL2023)☆10Nov 29, 2023Updated 2 years ago
- [ICLR 2025] SWIFT: On-the-Fly Self-Speculative Decoding for LLM Inference Acceleration☆69Feb 21, 2025Updated last year
- Released Code for ACL 21 paper: DocOIE A Document-level Context-Aware Dataset for OpenIE☆15Nov 25, 2022Updated 3 years ago
- Fine grained Empathy Direction Detection☆16Dec 11, 2020Updated 5 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆14Apr 22, 2024Updated 2 years ago
- "CBF-LLM: Safe Control for LLM Alignment"☆12Dec 10, 2024Updated last year
- ☆16Aug 14, 2022Updated 3 years ago
- Evaluate the Quality of Critique☆37Jun 1, 2024Updated 2 years ago
- Official code for paper "SPA-RL: Reinforcing LLM Agent via Stepwise Progress Attribution"☆86Sep 13, 2025Updated 9 months ago
- Momentum Decoding: Open-ended Text Generation as Graph Exploration☆19Jan 27, 2023Updated 3 years ago
- ☆11Jun 14, 2024Updated 2 years ago
- code for ACL 2019 paper "cross lingual training for automatic question generation"☆14Jun 30, 2019Updated 6 years ago
- Code for "Never Too Late to Learn: Regularizing Gender bias in Coreference Resolution"☆43Jun 14, 2024Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆11Oct 3, 2024Updated last year
- (ACL2025 Findings) Official code for the paper "STeCa: Step-level Trajectory Calibration for LLM Agent Learning"☆28Mar 2, 2026Updated 3 months ago
- A PyTorch implementation of "Revisiting Multi-Task Learning with ROCK: a Deep Residual Auxiliary Block for Visual Detection"☆14Jun 29, 2020Updated 5 years ago
- On Infusing Reachability-Based Safety Assurance within Probabilistic Planning Frameworks for Human-Robot Vehicle Interactions☆17Jul 10, 2020Updated 5 years ago
- PrivacyAsst: Safeguarding User Privacy in Tool-Using Large Language Model Agents (TDSC 2024)☆19Mar 29, 2024Updated 2 years ago
- Finetune t5 and bart on Chinese Grammatical Error Correction data.☆19Aug 24, 2022Updated 3 years ago
- Applies ROME and MEMIT on Mamba-S4 models☆15Apr 5, 2024Updated 2 years ago