[EMNLP 2024] The official GitHub repo for the paper "Course-Correction: Safety Alignment Using Synthetic Preferences"
☆20Oct 2, 2024Updated last year
Alternatives and similar repositories for Course-Correction
Users that are interested in Course-Correction are comparing it to the libraries listed below
Sorting:
- [ACL 2024] The official GitHub repo for the paper "The Earth is Flat because...: Investigating LLMs' Belief towards Misinformation via Pe…☆82Jul 19, 2024Updated last year
- ☆31Feb 23, 2025Updated last year
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"☆152Sep 21, 2024Updated last year
- This is the repo for our paper "Mr-Ben: A Comprehensive Meta-Reasoning Benchmark for Large Language Models"☆51Oct 31, 2024Updated last year
- Official code for the ICCV2023 paper ``One-bit Flip is All You Need: When Bit-flip Attack Meets Model Training''☆20Aug 9, 2023Updated 2 years ago
- Lightweight tool to identify Data Contamination in LLMs evaluation☆53Mar 8, 2024Updated last year
- A Survey of Hallucination in Large Foundation Models☆56Jan 10, 2024Updated 2 years ago
- ☆57Jun 13, 2024Updated last year
- This is the code repository of our submission: Understanding the Dark Side of LLMs’ Intrinsic Self-Correction.☆63Dec 20, 2024Updated last year
- ☆67Feb 13, 2026Updated 2 weeks ago
- [ACL 2025] Are Your LLMs Capable of Stable Reasoning?☆32Aug 5, 2025Updated 6 months ago
- This repository includes code to reproduce the tables in "Loss Landscapes are All You Need: Neural Network Generalization Can Be Explaine…☆39Mar 2, 2023Updated 2 years ago
- This the implementation of LeCo☆31Jan 20, 2025Updated last year
- Simulator.☆101Apr 21, 2025Updated 10 months ago
- Repo accompanying our paper "Do Llamas Work in English? On the Latent Language of Multilingual Transformers".☆80Mar 11, 2024Updated last year
- Evaluate the Quality of Critique☆36Jun 1, 2024Updated last year
- This repo contains the code to reproduce figures in my dissertation "Passive Imaging and Characterization of the Subsurface With Distribu…☆10Jun 14, 2018Updated 7 years ago
- ☆12Sep 21, 2023Updated 2 years ago
- A framework for high-fidelity retrieval augmented generation in industrial knowledge bases. Integrates jargon identification, context rec…☆35Nov 4, 2025Updated 3 months ago
- ☆11Mar 11, 2024Updated last year
- Embodied-Planner-R1: Unleashing Embodied Task Planning Ability in LLMs via Reinforcement Learning☆25Jan 5, 2026Updated last month
- Tensorflow implementation of the paper "Fast Compressive Sensing Using Generative Model with Structed Latent Variables"☆10Apr 7, 2020Updated 5 years ago
- The official repo for "CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models"☆29Updated this week
- The sparse Bayesian learning sandbox☆11Jul 4, 2021Updated 4 years ago
- Official implementation of the paper "On the Importance of Environments in Human-Robot Coordination", published in RSS 2021.☆16May 1, 2024Updated last year
- [ACM MM 2024 (Oral)] Official PyTorch Implementation of Paper "MovingColor: Seamless Fusion of Fine-grained Video Color Enhancement"☆11Dec 30, 2024Updated last year
- [ACM MM 2025] LMM4Edit: Benchmarking and Evaluating Multimodal Image Editing with LMMs☆15Feb 10, 2026Updated 2 weeks ago
- 🎹🎵🎶 A platform to make Original and Cover Visible and Valuable.☆13Nov 8, 2022Updated 3 years ago
- Experiments Notebook of "Understanding the Skill Gap in Recurrent Language Models: The Role of the Gather-and-Aggregate Mechanism"☆14Apr 30, 2025Updated 10 months ago
- Prediction of glycopeptide fragment mass spectra by deep learning☆10Feb 20, 2024Updated 2 years ago
- Generative Models for Low Rank Video Representation and Reconstruction☆10May 20, 2019Updated 6 years ago
- ☆15Jan 25, 2025Updated last year
- [TMLR 25] An automated method for explaining complex neuron behaviors in deep vision models using large language models☆10Feb 20, 2025Updated last year
- ACL24☆11Jun 7, 2024Updated last year
- Code for MERMAID : Metaphor Generation with Symbolism and Discriminative Decoding☆11May 2, 2022Updated 3 years ago
- A simple ChatGPT plugin to manage upcoming AI conferences. Best way to learn ChatGPT plugin development.☆11May 14, 2023Updated 2 years ago
- ☆11Oct 8, 2023Updated 2 years ago
- ☆11Jan 19, 2025Updated last year
- ☆12Jun 9, 2025Updated 8 months ago