YuejiangLIU / cslView external linksLinks
Co-Supervised Learning: Improving Weak-to-Strong Generalization with Hierarchical Mixture of Experts
☆16Feb 26, 2024Updated last year
Alternatives and similar repositories for csl
Users that are interested in csl are comparing it to the libraries listed below
Sorting:
- [ICLR 2025] Code&Data for the paper "Super(ficial)-alignment: Strong Models May Deceive Weak Models in Weak-to-Strong Generalization"☆14Jun 21, 2024Updated last year
- ☆14Feb 26, 2024Updated last year
- ☆21Jul 9, 2022Updated 3 years ago
- [NeurIPS 2024 Oral] Aligner: Efficient Alignment by Learning to Correct☆191Jan 16, 2025Updated last year
- A Mechanistic Understanding of Alignment Algorithms: A Case Study on DPO and Toxicity.☆85Mar 7, 2025Updated 11 months ago
- TOD-Flow: Modeling the Structure of Task-Oriented Dialogues☆13Feb 7, 2024Updated 2 years ago
- Sound Separation, Omni modal☆28Sep 15, 2025Updated 4 months ago
- Public code release for the paper "Reawakening knowledge: Anticipatory recovery from catastrophic interference via structured training"☆11Oct 27, 2025Updated 3 months ago
- ☆10Mar 28, 2022Updated 3 years ago
- FamilyTool benchmark☆12Sep 10, 2025Updated 5 months ago
- https://avocado-captioner.github.io/☆29Oct 16, 2025Updated 3 months ago
- Scripts for KGIRNet model for ESWC☆10Jul 6, 2023Updated 2 years ago
- Information Extraction related tools and models☆10Mar 16, 2023Updated 2 years ago
- A python tool help to interact with chatgpt.☆10Dec 11, 2022Updated 3 years ago
- Evaluate interpretability methods on localizing and disentangling concepts in LLMs.☆57Oct 30, 2025Updated 3 months ago
- ☆26Feb 4, 2026Updated last week
- Automatically replace full publication names in a bibtex database file into official abbreviated names, or reverse. (Support IEEE/ACM/Sci…☆14Jul 30, 2024Updated last year
- ☆12Jan 2, 2024Updated 2 years ago
- Official eval code for ROVER: Benchmarking Reciprocal Cross-Modal Reasoning for Omnimodal Generation☆27Dec 12, 2025Updated 2 months ago
- ☆14May 30, 2024Updated last year
- Mixture of Global and Local Experts with Diffusion Transformer for Controllable Face Generation☆28Dec 10, 2025Updated 2 months ago
- ☆16Mar 22, 2025Updated 10 months ago
- Codebase for Mechanistic Mode Connectivity☆13Jul 14, 2023Updated 2 years ago
- Official codebase for the NeurIPS 2023 paper: Towards Last-layer Retraining for Group Robustness with Fewer Annotations. https://arxiv.or…☆11May 15, 2024Updated last year
- ☆11Jun 1, 2023Updated 2 years ago
- ☆12Jan 25, 2024Updated 2 years ago
- [ICML 2023] On Pitfalls of Test-Time Adaptation☆124Apr 6, 2024Updated last year
- Dataset Pinocchio for paper "Towards Understanding Factual Knowledge of Large Language Models" accepted by ICLR 2024 (Spotlight)☆12Mar 13, 2024Updated last year
- ☆12Dec 23, 2022Updated 3 years ago
- [EMNLP 2024 Main] Official implementation of the paper "The Accuracy Paradox in RLHF: When Better Reward Models Don't Yield Better Langua…☆13Nov 11, 2024Updated last year
- [ACL 2024] ANAH & [NeurIPS 2024] ANAH-v2 & [ICLR 2025] Mask-DPO☆62Apr 30, 2025Updated 9 months ago
- Use the tokenizer in parallel to achieve superior acceleration☆20Mar 21, 2024Updated last year
- Code for "Automatic Circuit Finding and Faithfulness"☆16Jul 11, 2024Updated last year
- ☆34Feb 11, 2025Updated last year
- ☆12Mar 7, 2024Updated last year
- Code and Data for the ACL21 paper "Modeling Bilingual Conversational Characteristics for Neural Chat Translation"☆12Dec 17, 2021Updated 4 years ago
- ☆12Feb 16, 2023Updated 2 years ago
- ☆21Jun 22, 2025Updated 7 months ago
- ☆20Nov 15, 2024Updated last year