Feeling confused about super alignment? Here is a reading list
☆44Jan 9, 2024Updated 2 years ago
Alternatives and similar repositories for about-super-alignment
Users that are interested in about-super-alignment are comparing it to the libraries listed below.
- ☆320Jul 16, 2024Updated last year
- Explore what LLMs are really learning over SFT☆28Mar 30, 2024Updated last year
- [ICML 2023] "Data Efficient Neural Scaling Law via Model Reusing" by Peihao Wang, Rameswar Panda, Zhangyang Wang☆14Jan 4, 2024Updated 2 years ago
- [NeurIPS 2025 D&B Track] Evaluation Code Repo for Paper "PolyMath: Evaluating Mathematical Reasoning in Multilingual Contexts"☆43May 22, 2025Updated 10 months ago
- PyTorch implementation of experiments in the paper Aligning Language Models with Human Preferences via a Bayesian Approach☆32Nov 6, 2023Updated 2 years ago
- ☆14Jun 20, 2022Updated 3 years ago
- [ACL 2024] Official GitHub repo for OlympiadBench: A Challenging Benchmark for Promoting AGI with Olympiad-Level Bilingual Multimodal Scie…☆186Jun 8, 2025Updated 9 months ago
- Teaching Models to Express Their Uncertainty in Words☆38May 26, 2022Updated 3 years ago
- ☆25Jan 1, 2025Updated last year
- Crafting Adversarial Examples for Neural Machine Translation☆10Apr 7, 2023Updated 2 years ago
- [EMNLP 2023 Industry Track] A simple prompting approach that enables the LLMs to run inference in batches.☆77Mar 8, 2024Updated 2 years ago
- [ICML 2025] M-STAR (Multimodal Self-Evolving TrAining for Reasoning) Project. Diving into Self-Evolving Training for Multimodal Reasoning☆71Jul 13, 2025Updated 8 months ago
- The implementation for our paper, "Improving Simultaneous Machine Translation with Monolingual Data," accepted to AAAI 2023. 🎉☆12Jul 19, 2023Updated 2 years ago
- Tools for formatting WMT hypothesis and test sets in XML☆27Apr 18, 2025Updated 11 months ago
- MurisPro: professional mouse management software, for the benefit of everyone who needs to run animal experiments☆22Dec 28, 2025Updated 2 months ago
- ☆13Oct 18, 2023Updated 2 years ago
- 🎉 TrustJudge is accepted to ICLR 2026!☆38Sep 27, 2025Updated 5 months ago
- LaTeX Drawing☆18Dec 22, 2025Updated 3 months ago
- Instructions and demonstrations for building a GLM capable of formal logical reasoning☆54Sep 3, 2024Updated last year
- A curated list of papers & resources linked to concept learning☆12Aug 9, 2023Updated 2 years ago
- Paper list and datasets for the paper: A Survey on Data Selection for LLM Instruction Tuning☆47Jan 22, 2026Updated 2 months ago
- Repository for Skill Set Optimization☆14Jul 26, 2024Updated last year
- [NAACL'25 Oral] Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering☆77Jan 16, 2026Updated 2 months ago
- Repository for NPHardEval, a quantified-dynamic benchmark of LLMs☆63Mar 26, 2024Updated last year
- xVerify: Efficient Answer Verifier for Reasoning Model Evaluations☆145Nov 13, 2025Updated 4 months ago
- Implementation of Influence Function approximations for differently sized ML models, using PyTorch☆16Sep 15, 2023Updated 2 years ago
- GAU-alpha-pytorch☆20May 11, 2022Updated 3 years ago
- Paper collection of methods that use language to interact with environments, including interacting with the real world, simulated worlds, or WWW…☆129Jul 26, 2023Updated 2 years ago
- This repository provides an original implementation of Detecting Pretraining Data from Large Language Models by *Weijia Shi, *Anirudh Aji…☆241Nov 3, 2023Updated 2 years ago
- Official github repo for the paper "Compression Represents Intelligence Linearly" [COLM 2024]☆147Sep 20, 2024Updated last year
- Paper list of "The Life Cycle of Knowledge in Big Language Models: A Survey"☆59Aug 24, 2023Updated 2 years ago
- LeetCode Training and Evaluation Dataset☆49Apr 22, 2025Updated 11 months ago
- The code of “Improving Weak-to-Strong Generalization with Scalable Oversight and Ensemble Learning”☆17Feb 26, 2024Updated 2 years ago
- SuperDebug: debugging made this simple!☆17Jul 19, 2022Updated 3 years ago
- Official implementation of Matcha-agent, https://arxiv.org/abs/2303.08268☆28Aug 25, 2024Updated last year
- ☆22Sep 19, 2023Updated 2 years ago
- [NAACL 2024 Outstanding Paper] Source code for the NAACL 2024 paper entitled "R-Tuning: Instructing Large Language Models to Say 'I Don't…☆133Jul 10, 2024Updated last year
- [ICML 2024] Adaptive decoding balances the diversity and coherence of open-ended text generation.☆19Jun 2, 2024Updated last year
- [TNNLS, to appear] FET-LM: Flow Enhanced Variational Auto-Encoder for Topic-Guided Language Modeling PyTorch Implementation☆14Mar 4, 2023Updated 3 years ago