jzhou316 / Post-DeepSeek-R1_LLM-RL
Learning and research after DeepSeek-R1, around test-time computing, resurgence of RL, and new LLM learning/application paradigms.
☆19 Updated 2 weeks ago
Alternatives and similar repositories for Post-DeepSeek-R1_LLM-RL
Users who are interested in Post-DeepSeek-R1_LLM-RL are comparing it to the repositories listed below
- A versatile toolkit for applying Logit Lens to modern large language models (LLMs). Currently supports Llama-3.1-8B and Qwen-2.5-7B, enab… ☆137 Updated 4 months ago
- A curated list of LLM Interpretability related material - Tutorial, Library, Survey, Paper, Blog, etc. ☆286 Updated 9 months ago
- Performant framework for training, analyzing and visualizing Sparse Autoencoders (SAEs) and their frontier variants. ☆168 Updated this week
- ☆59 Updated 2 years ago
- ☆223 Updated last year
- awesome SAE papers ☆69 Updated 6 months ago
- awesome papers in LLM interpretability ☆596 Updated 4 months ago
- 📜 Paper list on decoding methods for LLMs and LVLMs ☆67 Updated last month
- A resource repository for machine unlearning in large language models ☆513 Updated 5 months ago
- [NeurIPS 2024] How do Large Language Models Handle Multilingualism? ☆46 Updated last year
- Must-read Papers on Knowledge Editing for Large Language Models. ☆1,206 Updated 5 months ago
- A resource repository for representation engineering in large language models ☆143 Updated last year
- This is the repository for our paper: Untying the Reversal Curse via Bidirectional Language Model Editing ☆11 Updated 6 months ago
- ☆26 Updated 3 weeks ago
- This repository contains a regularly updated paper list for LLMs-reasoning-in-latent-space. ☆235 Updated last week
- [NeurIPS D&B '25] The one-stop repository for large language model (LLM) unlearning. Supports TOFU, MUSE, WMDP, and many unlearning metho… ☆448 Updated 2 weeks ago
- Paper list for Efficient Reasoning. ☆768 Updated last week
- code for EMNLP 2024 paper: Neuron-Level Knowledge Attribution in Large Language Models ☆48 Updated last year
- Awesome-Long2short-on-LRMs is a collection of state-of-the-art, novel, exciting long2short methods on large reasoning models. It contains… ☆254 Updated 4 months ago
- Source code of our paper MIND, ACL 2024 Long Paper ☆58 Updated last month
- A curated list of personalized alignment resources (continually updated). ☆52 Updated last month
- ☆89 Updated 11 months ago
- ☆240 Updated last year
- ☆55 Updated 6 months ago
- A package to evaluate factuality of long-form generation. Original implementation of our EMNLP 2023 paper "FActScore: Fine-grained Atomic… ☆411 Updated 8 months ago
- FeatureAlignment = Alignment + Mechanistic Interpretability ☆33 Updated 9 months ago
- This is a collection of research papers for Self-Correcting Large Language Models with Automated Feedback. ☆561 Updated last year
- ☆20 Updated last year
- [AAAI 2025] Assessing the Creativity of LLMs in Proposing Novel Solutions to Mathematical Problems ☆12 Updated 7 months ago
- A curated list of resources for activation engineering ☆119 Updated 2 months ago