jzhou316 / Post-DeepSeek-R1_LLM-RL
Learning and research after DeepSeek-R1, around test-time computing, resurgence of RL, and new LLM learning/application paradigms.
☆18Updated last week
Alternatives and similar repositories for Post-DeepSeek-R1_LLM-RL
Users who are interested in Post-DeepSeek-R1_LLM-RL are comparing it to the libraries listed below.
- A versatile toolkit for applying Logit Lens to modern large language models (LLMs). Currently supports Llama-3.1-8B and Qwen-2.5-7B, enab…☆130Updated 3 months ago
- ☆57Updated last year
- A curated list of LLM Interpretability related material - Tutorial, Library, Survey, Paper, Blog, etc.☆285Updated 8 months ago
- [ACL 2025] Adaptive Retrieval without Self-Knowledge? Bringing Uncertainty Back Home☆15Updated 6 months ago
- This is the repository for our paper: Untying the Reversal Curse via Bidirectional Language Model Editing☆11Updated 6 months ago
- Official style files for papers submitted to venues of the Association for Computational Linguistics☆1,309Updated 2 weeks ago
- awesome SAE papers☆60Updated 6 months ago
- ☆214Updated last year
- Source code of our paper MIND, ACL 2024 Long Paper☆57Updated 2 weeks ago
- awesome papers in LLM interpretability☆584Updated 3 months ago
- A package to evaluate factuality of long-form generation. Original implementation of our EMNLP 2023 paper "FActScore: Fine-grained Atomic…☆406Updated 7 months ago
- Awesome-Long2short-on-LRMs is a collection of state-of-the-art, novel, exciting long2short methods on large reasoning models. It contains…☆255Updated 3 months ago
- A resource repository for machine unlearning in large language models☆509Updated 4 months ago
- Performant framework for training, analyzing and visualizing Sparse Autoencoders (SAEs) and their frontier variants.☆164Updated this week
- Must-read Papers on Knowledge Editing for Large Language Models.☆1,203Updated 4 months ago
- ☆237Updated last year
- ☆87Updated 11 months ago
- The official GitHub repository of the paper "Recent advances in large language model benchmarks against data contamination: From static t…☆47Updated 2 months ago
- This repository contains a regularly updated paper list for LLMs-reasoning-in-latent-space.☆194Updated 2 weeks ago
- [NeurIPS 2024] How do Large Language Models Handle Multilingualism?☆46Updated last year
- Personalized Steering of Large Language Models: Versatile Steering Vectors Through Bi-directional Preference Optimization☆37Updated last year
- ☆55Updated last year
- Tools for checking ACL paper submissions☆847Updated 2 months ago
- [AAAI 2025] Assessing the Creativity of LLMs in Proposing Novel Solutions to Mathematical Problems☆12Updated 6 months ago
- [NeurIPS D&B '25] The one-stop repository for large language model (LLM) unlearning. Supports TOFU, MUSE, WMDP, and many unlearning metho…☆430Updated last month
- 📜 Paper list on decoding methods for LLMs and LVLMs☆65Updated 3 weeks ago
- Paper list for Efficient Reasoning.☆732Updated last week
- Official Repository for The Paper: Safety Alignment Should Be Made More Than Just a Few Tokens Deep☆165Updated 7 months ago
- This is a collection of research papers for Self-Correcting Large Language Models with Automated Feedback.☆557Updated last year
- ☆174Updated last year