jzhou316 / Post-DeepSeek-R1_LLM-RLLinks
Learning and research after DeepSeek-R1, around test-time computing, resurgence of RL, and new LLM learning/application paradigms.
☆19Updated this week
Alternatives and similar repositories for Post-DeepSeek-R1_LLM-RL
Users that are interested in Post-DeepSeek-R1_LLM-RL are comparing it to the libraries listed below
Sorting:
- ☆61Updated 2 years ago
- A curated list of LLM Interpretability related material - Tutorial, Library, Survey, Paper, Blog, etc..☆290Updated 2 weeks ago
- Tools for checking ACL paper submissions☆870Updated last month
- This is the repository for our paper: Untying the Reversal Curse via Bidirectional Language Model Editing☆11Updated 7 months ago
- ☆26Updated last month
- ☆227Updated last year
- [ACL 2025] Adaptive Retrieval without Self-Knowledge? Bringing Uncertainty Back Home☆15Updated 7 months ago
- awesome papers in LLM interpretability☆602Updated 4 months ago
- A versatile toolkit for applying Logit Lens to modern large language models (LLMs). Currently supports Llama-3.1-8B and Qwen-2.5-7B, enab…☆142Updated 4 months ago
- A package to evaluate factuality of long-form generation. Original implementation of our EMNLP 2023 paper "FActScore: Fine-grained Atomic…☆414Updated 8 months ago
- Performant framework for training, analyzing and visualizing Sparse Autoencoders (SAEs) and their frontier variants.☆168Updated this week
- Official style files for papers submitted to venues of the Association for Computational Linguistics☆1,467Updated last month
- Awesome-Long2short-on-LRMs is a collection of state-of-the-art, novel, exciting long2short methods on large reasoning models. It contains…☆255Updated 4 months ago
- [AAAI 2025] Assessing the Creativity of LLMs in Proposing Novel Solutions to Mathematical Problems☆12Updated 8 months ago
- This is a collection of research papers for Self-Correcting Large Language Models with Automated Feedback.☆562Updated last year
- ☆38Updated 2 weeks ago
- awesome SAE papers☆69Updated 7 months ago
- ☆57Updated 7 months ago
- ☆89Updated last year
- A curated list of personalized alignment resources (continually updated).☆55Updated 2 months ago
- Official code for the paper Towards Fully Exploiting LLM Internal States to Enhance Knowledge Boundary Perception. The code is based on t…☆19Updated 5 months ago
- ☆244Updated last year
- ☆38Updated last year
- This is the repository of HaluEval, a large-scale hallucination evaluation benchmark for Large Language Models.☆541Updated last year
- [NeurIPS 2024] How do Large Language Models Handle Multilingualism?☆48Updated last year
- ☆631Updated 5 months ago
- Must-read Papers on Knowledge Editing for Large Language Models.☆1,209Updated 5 months ago
- [TMLR 2025] Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models☆712Updated 2 months ago
- A resource repository for representation engineering in large language models☆145Updated last year
- A bibliography and survey of the papers surrounding o1☆1,216Updated last year