jzhou316 / Post-DeepSeek-R1_LLM-RL
Learning and research after DeepSeek-R1, around test-time computing, resurgence of RL, and new LLM learning/application paradigms.
☆19 Updated last month
Alternatives and similar repositories for Post-DeepSeek-R1_LLM-RL
Users that are interested in Post-DeepSeek-R1_LLM-RL are comparing it to the libraries listed below
- ☆61 Updated 2 years ago
- A curated list of LLM interpretability-related material: tutorials, libraries, surveys, papers, blogs, etc. ☆295 Updated 2 weeks ago
- This is the repository for our paper: Untying the Reversal Curse via Bidirectional Language Model Editing ☆11 Updated 8 months ago
- Official style files for papers submitted to venues of the Association for Computational Linguistics ☆1,506 Updated 2 months ago
- ☆230 Updated last year
- A versatile toolkit for applying Logit Lens to modern large language models (LLMs). Currently supports Llama-3.1-8B and Qwen-2.5-7B, enab… ☆155 Updated 5 months ago
- A package to evaluate factuality of long-form generation. Original implementation of our EMNLP 2023 paper "FActScore: Fine-grained Atomic… ☆415 Updated 9 months ago
- ☆642 Updated 6 months ago
- Tools for checking ACL paper submissions ☆895 Updated 2 months ago
- ☆41 Updated last week
- ☆14 Updated last year
- Source code of our paper MIND, ACL 2024 Long Paper ☆60 Updated 2 months ago
- [NeurIPS 2024] How do Large Language Models Handle Multilingualism? ☆51 Updated last year
- Performant framework for training, analyzing and visualizing Sparse Autoencoders (SAEs) and their frontier variants. ☆177 Updated this week
- Must-read Papers on Knowledge Editing for Large Language Models. ☆1,212 Updated 6 months ago
- Awesome-Long2short-on-LRMs is a collection of state-of-the-art, novel, exciting long2short methods on large reasoning models. It contains… ☆258 Updated 5 months ago
- awesome SAE papers ☆71 Updated 8 months ago
- ☆27 Updated 2 months ago
- This is a collection of research papers for Self-Correcting Large Language Models with Automated Feedback. ☆565 Updated last year
- This is the repository of HaluEval, a large-scale hallucination evaluation benchmark for Large Language Models. ☆552 Updated last year
- A simple toolkit for benchmarking LLMs on mathematical reasoning tasks. 🧮✨ ☆273 Updated last year
- Official code for the paper Towards Fully Exploiting LLM Internal States to Enhance Knowledge Boundary Perception. The code is based on t… ☆19 Updated 6 months ago
- Towards a Mechanistic Understanding of Large Reasoning Models: A Survey of Training, Inference, and Failures ☆30 Updated 2 weeks ago
- awesome papers in LLM interpretability ☆609 Updated 5 months ago
- ☆91 Updated last year
- ☆247 Updated last year
- Paper list for Efficient Reasoning. ☆822 Updated last week
- ☆38 Updated 2 years ago
- A Survey on Data Selection for Language Models ☆253 Updated 9 months ago
- ☆41 Updated last year