Learning and research after DeepSeek-R1, around test-time computing, resurgence of RL, and new LLM learning/application paradigms.
☆19 Feb 23, 2026 Updated last week
Alternatives and similar repositories for Post-DeepSeek-R1_LLM-RL
Users who are interested in Post-DeepSeek-R1_LLM-RL are comparing it to the libraries listed below
- ☆49 Mar 7, 2025 Updated 11 months ago
- The code implementation of the paper Stumbling Blocks: Stress Testing the Robustness of Machine-Generated Text Detectors Under Attacks (A… ☆13 Jul 16, 2024 Updated last year
- [ICML 2025] Official repository for paper "OR-Bench: An Over-Refusal Benchmark for Large Language Models" ☆23 Mar 4, 2025 Updated last year
- ☆17 Mar 22, 2025 Updated 11 months ago
- Accompanying repo for the DP2O paper accepted by AAAI 2024 main conference ☆17 Mar 28, 2024 Updated last year
- This repository includes the code implementation of the paper Improving Pacing in Long-Form Story Planning by Yichen Wang, Kevin Yang, Xi… ☆16 Nov 19, 2024 Updated last year
- Forcing Diffuse Distributions out of Language Models ☆18 Sep 10, 2024 Updated last year
- ☆27 Nov 27, 2025 Updated 3 months ago
- The MiniAgents visualization tool for simulacra. ☆17 Apr 18, 2024 Updated last year
- MoCo: A One-Stop Shop for Model Collaboration Research ☆48 Feb 24, 2026 Updated last week
- Code for the AAAI 2023 Paper "Real or Fake Text?: Investigating Human Ability to Detect Boundaries Between Human-Written and Machine-Gene… ☆17 Oct 29, 2024 Updated last year
- PyDictionary is an offline English dictionary made using Python along with the Wordnet Lexical Database and Enchant Spell Dictionary. The… ☆19 May 16, 2021 Updated 4 years ago
- Official implementation for the paper "Transferring Visual Knowledge with Pre-trained Models for Multimodal Machine Translation", publish… ☆20 Jun 3, 2024 Updated last year
- Codebase for LLM story generation; updated version of https://github.com/yangkevin2/doc-story-generation ☆86 Feb 21, 2026 Updated last week
- [ACL2025 Best Paper] Language Models Resist Alignment ☆43 Jun 11, 2025 Updated 8 months ago
- Official Github repo for the paper "Evaluating the Evaluation of Diversity in Natural Language Generation" ☆20 Feb 23, 2021 Updated 5 years ago
- ☆27 Feb 17, 2026 Updated 2 weeks ago
- Official code for the ICLR 2025 paper, "Scaling Offline Model-Based RL via Jointly-Optimized World-Action Model Pretraining" ☆28 Dec 1, 2024 Updated last year
- Modular Pluralism @ EMNLP 2024 ☆23 Sep 20, 2024 Updated last year
- [ICML 2024] Official implementation for "HALC: Object Hallucination Reduction via Adaptive Focal-Contrast Decoding" ☆109 Dec 4, 2024 Updated last year
- AbstainQA, ACL 2024 ☆29 Feb 4, 2026 Updated last month
- ☆26 Nov 21, 2022 Updated 3 years ago
- Official code for ICML 2024 paper, "RIME: Robust Preference-based Reinforcement Learning with Noisy Preferences" (ICML 2024 Spotlight) ☆35 Oct 15, 2024 Updated last year
- Preparing for ML Interviews. ☆54 Jan 12, 2026 Updated last month
- The information of NLP PhD application in the world. ☆37 Aug 27, 2024 Updated last year
- [EMNLP 2025 Main] ConceptVectors Benchmark and Code for the paper "Intrinsic Evaluation of Unlearning Using Parametric Knowledge Traces" ☆39 Aug 20, 2025 Updated 6 months ago
- The official repository for "Rongsheng Wang's Arxiv Template" ☆55 May 7, 2025 Updated 9 months ago
- This repository contains the dataset and code for "WiCE: Real-World Entailment for Claims in Wikipedia" in EMNLP 2023. ☆42 Dec 15, 2023 Updated 2 years ago
- This is code for most of the experiments in the paper Understanding the Effects of RLHF on LLM Generalisation and Diversity ☆47 Jan 19, 2024 Updated 2 years ago
- A Python Commonsense Knowledge Inference Toolkit ☆63 Dec 13, 2023 Updated 2 years ago
- The GitHub repo for Goal Driven Discovery of Distributional Differences via Language Descriptions ☆72 Mar 26, 2023 Updated 2 years ago
- A large scale Humor Dataset, containing more than 550k rated English jokes (LREC'20) ☆73 Jun 12, 2023 Updated 2 years ago
- [AAAI 2025 oral] Evaluating Mathematical Reasoning Beyond Accuracy ☆77 Oct 9, 2025 Updated 4 months ago
- Open One-Stop Moderation Tools for Safety Risks, Jailbreaks, and Refusals of LLMs ☆108 Dec 2, 2024 Updated last year
- RAID is the largest and most challenging benchmark for AI-generated text detection. (ACL 2024) ☆158 Feb 25, 2026 Updated last week
- ☆101 Aug 24, 2022 Updated 3 years ago
- Codes and datasets of the paper Red-Teaming Large Language Models using Chain of Utterances for Safety-Alignment ☆108 Mar 8, 2024 Updated last year
- Co-LLM: Learning to Decode Collaboratively with Multiple Language Models ☆126 May 7, 2024 Updated last year
- ☆114 Jun 9, 2022 Updated 3 years ago