shivamag125 / EM_PTLinks
☆19Updated last month
Alternatives and similar repositories for EM_PT
Users that are interested in EM_PT are comparing it to the libraries listed below
Sorting:
- ☆16Updated 3 months ago
- ☆20Updated 7 months ago
- Official repository for paper: O1-Pruner: Length-Harmonizing Fine-Tuning for O1-Like Reasoning Pruning☆88Updated 7 months ago
- [ACL 2025] Knowledge Unlearning for Large Language Models☆42Updated last week
- [ICLR 2025] Code and Data Repo for Paper "Latent Space Chain-of-Embedding Enables Output-free LLM Self-Evaluation"☆80Updated 9 months ago
- ☆26Updated 5 months ago
- [ICLR 2025] Official codebase for the ICLR 2025 paper "Multimodal Situational Safety"☆24Updated 3 months ago
- ☆30Updated 5 months ago
- AdaRFT: Efficient Reinforcement Finetuning via Adaptive Curriculum Learning☆45Updated 3 months ago
- The official repository of NeurIPS'25 paper "Ada-R1: From Long-Cot to Hybrid-CoT via Bi-Level Adaptive Reasoning Optimization"☆18Updated last week
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning☆86Updated 7 months ago
- Emergent Hierarchical Reasoning in LLMs/VLMs through Reinforcement Learning☆25Updated 2 weeks ago
- The repository of the paper "REEF: Representation Encoding Fingerprints for Large Language Models," aims to protect the IP of open-source…☆63Updated 8 months ago
- ☆131Updated 2 weeks ago
- "Simplicity Prevails: Rethinking Negative Preference Optimization for LLM Unlearning" by Chongyu Fan*, Jiancheng Liu*, Licong Lin*, Jingh…☆32Updated 3 months ago
- SG-Bench: Evaluating LLM Safety Generalization Across Diverse Tasks and Prompt Types☆21Updated 9 months ago
- [arxiv: 2505.02156] Adaptive Thinking via Mode Policy Optimization for Social Language Agents☆43Updated 2 months ago
- RWKU: Benchmarking Real-World Knowledge Unlearning for Large Language Models. NeurIPS 2024☆82Updated 11 months ago
- FeatureAlignment = Alignment + Mechanistic Interpretability☆29Updated 6 months ago
- Official code for SEAL: Steerable Reasoning Calibration of Large Language Models for Free☆41Updated 5 months ago
- ☆50Updated 2 months ago
- This is the official implementation of the paper "S²R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning"☆69Updated 5 months ago
- This is the official GitHub repository for our survey paper "Beyond Single-Turn: A Survey on Multi-Turn Interactions with Large Language …☆115Updated 4 months ago
- ☆67Updated 5 months ago
- ☆43Updated 5 months ago
- ☆40Updated 5 months ago
- This repository contains a regularly updated paper list for LLMs-reasoning-in-latent-space.☆157Updated last week
- A unified suite for generating elite reasoning problems and training high-performance LLMs, including pioneering attention-free architect…☆67Updated 3 months ago
- This repo is for the safety topic, including attacks, defenses and studies related to reasoning and RL☆43Updated 3 weeks ago
- The official GitHub repository of the paper "Recent advances in large langauge model benchmarks against data contamination: From static t…☆45Updated 2 weeks ago