shivamag125 / EM_PTLinks

☆24

Alternatives and similar repositories for EM_PT

Users that are interested in EM_PT are comparing it to the libraries listed below

Sorting:

ZJU-REAL / EasySteer
A Unified Framework for High-Performance and Extensible LLM Steering
☆131Updated last week
TrustedLLM / UnKE
☆22Updated 9 months ago
Alsace08 / Chain-of-Embedding
[ICLR 2025] Code and Data Repo for Paper "Latent Space Chain-of-Embedding Enables Output-free LLM Self-Evaluation"
☆86Updated 11 months ago
ChnQ / MI-Peaks
☆56Updated 4 months ago
eric-ai-lab / MSSBench
[ICLR 2025] Official codebase for the ICLR 2025 paper "Multimodal Situational Safety"
☆30Updated 5 months ago
Raibows / CREAM
Code for "CREAM: Consistency Regularized Self-Rewarding Language Models", ICLR 2025.
☆27Updated 9 months ago
ChnQ / TracingLLM
☆30Updated last year
MozerWang / AMPO
[arxiv: 2505.02156] Adaptive Thinking via Mode Policy Optimization for Social Language Agents
☆46Updated 4 months ago
horseee / CoT-Valve
CoT-Valve: Length-Compressible Chain-of-Thought Tuning
☆87Updated 9 months ago
Joshua-Ren / Learning_dynamics_LLM
☆184Updated 6 months ago
UCSC-VLAA / STAR-1
[AAAI'26 Oral] Official Implementation of STAR-1: Safer Alignment of Reasoning LLMs with 1K Data
☆32Updated 7 months ago
alenai97 / PEFT-MLLM
Official Code and data for ACL 2024 finding, "An Empirical Study on Parameter-Efficient Fine-Tuning for MultiModal Large Language Models"
☆23Updated last year
ZhentingWang / DUMP
☆32Updated 6 months ago
zhyang2226 / AR-Lopti
[AI4MATH@ICML2025] Do Not Let Low-Probability Tokens Over-Dominate in RL for LLMs
☆40Updated 6 months ago
rhyang2021 / ARIA
Source code for our paper: "ARIA: Training Language Agents with Intention-Driven Reward Aggregation".
☆25Updated 3 months ago
StarDewXXX / O1-Pruner
Official repository for paper: O1-Pruner: Length-Harmonizing Fine-Tuning for O1-Like Reasoning Pruning
☆97Updated 9 months ago
lichengliu03 / unary-feedback
☆38Updated 3 months ago
GATECH-EIC / ACT
[ICML 2024] Unveiling and Harnessing Hidden Attention Sinks: Enhancing Large Language Models without Training through Attention Calibrati…
☆46Updated last year
aeroplanepaper / GRPO-LEAD
☆30Updated last week
AI45Lab / REEF
The repository of the paper "REEF: Representation Encoding Fingerprints for Large Language Models," aims to protect the IP of open-source…
☆68Updated 10 months ago
THU-KEG / RM-Bench
[ICLR 25 Oral] RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and Style
☆70Updated 4 months ago
Blueyee / Efficient-CoT-LRMs
Chain of Thoughts (CoT) is so hot! so long! We need short reasoning process!
☆70Updated 7 months ago
ybwang119 / Awesome-reasoning-safety
This repo is for the safety topic, including attacks, defenses and studies related to reasoning and RL
☆52Updated 2 months ago
MikaStars39 / FeatureAlignment
FeatureAlignment = Alignment + Mechanistic Interpretability
☆31Updated 8 months ago
Dereck0602 / Awesome_Test_Time_LLMs
☆131Updated 8 months ago
StarDewXXX / AdaR1
The official repository of NeurIPS'25 paper "Ada-R1: From Long-Cot to Hybrid-CoT via Bi-Level Adaptive Reasoning Optimization"
☆20Updated 3 weeks ago
Jihuai-wpy / InferAligner
☆37Updated last year
sail-sg / LightTrans
The official implementation of "LightTransfer: Your Long-Context LLM is Secretly a Hybrid Model with Effortless Adaptation"
☆20Updated 7 months ago
QingyangZhang / EMPO
[NeurIPS25 Spotlight] EMPO, A Fully Unsupervised RLVR Method
☆84Updated this week
MingyuJ666 / Rope_with_LLM
[ICML'25] Our study systematically investigates massive values in LLMs' attention mechanisms. First, we observe massive values are concen…
☆82Updated 5 months ago