PRIS-CV/EAFT

EAFT (Entropy-Adaptive Fine-Tuning: Resolving Confident Conflicts to Mitigate Forgetting) official repo

☆105

Alternatives and similar repositories for EAFT

Users who are interested in EAFT are comparing it to the repositories listed below.

PRIS-CV / AutoDriveRL
☆19 · Jun 13, 2025 · Updated last year
PRIS-CV / FakeReasoning
[TIP 2026] Toward Generalizable Forgery Detection and Reasoning.
☆22 · Apr 20, 2026 · Updated 3 months ago
PRIS-CV / CineTechBench
A Benchmark for Cinematographic Technique Understanding and Generation
☆29 · Sep 19, 2025 · Updated 10 months ago
PRIS-CV / GRPO-for-Llava
GRPO Algorithm for Llava Architecture (Based on Verl)
☆49 · May 9, 2025 · Updated last year
RLHFlow / GVM
☆16 · Jul 29, 2025 · Updated 11 months ago
jinpeng0528 / SEFE
☆13 · May 6, 2025 · Updated last year
wutaiqiang / MI
Official code for paper "Revisiting Model Interpolation for Efficient Reasoning"
☆17 · Jul 14, 2026 · Updated last week
AI45Lab / DeepSafe
All-in-One Safety Evaluation Framework
☆51 · Updated this week
Utaotao / ProFit
☆35 · Jan 20, 2026 · Updated 6 months ago
RUC-NLPIR / EnvScaler
The official implementation of "EnvScaler: Scaling Tool-Interactive Environments for LLM Agent via Programmatic Synthesis".
☆175 · Feb 12, 2026 · Updated 5 months ago
emrecanacikgoz / Tool-R0
☆35 · Apr 3, 2026 · Updated 3 months ago
HypherX / Evolution-Analysis
☆25 · Dec 13, 2024 · Updated last year
We-Math / We-Math
The code and data of We-Math, accepted by ACL 2025 main conference.
☆134 · Dec 11, 2025 · Updated 7 months ago
MurrayTom / ToolSafe
Official Implementation of "ToolSafe: Enhancing Tool Invocation Safety of LLM-based Agents via Proactive Step-level Guardrail and Feedbac…
☆70 · Mar 25, 2026 · Updated 3 months ago
fang-d / BlackboardCaptor
A mobile application that can help users get the perfect blackboard photos.
☆25 · Jun 2, 2024 · Updated 2 years ago
csbench / csbench
☆46 · Oct 28, 2025 · Updated 8 months ago
ZunhaiSu / Super-Experts-Profilling
(ICLR 2026) Unveiling Super Experts in Mixture-of-Experts Large Language Models
☆43 · Sep 25, 2025 · Updated 9 months ago
Trae1ounG / BuPO
[arxiv: 2512.19673] Bottom-up Policy Optimization: Your Language Model Policy Secretly Contains Internal Policies
☆60 · Feb 6, 2026 · Updated 5 months ago
WujiangXu / EPO
The code for paper "EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning"
☆40 · Jul 13, 2026 · Updated last week
vyomakesh09 / longagent
LONGAGENT: Scaling Language Models to 128k Context through Multi-Agent Collaboration
☆11 · Mar 11, 2024 · Updated 2 years ago
microsoft / Simia-Agent-Training
Official Implementation of "Simulating Environments with Reasoning Models for Agent Training"
☆65 · Feb 18, 2026 · Updated 5 months ago
tpoisonooo / open-r1
Fully open reproduction of DeepSeek-R1
☆11 · Mar 24, 2025 · Updated last year
AI45Lab / DeepScan
Diagnostic Framework for LLMs and MLLMs
☆39 · Mar 2, 2026 · Updated 4 months ago
MikaStars39 / PeRL
PeRL: Parameter-Efficient Reinforcement Learning
☆81 · May 20, 2026 · Updated 2 months ago
idanshen / Self-Distillation
☆657 · Apr 7, 2026 · Updated 3 months ago
mcleish7 / retrofitting-recurrence
Teaching Pretrained Language Models to Think Deeper with Retrofitted Recurrence
☆68 · Nov 11, 2025 · Updated 8 months ago
caskcsg / lightretriever
Code for LightRetriever: A LLM-based Text Retrieval Architecture with Extremely Faster Query Inference
☆19 · Oct 19, 2025 · Updated 9 months ago
liushulinle / UloRL
An Ultra-Long Output Reinforcement Learning Approach
☆23 · Jul 31, 2025 · Updated 11 months ago
ZHUANGHP / Any-SSR
This is the official code for Any-SSR "Analytic Subspace Routing: How Recursive Least Squares Works in Continual Learning of Large Langua…
☆27 · Jan 31, 2026 · Updated 5 months ago
shangshang-wang / Resa
Resa: Transparent Reasoning Models via SAEs
☆50 · Sep 23, 2025 · Updated 9 months ago
gouki510 / Topology_of_Reasoning
☆42 · Jun 11, 2025 · Updated last year
SagnikMukherjee / sparsity_in_rl
Reinforcement Learning Finetunes Small Subnetworks in Large Language Models
☆15 · Oct 20, 2025 · Updated 9 months ago
adobe-research / vaw_dataset
This repository provides data for the VAW dataset as described in the CVPR 2021 paper titled "Learning to Predict Visual Attributes in th…
☆72 · Jul 22, 2022 · Updated 3 years ago
Minato-Zackie / SMoLoRA
This is the official code implementation of "SMoLoRA: Exploring and Defying Dual Catastrophic Forgetting in Continual Visual Instruction …
☆17 · Feb 27, 2026 · Updated 4 months ago
RUCAIBox / Passk_Training
The official repository of paper "Pass@k Training for Adaptively Balancing Exploration and Exploitation of Large Reasoning Models"
☆113 · Aug 15, 2025 · Updated 11 months ago
TransluceAI / introspective-interp
Repository for "Training Language Models To Explain Their Own Computations"
☆23 · Jul 7, 2026 · Updated 2 weeks ago
beanie00 / self-distillation-analysis
Codebase for the work “Why Does Self-Distillation (Sometimes) Degrade the Reasoning Capability of LLMs?”
☆74 · Apr 14, 2026 · Updated 3 months ago
PreckLi / MIP-Editor
Official implementation of Cross-Modal Unlearning via Influential Neuron Path Editing in Multimodal Large Language Models
☆16 · Mar 21, 2026 · Updated 4 months ago
lasgroup / SDPO
Reinforcement Learning via Self-Distillation (SDPO)
☆1,017 · Jul 1, 2026 · Updated 2 weeks ago