zhuchichi56/ASFT

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/zhuchichi56/ASFT)

zhuchichi56 / ASFT

[ICLR 2026] The official implementation of the paper “Anchored Supervised Fine-Tuning”

☆47

Alternatives and similar repositories for ASFT

Users that are interested in ASFT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

sustech-nlp / SPPO
View on GitHub
[ACL 2026 Oral] SPPO: Sequence-Level PPO for Long-Horizon Reasoning Tasks official repos.
☆26May 18, 2026Updated 2 months ago
yongliang-wu / DFT
View on GitHub
[ICLR 2026] On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification.
☆587Jan 4, 2026Updated 6 months ago
Utaotao / ProFit
View on GitHub
☆35Jan 20, 2026Updated 6 months ago
X1AOX1A / ZoFiles
View on GitHub
Connect Claude to your Zotero library — Zotero plugin that mirrors collections as agent-readable folders with Markdown, BibTeX, and AI re…
☆17May 21, 2026Updated 2 months ago
shiweijiezero / R3L
View on GitHub
☆23Apr 5, 2026Updated 3 months ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
emmyqin / iw_sft
View on GitHub
☆28Jul 18, 2025Updated last year
Lauorie / DFT
View on GitHub
Reproduced the DFT method without using Verl. https://arxiv.org/abs/2508.05629
☆24Oct 14, 2025Updated 9 months ago
hy-struggle / Pretrain-BERT-based-model
View on GitHub
☆11Oct 16, 2020Updated 5 years ago
GaotangLi / Beyond-Log-Likelihood
View on GitHub
[ICML'26 Spotlight] What is the right loss function for LLM supervised finetuning?
☆66May 28, 2026Updated last month
wutaiqiang / LLM_KD_AKL
View on GitHub
☆22Oct 22, 2024Updated last year
dengyang17 / PPDPP
View on GitHub
☆33Jan 16, 2025Updated last year
fzaiser / nonparametric-hmc
View on GitHub
Implementation of Nonparametric Hamiltonian Monte Carlo
☆13Feb 13, 2023Updated 3 years ago
yuki-younai / MTSA
View on GitHub
offical implementation of MTSA: Multi-turn Safety Alignment for LLMs through Multi-round Red-teaming
☆16Jun 2, 2025Updated last year
JiangHaoPG11 / LGSID
View on GitHub
This repository is for the "LLM-Aligned Geographic Item Tokenization for Local-Life Recommendation".
☆19Nov 18, 2025Updated 8 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Furyton / GR-as-MVDR
View on GitHub
[SIGIR'24] Generative Retrieval as Multi-Vector Dense Retrieval
☆36Oct 18, 2024Updated last year
anglyan / spikingtorch
View on GitHub
A pytorch implementation of spiking neural networks and backpropagation through spikes
☆13Oct 3, 2024Updated last year
RUCBM / AtomMem
View on GitHub
☆27Mar 31, 2026Updated 3 months ago
tengxiao1 / SimPER
View on GitHub
SimPER: A Minimalist Approach to Preference Alignment without Hyperparameters (ICLR 2025)
☆17Aug 22, 2025Updated 10 months ago
yuanjiayiy / AgenticRed
View on GitHub
An automated pipeline that leverages LLM's meta-learning capability to iteratively design and refine red-teaming systems without human in…
☆30May 24, 2026Updated last month
CoopReason / TESSY
View on GitHub
A Teacher–Student Cooperation Framework to Synthesize Student-Consistent SFT Data
☆33May 1, 2026Updated 2 months ago
xuchennlp / S2T
View on GitHub
The project for speech translation
☆12Sep 28, 2023Updated 2 years ago
qzp2018 / UniECS
View on GitHub
Official implement of CIKM2025: 《UniECS: Unified Multimodal E-Commerce Search Framework with Gated Cross-modal Fusion》
☆21Sep 17, 2025Updated 10 months ago
TianHongZXY / RLVR-Decomposed
View on GitHub
[NeurIPS 2025] Implementation for the paper "The Surprising Effectiveness of Negative Reinforcement in LLM Reasoning"
☆165Mar 2, 2026Updated 4 months ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
Shenzhi-Wang / Beyond-the-80-20-Rule-RLVR
View on GitHub
The open-source code for the NeurIPS 2025 paper, "Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learn…
☆60Jan 5, 2026Updated 6 months ago
liumy2010 / UFT
View on GitHub
UFT: Unifying Supervised and Reinforcement Fine-Tuning
☆31Jun 30, 2025Updated last year
zhaijianyang / MQL4GRec
View on GitHub
☆57Apr 1, 2025Updated last year
Nipunnyka / GlobalPathPlannerPlugin
View on GitHub
Path Planner Plugin for turtlebot3 using KinoDynamic A Star in ROS melodic
☆12Jun 28, 2020Updated 6 years ago
backprop07 / Self-Certainty
View on GitHub
Implementation of self-certainty as an extention of ZeroEval Project
☆38May 31, 2025Updated last year
Crazy-Jack / Cl-InfoNCE
View on GitHub
An official implementation of Cl-InfoNCE
☆14Feb 14, 2022Updated 4 years ago
farukakgul / ReasonMaxxer
View on GitHub
☆18May 8, 2026Updated 2 months ago
YennNing / CoFiRec
View on GitHub
CoFiRec: Coarse-to-Fine Tokenization for Generative Recommendationn
☆21Jan 23, 2026Updated 5 months ago
NeuraLiying / Progressive-Layered-Extraction-PLE-
View on GitHub
A multi-task learning framework based on pytorch
☆12Nov 8, 2023Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
iliaschalkidis / flash-roberta
View on GitHub
Hugging Face RoBERTa with Flash Attention 2
☆24Sep 14, 2025Updated 10 months ago
gracefulning / TIDPO
View on GitHub
TOKEN-IMPORTANCE GUIDED DIRECT PREFERENCE OPTIMIZATION
☆38Jan 26, 2026Updated 5 months ago
lisakawai / music_transformation_ismir
View on GitHub
☆13May 16, 2021Updated 5 years ago
bo-scnu / NER
View on GitHub
network security named entity recognition, Chinese
☆11Aug 27, 2019Updated 6 years ago
shivamag125 / EM_PT
View on GitHub
☆33Aug 21, 2025Updated 11 months ago
TanqiuJiang / AgentLAB
View on GitHub
The official implementation of the paper "AgentLAB: Benchmarking LLM Agents against Long-Horizon Attacks"
☆26Jun 1, 2026Updated last month
zyang1580 / BinLLM
View on GitHub
Code used in ACL rebuttal
☆31Sep 3, 2024Updated last year