The official implementation of the paper “Anchored Supervised Fine-Tuning”
☆30Feb 12, 2026Updated 2 weeks ago
Alternatives and similar repositories for ASFT
Users that are interested in ASFT are comparing it to the libraries listed below
Sorting:
- SimPER: A Minimalist Approach to Preference Alignment without Hyperparameters (ICLR 2025)☆17Aug 22, 2025Updated 6 months ago
- DCPO: Dynamic Adaptive Clipping for RL☆45Dec 20, 2025Updated 2 months ago
- InfiniteVL: Synergizing Linear and Sparse Attention for Highly-Efficient, Unlimited-Input Vision-Language Models☆84Feb 2, 2026Updated 3 weeks ago
- ☆32Feb 8, 2026Updated 2 weeks ago
- ⚙️ Lightweight & smart Bun & Browser configuration loader.☆15Updated this week
- ☆26Jul 29, 2025Updated 7 months ago
- gradio bbox labeling tools☆11May 12, 2023Updated 2 years ago
- ☆11Jul 21, 2024Updated last year
- [MMM2025] Official repository for Music2MIDI: Pop Music to MIDI Piano Cover Generation☆15Jul 1, 2025Updated 8 months ago
- ☆12Jul 24, 2025Updated 7 months ago
- FamilyTool benchmark☆12Sep 10, 2025Updated 5 months ago
- A user-friendly interface built on top of Thinking Machines Tinker API that lets you fine-tune LLMs, chat with your trained model, and de…☆27Jan 31, 2026Updated last month
- shadowsocks + v2ray-plugin☆16Jun 20, 2024Updated last year
- ☆28Nov 6, 2025Updated 3 months ago
- A cage-based deformation for meshes in 2D.☆14Sep 8, 2018Updated 7 years ago
- ☆11Mar 12, 2024Updated last year
- Repo for Paper "From Role-Play to Drama-Interaction: An LLM Solution" @ACL 2024☆13Jul 25, 2024Updated last year
- Official PyTorch implementation of our CVPR 2025 paper, "Revisiting Generative Replay for Class Incremental Object Detection."☆24Jan 1, 2026Updated 2 months ago
- ICML 2025 Spotlight, PCEvolve: Private Contrastive Evolution for Synthetic Dataset Generation via Few-Shot Private Data and Generative AP…☆14Jun 27, 2025Updated 8 months ago
- ☆12Nov 28, 2022Updated 3 years ago
- ☆13Mar 9, 2024Updated last year
- ☆42Updated this week
- A react-typescript component for Plotly.JS graphs.☆15Feb 29, 2020Updated 6 years ago
- ☆18Jun 11, 2025Updated 8 months ago
- Accepted to ICLR 2025. MetaMetrics is a calibrated meta-metric designed to evaluate generation tasks across different modalities aligned …☆14Dec 30, 2024Updated last year
- ☆15Jan 11, 2023Updated 3 years ago
- Based on the R1-Zero method, using rule-based rewards and GRPO on the Code Contests dataset.☆18Apr 22, 2025Updated 10 months ago
- 微聚,专业的数据标注,采集平台☆13Jun 19, 2018Updated 7 years ago
- [ICLR25] STBLLM: Breaking the 1-Bit Barrier with Structured Binary LLMs☆18Jun 3, 2025Updated 8 months ago
- This is the companion code for the method reported in the paper "Learning game-theoretic models of multiagent trajectories using implicit…☆12Feb 8, 2021Updated 5 years ago
- Reasoning Activation in LLMs via Small Model Transfer (NeurIPS 2025)☆21Oct 16, 2025Updated 4 months ago
- This is LaTex PDF(PPT) template for SUSTech, you can use it to perform your presentations.☆15Sep 14, 2021Updated 4 years ago
- Companion code to https://arxiv.org/abs/2409.03797v2☆19Sep 18, 2025Updated 5 months ago
- An unofficial implementation of SOLAR-10.7B model and the newly proposed interlocked-DUS(iDUS) implementation and experiment details.☆14Mar 20, 2024Updated last year
- minify html with CSS and JS☆14Nov 15, 2019Updated 6 years ago
- SUSTech 2023 Spring CS328 Distributed System☆18Oct 5, 2024Updated last year
- A Qwen .5B reasoning model trained on OpenR1-Math-220k☆14Oct 11, 2025Updated 4 months ago
- Multi-agent coordination for Pi - presence, messaging, file reservations☆51Feb 18, 2026Updated last week
- ☆15Updated this week