Repo for paper "Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability"
☆76Apr 11, 2026Updated this week
Alternatives and similar repositories for rethink_sft_generalization
Users that are interested in rethink_sft_generalization are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [NeurIPS 2025] Domain-RAG: Retrieval-Guided Compositional Image Generation for Cross-Domain Few-Shot Object Detection☆63Feb 2, 2026Updated 2 months ago
- ☆16Sep 17, 2024Updated last year
- PaperPub is an academic arena where diverse AI Agents read papers daily, pick apart each other's arguments, and fiercely debate.☆43Mar 25, 2026Updated 2 weeks ago
- 🚀 CCF DDL Tracker: a lightweight chrome extension for tracking CCF deadlines (Ongoing...)☆23Apr 5, 2026Updated last week
- Official Repository of "Taming Masked Diffusion Language Models via Consistency Trajectory Reinforcement Learning with Fewer Decoding Ste…☆27Mar 9, 2026Updated last month
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- DOMAINEVAL is an auto-constructed benchmark for multi-domain code generation that consists of 2k+ subjects (i.e., description, reference …☆14Dec 12, 2024Updated last year
- Generate Persona 5 style “calling card” images.☆20Mar 5, 2025Updated last year
- Socratic-Zero is a fully autonomous framework that generates high-quality training data for mathematical reasoning☆36Oct 26, 2025Updated 5 months ago
- FlexiFilm: Long Video Generation with Flexible Conditions☆31May 1, 2024Updated last year
- Diagnostic Framework for LLMs and MLLMs☆36Mar 2, 2026Updated last month
- Fault Trees on R☆10Aug 26, 2023Updated 2 years ago
- DUT编译原理课程设计,定义了一个C语言子集,包含词法分析,语法分析,语义分析,解释执行以及相应的图形界面☆12Nov 13, 2020Updated 5 years ago
- CVPR(Highlight) Decoupled Distillation to Erase: A General Unlearning Method for Any Class-centric Tasks☆21Jul 22, 2025Updated 8 months ago
- Official reposity for paper "High-Dimension Human Value Representation in Large Language Models" (NAACL'25 Main)☆23Jul 9, 2024Updated last year
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- OpenFTA☆14Jun 14, 2013Updated 12 years ago
- 一个开源数学大模型项目,旨在探索大模型是否具有数学创造能力,以及大模型在前沿数学研究中的潜在能力。☆18Mar 19, 2026Updated 3 weeks ago
- Teaching Pretrained Language Models to Think Deeper with Retrofitted Recurrence☆60Nov 11, 2025Updated 5 months ago
- GEMS: Agent-Native Multimodal Generation with Memory and Skills☆101Apr 1, 2026Updated last week
- AAAI2025☆12Apr 18, 2025Updated 11 months ago
- Professor and Group List of CS☆10Mar 12, 2024Updated 2 years ago
- a PL/0 compiler☆16Aug 25, 2019Updated 6 years ago
- This is the official repository for the ICLR 2025 Conference Paper - Fast and Slow Streams for Online Time Series Forecasting without Inf…☆16Apr 30, 2025Updated 11 months ago
- 具身智能入门自学☆26Apr 21, 2025Updated 11 months ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- ☆27Sep 28, 2024Updated last year
- ai+语音合成☆24Jul 4, 2024Updated last year
- Official implementation of DiTFuse (TPAMI 2026)☆52Mar 7, 2026Updated last month
- Open-source self-hosted password manager built with Flutter. Store passwords and crypto seed phrases securely without cloud storage.☆53Updated this week
- JoinAI是一个开源仓库,专注于算法工程能力的培养,包括工程和数学原理的整理☆11Apr 20, 2025Updated 11 months ago
- 这是一个CP31的抢票脚本,有问题可以看项目描述捏,答应我,一定要看好嘛☆30May 12, 2025Updated 11 months ago
- NeurIPS 2024 "NoisyGL: A Comprehensive Benchmark for Graph Neural Networks under Label Noise"☆32Jun 4, 2025Updated 10 months ago
- Official Implementation of "Geometrically-Constrained Agent for Spatial Reasoning"☆74Updated this week
- ☆36Mar 23, 2026Updated 2 weeks ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- The official implementation of "Test-time Adaptation for Regression by Subspace Alignment" (ICLR 2025).☆17Jun 6, 2025Updated 10 months ago
- collab-dev - Collaboration Metrics for Code Reviews☆23May 12, 2025Updated 10 months ago
- Xmixers: A collection of SOTA efficient token/channel mixers☆28Sep 4, 2025Updated 7 months ago
- Cross Domain Recommendation via Bi-directional Transfer Graph Collaborative Filtering Networks☆28Dec 17, 2020Updated 5 years ago
- 这是一个简单的每日待办事项管理软件,轻巧绿色,满足基本的操作需求,基于aardio开发☆34Jul 25, 2019Updated 6 years ago
- A latest curated list of resources on implicit neural representations.☆16Apr 18, 2025Updated 11 months ago
- [arXiv 2026] Official PyTorch Repository for "Coarse-Guided Visual Generation via Weighted h-Transform Sampling"☆41Mar 16, 2026Updated 3 weeks ago