AnonymousAlethiometer / SGD_SaIView external linksLinks
Official PyTorch Implementation for Paper "No More Adam: Learning Rate Scaling at Initialization is All You Need"
☆56Jan 27, 2025Updated last year
Alternatives and similar repositories for SGD_SaI
Users that are interested in SGD_SaI are comparing it to the libraries listed below
Sorting:
- ☆36Mar 12, 2025Updated 11 months ago
- ☆21Jul 21, 2025Updated 6 months ago
- This repo contains code for the paper "Both Text and Images Leaked! A Systematic Analysis of Data Contamination in Multimodal LLM"☆17Oct 17, 2025Updated 3 months ago
- Compiler-R1: Towards Agentic Compiler Auto-tuning with Reinforcement Learning☆28Jul 14, 2025Updated 7 months ago
- When it comes to optimizers, it's always better to be safe than sorry☆404Sep 26, 2025Updated 4 months ago
- Diffusion-Sharpening: Fine-tuning Diffusion Models with Denoising Trajectory Sharpening☆69May 18, 2025Updated 8 months ago
- ☆15Jan 12, 2026Updated last month
- [ACM MM25] Official Pytorch implementation of [Decoupled Global-Local Alignment for Improving Compositional Understanding]☆15Jul 15, 2025Updated 6 months ago
- Tuning-Free Image Editing with Fidelity and Editability via Unified Latent Diffusion Model☆13Dec 29, 2024Updated last year
- [ICLR 2025] Bridging and Modeling Correlations in Pairwise Data for Direct Preference Optimization☆12Jan 26, 2025Updated last year
- The official implementation of HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalization☆18Mar 7, 2025Updated 11 months ago
- Plancraft is a minecraft environment and agent suite to test planning capabilities in LLMs☆26Nov 7, 2025Updated 3 months ago
- [ICLR 2025] Weighted-Reward Preference Optimization for Implicit Model Fusion☆13Mar 17, 2025Updated 10 months ago
- ☆19Jun 4, 2025Updated 8 months ago
- ☆14Oct 4, 2024Updated last year
- The official repo for the DanQing dataset.☆29Jan 16, 2026Updated 3 weeks ago
- Official Implementation of our paper "THOR: Tool-Integrated Hierarchical Optimization via RL for Mathematical Reasoning".☆29Sep 19, 2025Updated 4 months ago
- Official implementation of the paper: "A deeper look at depth pruning of LLMs"☆15Jul 24, 2024Updated last year
- Sparse Backpropagation for Mixture-of-Expert Training☆29Jul 2, 2024Updated last year
- ☆19Oct 14, 2024Updated last year
- Set-Encoder: Permutation-Invariant Inter-Passage Attention for Listwise Passage Re-Ranking with Cross-Encoders☆18May 23, 2025Updated 8 months ago
- [ICCV 2025] Official implementation of "What Makes for Text to 360-degree Panorama Generation with Stable Diffusion?"☆18Aug 7, 2025Updated 6 months ago
- ☆15Sep 22, 2024Updated last year
- Official code of "StreamBP: Memory-Efficient Exact Backpropagation for Long Sequence Training of LLMs".☆74Jun 23, 2025Updated 7 months ago
- ☆19Jun 29, 2025Updated 7 months ago
- [ICML 2025] Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"☆19Mar 10, 2025Updated 11 months ago
- OneEdit: A Neural-Symbolic Collaboratively Knowledge Editing System.☆19Oct 14, 2024Updated last year
- [NeurIPS 2024] The official implementation of "Image Copy Detection for Diffusion Models"☆18Oct 1, 2024Updated last year
- The code for the paper "Efficient Self-Supervised Video Hashing with Selective State Spaces" (AAAI'25).☆21Aug 2, 2025Updated 6 months ago
- Using short models to classify long texts☆21Mar 8, 2023Updated 2 years ago
- The code for paper "EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning"☆37Oct 1, 2025Updated 4 months ago
- hola amigos☆22Oct 18, 2025Updated 3 months ago
- Control LLM☆22Apr 6, 2025Updated 10 months ago
- Fork of Flame repo for training of some new stuff in development☆19Jan 5, 2026Updated last month
- ☆47Apr 29, 2025Updated 9 months ago
- [ICLR 2026] Official Implementation of Muddit [Meissonic II]: Liberating Generation Beyond Text-to-Image with a Unified Discrete Diffusio…☆99Feb 4, 2026Updated last week
- MUFFIN: Curating Multi-Faceted Instructions for Improving Instruction-Following☆16Oct 31, 2024Updated last year
- TEMPURA enables video-language models to reason about causal event relationships and generate fine-grained, timestamped descriptions of u…☆25Jun 4, 2025Updated 8 months ago
- [COLM 2025] "C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing"☆20Apr 9, 2025Updated 10 months ago