naba89 / custom_hf_trainerLinks
A custom Huggingface trainer which supports logging auxiliary losses returned by your model
☆15Updated 6 months ago
Alternatives and similar repositories for custom_hf_trainer
Users that are interested in custom_hf_trainer are comparing it to the libraries listed below
Sorting:
- Official PyTorch implementation for ICLR2025 paper "Scaling up Masked Diffusion Models on Text"☆364Updated last year
- ☆58Updated 2 years ago
- Spectral Sphere Optimizer☆94Updated 3 weeks ago
- [NeurIPS 2025] Reinforcement Learning for Reasoning in Large Language Models with One Training Example☆408Updated 2 months ago
- A collection of papers on discrete diffusion models☆168Updated 7 months ago
- An Efficient LLM Fine-Tuning Factory Optimized for MoE PEFT☆133Updated 11 months ago
- PiSSA: Principal Singular Values and Singular Vectors Adaptation of Large Language Models(NeurIPS 2024 Spotlight)☆409Updated 7 months ago
- [ICLR 2026] TraceRL & TraDo-8B: Revolutionizing Reinforcement Learning Framework for Diffusion Large Language Models☆423Updated 2 weeks ago
- [ICLR2025] DiffuGPT and DiffuLLaMA: Scaling Diffusion Language Models via Adaptation from Autoregressive Models☆362Updated 8 months ago
- ☆218Updated 2 months ago
- [ICLR‘24 Spotlight] Code for the paper "Merge, Then Compress: Demystify Efficient SMoE with Hints from Its Routing Policy"☆103Updated 7 months ago
- ☆352Updated 6 months ago
- ☆205Updated last month
- ☆142Updated 3 weeks ago
- L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning☆261Updated 8 months ago
- [EMNLP 2025] TokenSkip: Controllable Chain-of-Thought Compression in LLMs☆201Updated 2 months ago
- paper list, tutorial, and nano code snippet for Diffusion Large Language Models.☆154Updated 3 weeks ago
- [ICLR 2025] Dynamic Mixture of Experts: An Auto-Tuning Approach for Efficient Transformer Models☆153Updated 7 months ago
- The HELMET Benchmark☆199Updated 2 months ago
- Auto get diffusion nlp papers in Axriv. More papers Information can be found in another repository "Diffusion-LM-Papers".☆260Updated last week
- AnchorAttention: Improved attention for LLMs long-context training☆213Updated last year
- The trainer for HF to record losses of different tasks and objectives.☆49Updated 11 months ago
- [NeurIPS 24 Spotlight] MaskLLM: Learnable Semi-structured Sparsity for Large Language Models☆187Updated last year
- Official Implementation for the paper "d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning"☆402Updated 2 weeks ago
- 🚀 LLaMA-MoE v2: Exploring Sparsity of LLaMA from Perspective of Mixture-of-Experts with Post-Training☆91Updated last year
- ☆55Updated 7 months ago
- ☆176Updated last year
- One-shot Entropy Minimization☆188Updated 7 months ago
- open-source code for paper: Retrieval Head Mechanistically Explains Long-Context Factuality☆229Updated last year
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning☆91Updated 11 months ago