hhnqqq / MyTransformersLinks

This repository provides a comprehensive library for parallel training and LoRA algorithm implementations, supporting multiple parallel strategies and a rich collection of LoRA variants. It serves as a flexible and efficient model fine-tuning toolkit for researchers and developers. Please contact hehn@mail.ustc.edu.cn for detailed information.

☆46

Alternatives and similar repositories for MyTransformers

Users that are interested in MyTransformers are comparing it to the libraries listed below

Sorting:

ML-GSAI / Diffusion-LLM-Papers
A Collection of Papers on Diffusion Language Models
☆90Updated 2 weeks ago
ThreeSR / Awesome-Inference-Time-Scaling
Paper List of Inference/Test Time Scaling/Computing
☆280Updated 2 weeks ago
yczhou001 / Awesome-Diffusion-LLM
paper list, tutorial, and nano code snippet for Diffusion Large Language Models.
☆82Updated 3 weeks ago
Chongjie-Si / Subspace-Tuning
A generalized framework for subspace tuning methods in parameter efficient fine-tuning.
☆147Updated 3 weeks ago
fscdc / Awesome-Efficient-Reasoning-Models
[arXiv 2025] Efficient Reasoning Models: A Survey
☆227Updated this week
Outsider565 / LoRA-GA
☆203Updated 8 months ago
Clin0212 / HydraLoRA
[NeurIPS'24 Oral] HydraLoRA: An Asymmetric LoRA Architecture for Efficient Fine-Tuning
☆216Updated 7 months ago
LINs-lab / DynMoE
[ICLR 2025] Dynamic Mixture of Experts: An Auto-Tuning Approach for Efficient Transformer Models
☆116Updated last week
horseee / CoT-Valve
CoT-Valve: Length-Compressible Chain-of-Thought Tuning
☆77Updated 5 months ago
MingyuJ666 / Rope_with_LLM
[ICML'25] Our study systematically investigates massive values in LLMs' attention mechanisms. First, we observe massive values are concen…
☆74Updated 3 weeks ago
JinXins / Awesome-Token-Merge-for-MLLMs
A paper list about Token Merge, Reduce, Resample, Drop for MLLMs.
☆67Updated 6 months ago
SUSTechBruce / LOOK-M
[EMNLP 2024 Findings🔥] Official implementation of ": LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context In…
☆97Updated 8 months ago
maomaocun / dLLM-cache
Official PyTorch implementation of the paper "dLLM-Cache: Accelerating Diffusion Large Language Models with Adaptive Caching" (dLLM-Cache…
☆126Updated last week
hanyang1999 / discrete-diffusion-papers
A collection of papers on discrete diffusion models
☆151Updated 2 weeks ago
zitian-gao / one-shot-em
One-shot Entropy Minimization
☆167Updated last month
OpenSparseLLMs / Skip-DiT
✈️ [ICCV 2025] Towards Stabilized and Efficient Diffusion Transformers through Long-Skip-Connections with Spectral Constraints
☆71Updated last week
NVlabs / Fast-dLLM
Official implementation of "Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding"
☆296Updated last week
Purshow / Awesome-Unified-Multimodal
📖 This is a repository for organizing papers, codes, and other resources related to unified multimodal models.
☆256Updated 3 weeks ago
xuyang-liu16 / Awesome-Token-level-Model-Compression
📚 Collection of token-level model compression resources.
☆140Updated 2 weeks ago
LMM101 / Awesome-Multimodal-Next-Token-Prediction
[Survey] Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey
☆447Updated 6 months ago
wutaiqiang / MoSLoRA
☆108Updated last year
xuyang-liu16 / Awesome-Generation-Acceleration
📚 Collection of awesome generation acceleration resources.
☆286Updated last week
ML-GSAI / LLaDA-V
☆174Updated 3 weeks ago
Gumpest / SparseVLMs
[ICML'25] Official implementation of paper "SparseVLM: Visual Token Sparsification for Efficient Vision-Language Model Inference".
☆130Updated last month
zhijie-group / Orthus
☆38Updated 2 months ago
WayneJin0918 / SOTA-paper-rating.io
A tiny paper rating web
☆38Updated 3 months ago
NVlabs / Long-RL
Long-RL: Scaling RL to Long Sequences
☆323Updated this week
horseee / dKV-Cache
☆88Updated last month
jianghoucheng / AlphaEdit
AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models, ICLR 2025 (Outstanding Paper)
☆282Updated last week
NeuroDong / Bibtex_for_NeurIPS2025
Provide .bst files for NeurIPS latex template
☆49Updated 3 months ago