This repository provides a comprehensive library for parallel training and LoRA algorithm implementations, supporting multiple parallel strategies and a rich collection of LoRA variants. It serves as a flexible and efficient model fine-tuning toolkit for researchers and developers. Please contact hehn@mail.ustc.edu.cn for detailed information.
☆54Jan 6, 2026Updated 2 months ago
Alternatives and similar repositories for MyTransformers
Users that are interested in MyTransformers are comparing it to the libraries listed below
Sorting:
- LLMRouterBench: A Massive Benchmark and Unified Framework for LLM Routing☆38Jan 30, 2026Updated last month
- An Android Application for GLCC☆11Sep 30, 2022Updated 3 years ago
- The first high school physics Olympiad benchmark for evaluating (M)LLMs with step-level grading and human-level comparison.☆26Dec 19, 2025Updated 2 months ago
- code for ACL24 "MELoRA: Mini-Ensemble Low-Rank Adapter for Parameter-Efficient Fine-Tuning"☆33Feb 19, 2025Updated last year
- [NeurIPS 2024 Spotlight] EMR-Merging: Tuning-Free High-Performance Model Merging☆77Mar 1, 2025Updated last year
- [AAAI-2025] Towards Efficient and Intelligent Laser Weeding: Method and Dataset for Weed Stem Detection☆32May 15, 2025Updated 9 months ago
- ☆22Sep 10, 2024Updated last year
- Localize-and-Stitch: Efficient Model Merging via Sparse Task Arithmetic☆32Feb 18, 2026Updated 2 weeks ago
- LLMem: GPU Memory Estimation for Fine-Tuning Pre-Trained LLMs☆29May 31, 2025Updated 9 months ago
- (ICCV-2025 Official Code)) Improving Generalist Model with Domain-Specific Experts☆87Oct 29, 2025Updated 4 months ago
- Official completion of “Training on the Benchmark Is Not All You Need”.☆39Dec 31, 2024Updated last year
- ☆11Jan 31, 2025Updated last year
- This repo contains the source code for VB-LoRA: Extreme Parameter Efficient Fine-Tuning with Vector Banks (NeurIPS 2024).☆42Oct 15, 2024Updated last year
- The official code for "Advancing Multimodal Large Language Models with Quantization-Aware Scale Learning for Efficient Adaptation" | [MM2…☆14Dec 7, 2024Updated last year
- MoE-Visualizer is a tool designed to visualize the selection of experts in Mixture-of-Experts (MoE) models.☆16Apr 8, 2025Updated 10 months ago
- ☆17Jan 17, 2026Updated last month
- Official code for "Algorithmic Capabilities of Random Transformers" (NeurIPS 2024)☆16Sep 28, 2024Updated last year
- ☆12Nov 29, 2020Updated 5 years ago
- Conditional DDPM for characterizing radio sources from dirty images. (autumn 2023)☆11Nov 30, 2023Updated 2 years ago
- This is an Augmented Reality application which will help in learning about Wild life animal by creating an augmented Zoo and Spread awar…☆10Nov 1, 2018Updated 7 years ago
- Fully open reproduction of DeepSeek-R1☆12Mar 24, 2025Updated 11 months ago
- Official implementation of ICML'24 paper "LQER: Low-Rank Quantization Error Reconstruction for LLMs"☆19Jul 11, 2024Updated last year
- ☆10Feb 12, 2024Updated 2 years ago
- ☆13Jun 22, 2025Updated 8 months ago
- 抓取Here地图的三维建筑物模型☆12Jun 29, 2017Updated 8 years ago
- Code for paper "Concrete Subspace Learning based Interference Elimination for Multi-task Model Fusion"☆14Mar 28, 2024Updated last year
- Unofficial docker wrapper for Qualcomm SNPE(Snapdragon Neural Processing Engine) SDK☆11Mar 3, 2022Updated 4 years ago
- ACL Rolling Review website☆11Feb 24, 2026Updated last week
- MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation☆13Sep 2, 2024Updated last year
- code for EMNLP 2024 paper: How do Large Language Models Learn In-Context? Query and Key Matrices of In-Context Heads are Two Towers for M…☆13Nov 17, 2024Updated last year
- Fine-tuning Quantized Neural Networks with Zeroth-order Optimization☆16Sep 17, 2025Updated 5 months ago
- code for EMNLP 2024 paper: Interpreting Arithmetic Mechanism in Large Language Models through Comparative Neuron Analysis☆12Nov 17, 2024Updated last year
- LLM for genomic☆14Feb 23, 2024Updated 2 years ago
- [ICLR 2025] Linear Combination of Saved Checkpoints Makes Consistency and Diffusion Models Better☆16Feb 15, 2025Updated last year
- Internal wave atlas for Northern Australia☆10Sep 17, 2025Updated 5 months ago
- A simple python SDK around PubMed API.☆21Jan 1, 2025Updated last year
- ☆16Dec 7, 2025Updated 2 months ago
- ☆14Apr 29, 2025Updated 10 months ago
- The implementation for FREE-Merging: Fourier Transform for Model Merging with Lightweight Experts (ICCV25)☆14Jun 26, 2025Updated 8 months ago