PKU-DAIR / Hetu-Galvatron

Galvatron is an automatic distributed training system designed for Transformer models, including Large Language Models (LLMs).

☆159

Alternatives and similar repositories for Hetu-Galvatron

Users who are interested in Hetu-Galvatron are comparing it to the libraries listed below.

junzhang-zj / LoRAM
[ICLR 2025] Train Small, Infer Large: Memory-Efficient LoRA Training for Large Language Models
☆75 Updated 3 months ago
SkyworkAI / MoE-plus-plus
[ICLR 2025] MoE++: Accelerating Mixture-of-Experts Methods with Zero-Computation Experts
☆233 Updated 9 months ago
uanu2002 / JSQ
[ICML 2024] JSQ: Compressing Large Language Models by Joint Sparsification and Quantization
☆150 Updated 8 months ago
mangopy / tool-retrieval-benchmark
Official code for ACL2025 "🔍 Retrieval Models Aren’t Tool-Savvy: Benchmarking Tool Retrieval for Large Language Models"
☆187 Updated 3 weeks ago
starriver030515 / SynthVLM
☆112 Updated last month
liuxukun2000 / Adaptix
Adaptive Draft-Verification for Efficient Large Language Model Decoding (AAAI 2025 Oral)
☆67 Updated 3 months ago
mangopy / AutoTools
Official Repo for WWW 2025 paper "Tool Learning in the Wild: Empowering Language Models as Automatic Tool Agents"
☆201 Updated 3 months ago
starriver030515 / FUSION
☆184 Updated 3 months ago
OceannTwT / Tool-Planner
[ICLR 2025] Tool-Planner: Task Planning with Clusters across Multiple Tools
☆114 Updated 2 months ago
MJinXiang / Reasoning-Table
Reasoning-Table: Exploring Reinforcement Learning for Table Reasoning
☆82 Updated last month
tsinghua-fib-lab / ANeurIPS2024_SPV-MIA
[NeurIPS'24] "Membership Inference Attacks against Fine-tuned Large Language Models via Self-prompt Calibration"
☆189 Updated 4 months ago
shizhl / CoAgents
Official code for paper "Learning to Use Tools via Cooperative and Interactive Agents"
☆137 Updated last year
wjmZZZ / LLM-zero2hero
☆203 Updated 7 months ago
1989chenguo / CloudComputingLabs
☆342 Updated 2 years ago
HAL-42 / AlchemyCat
Alchemy Cat —— 🔥Config System for SOTA
☆115 Updated this week
CHB-learner / NeoBert
NeoBERT is an advanced model designed specifically for predicting the binding affinity between neoantigens and HLA. It is a variant of th…
☆154 Updated 6 months ago
SkyworkAI / MoH
MoH: Multi-Head Attention as Mixture-of-Head Attention
☆262 Updated 8 months ago
LAMDASZ-ML / Aries
ARIES (ArXiv Research Intelligent Efficient Summary)
☆68 Updated 6 months ago
Decade-qiu / Gridea-Theme-Eternal
A simple and elegant theme for Gridea (inspired by hexo-theme-matery).
☆66 Updated 4 months ago
PhoenixZ810 / OmniAlign-V
Official Repository of paper OmniAlign-V: Towards Enhanced Alignment of MLLMs with Human Preference
☆145 Updated 4 months ago
HeZephyr / MIT6.5840
Code and documentation for MIT 6.5840 (6.824), Spring 2024
☆68 Updated 3 months ago
mangopy / Confucius-tool-learning
Official Repo for AAAI 2024 paper "Confucius: Iterative Tool Learning from Introspection Feedback by Easy-to-Difficult Curriculum"
☆78 Updated 4 months ago
QuanjianSong / LightMotion
Official PyTorch Code of the Paper "LightMotion: A Light and Tuning-free Method for Simulating Camera Motion in Video Generation"
☆38 Updated last month
Albert-Weasker / ai_developer
ai_developer is an AI-driven software engineer that turns a single-line requirement into a fully functional project.
☆67 Updated 4 months ago
HeZephyr / application-logic
Logic for application
☆40 Updated last year
Decade-qiu / Typora-Theme-Eternal
A theme for Typora
☆52 Updated 4 months ago
Decade-qiu / Online-Live-Teaching-Platform
A concise and complete online teaching platform featuring live streaming, interactive tools, and course management, built using Flask, Vu…
☆52 Updated 4 months ago
dingdinglz / openai
A Go library for calling the API of any AI service that follows the OpenAI paradigm
☆131 Updated last week
AIDC-AI / Parrot
🎉 The code repository for "Parrot: Multilingual Visual Instruction Tuning" in PyTorch.
☆88 Updated last month
riverback / V2C-CBM
V2C-CBM: Building Concept Bottlenecks with Vision-to-Concept Tokenizer (AAAI 2025)
☆52 Updated last week