PKU-DAIR / Hetu-Galvatron
Galvatron is an automatic distributed training system designed for Transformer models, including Large Language Models (LLMs).
☆136Updated this week
Alternatives and similar repositories for Hetu-Galvatron:
Users that are interested in Hetu-Galvatron are comparing it to the libraries listed below
- [ICLR 2025] Train Small, Infer Large: Memory-Efficient LoRA Training for Large Language Models☆66Updated this week
- Galvatron is an automatic distributed training system designed for Transformer models, including Large Language Models (LLMs). If you hav…☆20Updated last week
- Official code for "🔍 Retrieval Models Aren’t Tool-Savvy: Benchmarking Tool Retrieval for Large Language Models"☆140Updated last week
- [ICML 2024] JSQ: Compressing Large Language Models by Joint Sparsification and Quantization☆148Updated 5 months ago
- ☆71Updated this week
- Adaptive Draft-Verification for Efficient Large Language Model Decoding☆64Updated 3 months ago
- [ICLR 2025] MoE++: Accelerating Mixture-of-Experts Methods with Zero-Computation Experts☆197Updated 5 months ago
- Official Repo for WWW 2025 paper "Tool Learning in the Wild: Empowering Language Models as Automatic Tool Agents"☆176Updated last week
- ARIES (ArXiv Research Intelligent Efficient Summary)☆65Updated 2 months ago
- Official Repo for AAAI 2024 paper "Confucius: Iterative Tool Learning from Introspection Feedback by Easy-to-Difficult Curriculum"☆98Updated 3 weeks ago
- Logic for application☆41Updated 9 months ago
- golang的支持调用所有openai范式的ai的api的库☆129Updated last month
- This is a repository for the source code of Swift_Ielts platform☆53Updated last week
- a theme for typora☆57Updated 3 weeks ago
- GoMicroKit is a lightweight microservice framework for Go that balances simplicity with powerful features. Built for developers who need …☆1Updated 3 weeks ago
- 🤖 Discord AI assistant with OpenAI, Gemini, Claude & DeepSeek integration, multilingual support, multimodal chat, image generation, web …☆102Updated 3 weeks ago
- A library that removes the __subclasses__() list from all classes, allowing for nearly absolute security in exec and eval functions. 一个清除…☆44Updated last week
- ai_developer is an AI-driven software engineer that turns a single-line requirement into a fully functional project.☆67Updated last month
- Turning every screen moment into a memory at your fingertips.☆149Updated 2 months ago
- A library that automatically captures the complete stack frames including local and global variables when any exceptions occur. 一个在发生异常时,…☆49Updated this week
- A concise and complete online teaching platform featuring live streaming, interactive tools, and course management, built using Flask, Vu…☆58Updated 3 weeks ago
- 一款基于 Typecho 默认主题 Replica 开发的博客主题,旨在简约现代的基础上提升阅读体验。☆90Updated last month
- ☆25Updated last week
- A full-stack online coding platform powered by Spring Cloud microservices and a Vue 3 frontend, offering scalable, secure, and interactiv…☆60Updated 3 weeks ago
- ant-design-x-pro☆57Updated last month
- Omega-AI:基于java打造的深度学习框架,帮助你快速搭建神经网络,实现模型推理与训练,引擎支持自动求导,多线程与GPU运算,GPU支持CUDA,CUDNN。☆22Updated last month
- rModel is a framework for building LLM applications with agentic workflow☆65Updated last month
- ☆42Updated 3 weeks ago
- ☆52Updated last month
- FastSFile 是一个基于 Python 的命令行文件传输工具,方便在手机或其他设备上轻松下载文件。☆58Updated last month