Ongoing research training transformer models at scale
☆19Jul 27, 2023Updated 2 years ago
Alternatives and similar repositories for Megatron-LM
Users that are interested in Megatron-LM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Evaluate Transformers from the Hub 🔥☆14May 26, 2026Updated 3 weeks ago
- A template repo used by Autonomous Systems and Robotics Group for ML projects☆16Mar 8, 2023Updated 3 years ago
- Simple Python package for converting between CustomVision <-> Pascal VOC <-> YOLO annotations☆19Nov 28, 2022Updated 3 years ago
- ☆19Mar 24, 2025Updated last year
- ☆12Sep 1, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- 首届中国心电智能大赛决赛阶段解决方案-公开版 比赛网址 http://mdi.ids.tsinghua.edu.cn/☆10Aug 21, 2019Updated 6 years ago
- JoVA: Unified Multimodal Learning for Joint Video-Audio Generation☆33Dec 22, 2025Updated 5 months ago
- State-of-the-art pretrained vision model from Bing Multimedia☆19Oct 2, 2023Updated 2 years ago
- ☆15Oct 24, 2023Updated 2 years ago
- ☆13Oct 1, 2024Updated last year
- Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.☆13Jul 15, 2024Updated last year
- Repository to go along with the paper "Plumber: Diagnosing and Removing Performance Bottlenecks in Machine Learning Data Pipelines"☆10Mar 31, 2022Updated 4 years ago
- ☆10Jan 15, 2023Updated 3 years ago
- An Attention Superoptimizer☆22Jan 20, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- FPGA-based HyperLogLog Accelerator☆12Jul 13, 2020Updated 5 years ago
- Quicksilver superpage management system☆10May 14, 2021Updated 5 years ago
- Exploring by Minimizing Uncertainty of Q values (EMU-Q) as presented in "Bayesian RL for Goal-Only Rewards" at CoRL'18.☆10Nov 8, 2018Updated 7 years ago
- An attempt to merge ESBN with Transformers, to endow Transformers with the ability to emergently bind symbols☆16Aug 3, 2021Updated 4 years ago
- Uart module written in chisel☆13Feb 19, 2016Updated 10 years ago
- Microbenchmark for zsmalloc allocation mapping☆11Dec 14, 2015Updated 10 years ago
- Latr: Lazy Translation Coherence - ASPLOS'18☆15Nov 15, 2021Updated 4 years ago
- Towards Hardware and Software Continuous Integration☆13Jun 8, 2020Updated 6 years ago
- (elastic) cuckoo hashing☆17Jun 20, 2020Updated 6 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- This repo is obsolete. Please see https://github.com/tomtom?tab=repositories&q=_vim for vim plugins.☆55Dec 11, 2015Updated 10 years ago
- ☆10Sep 15, 2023Updated 2 years ago
- A simple exam generator and grader written in Python with OpenCV☆14Jan 14, 2026Updated 5 months ago
- Few training heuristics and small architectural changes that can significantly improve YOLOv3 performance with tiny increase in inference…☆13May 10, 2020Updated 6 years ago
- Code for the ICRA2018 paper "Trajectory-Optimized Sensing for Active Search of Tissue Abnormalities in Robotic Surgery"☆11May 22, 2018Updated 8 years ago
- VisualToolAgent (VisTA): A Reinforcement Learning Framework for Visual Tool Selection☆27May 31, 2025Updated last year
- Building a multi-agent RAG system with advanced RAG methods☆13Jan 12, 2025Updated last year
- CTC decoder with hotwords for ASR.☆36Apr 13, 2025Updated last year
- Agent驱动的实时广播电台 实验性项目☆37Feb 8, 2026Updated 4 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- HLS implementation of cuckoo hashing. Refer to paper : https://ieeexplore.ieee.org/document/7577355/☆14Dec 4, 2018Updated 7 years ago
- Cluster simulator with far memory☆12Apr 28, 2020Updated 6 years ago
- You Only Search Once: On Lightweight Differentiable Architecture Search for Resource-Constrained Embedded Platforms☆12Apr 17, 2023Updated 3 years ago
- PyTorch - Albert Large V2, Bert Base Uncased, Bert Large Uncased WWM Finetuned Squad, Distil Roberta Base, Roberta Base Squad2, Roberta l…☆11Jul 10, 2020Updated 5 years ago
- LLM-based character segmentation agent for ComfyUI based on SAM 3 and the SAM 3 Agent notebook☆27Dec 22, 2025Updated 5 months ago
- ☆15Apr 15, 2022Updated 4 years ago
- KeyTerms centralized terminology management tool☆13Feb 7, 2019Updated 7 years ago