Ongoing research training transformer models at scale
☆18Jul 27, 2023Updated 2 years ago
Alternatives and similar repositories for Megatron-LM
Users that are interested in Megatron-LM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Evaluate Transformers from the Hub 🔥☆14Apr 3, 2026Updated 2 weeks ago
- Scripts to parse arxiv documents for NLP tasks☆19Jun 12, 2023Updated 2 years ago
- ✨ Web interface for NeuralCoref coreference resolution☆35May 15, 2023Updated 2 years ago
- [ACL'25 (Findings)] Explorer: Scaling Exploration-driven Web Trajectory Synthesis for Multimodal Web Agents☆28Feb 17, 2026Updated 2 months ago
- 🚂 Fine-tune OpenAI models for text classification, question answering, and more☆17May 1, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Simple Python package for converting between CustomVision <-> Pascal VOC <-> YOLO annotations☆19Nov 28, 2022Updated 3 years ago
- ☆19Mar 24, 2025Updated last year
- 深度学习可解释性论文汇总☆15Mar 3, 2021Updated 5 years ago
- ☆12Sep 1, 2023Updated 2 years ago
- State-of-the-art pretrained vision model from Bing Multimedia☆19Oct 2, 2023Updated 2 years ago
- ☆15Oct 24, 2023Updated 2 years ago
- ☆13Oct 1, 2024Updated last year
- Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.☆13Jul 15, 2024Updated last year
- ☆11Jun 2, 2019Updated 6 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- 一个基于原生浏览器书签的知识库:用 GitHub Gist 跨浏览器同步书签,并用 AI 为书签生成摘要、标签和封面,提供一个简洁的 Web 端浏览体验。☆32Jan 5, 2026Updated 3 months ago
- Exploring by Minimizing Uncertainty of Q values (EMU-Q) as presented in "Bayesian RL for Goal-Only Rewards" at CoRL'18.☆10Nov 8, 2018Updated 7 years ago
- 1st place solution of 🦾😢 in https://www.kaggle.com/c/ai-medical-contest-2021/☆10Apr 2, 2021Updated 5 years ago
- ☆32Jul 5, 2021Updated 4 years ago
- A simple exam generator and grader written in Python with OpenCV☆14Jan 14, 2026Updated 3 months ago
- ☆10Oct 20, 2023Updated 2 years ago
- Few training heuristics and small architectural changes that can significantly improve YOLOv3 performance with tiny increase in inference…☆12May 10, 2020Updated 5 years ago
- Materials for workshops on the Hugging Face ecosystem☆153Apr 2, 2026Updated 2 weeks ago
- CTC decoder with hotwords for ASR.☆35Apr 13, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- NanoGPT (124M) in 5 minutes☆15Feb 14, 2025Updated last year
- ☆13Feb 5, 2025Updated last year
- 多Agent驱动的实时广播电台 实验性项目☆33Feb 8, 2026Updated 2 months ago
- PyTorch - Albert Large V2, Bert Base Uncased, Bert Large Uncased WWM Finetuned Squad, Distil Roberta Base, Roberta Base Squad2, Roberta l…☆11Jul 10, 2020Updated 5 years ago
- ☆15Apr 15, 2022Updated 4 years ago
- A lightweight frontend for ffmpeg intended specifically for convenient video clipping☆42Sep 23, 2025Updated 6 months ago
- KeyTerms centralized terminology management tool☆13Feb 7, 2019Updated 7 years ago
- Bert TensorRT模型加速部署☆10Apr 1, 2022Updated 4 years ago
- ☆13Aug 12, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Collection of different types of transformers for learning purposes☆12Jan 30, 2020Updated 6 years ago
- Using Vrep to simulate a six-legged robot to do motion planning & path planning☆10Jan 10, 2019Updated 7 years ago
- ☆28Mar 30, 2026Updated 2 weeks ago
- ☆22Dec 12, 2025Updated 4 months ago
- Baseline system for Language-based Audio Retrieval (Task 6B) in DCASE 2023 Challenge☆10Aug 8, 2023Updated 2 years ago
- 基于InternLm chat 7B大模型基座,构建一个Agent ,可以调用 MMYOLO 工具来完成图像内视觉任务☆11Oct 30, 2024Updated last year
- 新网银行杯Top1方案☆23Dec 16, 2018Updated 7 years ago