Ongoing research training transformer language models at scale, including: BERT & GPT-2
☆18Feb 17, 2023Updated 3 years ago
Alternatives and similar repositories for Megatron-LM
Users that are interested in Megatron-LM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Codebase for multilingual neural machine translation☆13Nov 24, 2022Updated 3 years ago
- ☆10Jul 15, 2024Updated last year
- Code to reproduce the paper "Do causal predictors generalize better to new domains?"☆17Feb 7, 2025Updated last year
- 5th Place Solution to 3rd YouTube-8M Video Understanding Challenge (Last Top GB Model)☆13Oct 23, 2019Updated 6 years ago
- Techniques & resources for training interpretable ML models, explaining ML models, and debugging ML models.☆21Feb 2, 2026Updated 5 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- SAFE Drive: access SAFE Network using the file system of Windows, Mac OS and Linux☆14Dec 9, 2022Updated 3 years ago
- Code for paper: Weakly- and Semi-supervised Evidence Extraction☆15Apr 12, 2021Updated 5 years ago
- Paper: "Predicting Subjective Features from Questions on QA Websites using BERT"☆14May 22, 2022Updated 4 years ago
- ☆11Jan 19, 2026Updated 5 months ago
- Safe serialization of ML models☆18Apr 21, 2023Updated 3 years ago
- Unofficial download repository for MusicCaps☆47Apr 21, 2023Updated 3 years ago
- Example of writing a backtesting framework from scratch☆15Apr 8, 2021Updated 5 years ago
- MuCR is a benchmark designed to evaluate Multimodal Large Language Models' (MLLMs) ability to discern causal links across modalities☆20May 27, 2025Updated last year
- Using GPT-3 to detect hate speech that contains sexist and racist content☆24Nov 11, 2025Updated 7 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆17Mar 1, 2025Updated last year
- Repository for the implementation and evaluation of DD-GloVe, a train-time debiasing algorithm to learn GloVe word embeddings by leveragi…☆13May 29, 2022Updated 4 years ago
- 💡Light Bulb is a tool to help you label, train, test and deploy machine learning models without any coding.☆25Feb 15, 2023Updated 3 years ago
- ☆22Dec 4, 2023Updated 2 years ago
- A Pytorch-based library to evaluate learning methods on small image classification datasets☆18Jun 22, 2022Updated 4 years ago
- OffensEval2020 Shared Task☆17Apr 5, 2021Updated 5 years ago
- Detecting bursty terms in computer science☆10Feb 2, 2022Updated 4 years ago
- 📊 Easy plug-and-chug discounted cash flow model framework that allows for advanced modeling and sensitivity tests.☆18Aug 28, 2020Updated 5 years ago
- [Deprecated] An unofficial API for Quora.☆17Jan 17, 2017Updated 9 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Android Videokit - basic FFMPEG build for Android with x264 and libtheora support.☆22Jun 23, 2012Updated 14 years ago
- End-to-end integration of HuggingFace's models for sequence labeling.☆11Oct 4, 2020Updated 5 years ago
- Collection of brief notes from 592 lectures (started in 2014)☆13Aug 9, 2023Updated 2 years ago
- ☆10Oct 15, 2014Updated 11 years ago
- ☆19Dec 9, 2024Updated last year
- tiktoken is a BPE tokeniser for use with OpenAI's models☆26Jul 16, 2023Updated 2 years ago
- ☆42Oct 3, 2024Updated last year
- ☆14May 15, 2025Updated last year
- Chinese Word Segmentation task based on BERT and implemented in Pytorch☆14Aug 14, 2020Updated 5 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Interpretation of Isolation Forests☆22Jun 17, 2024Updated 2 years ago
- Code for "On Long-Tailed Phenomena in NMT".☆10Jan 10, 2021Updated 5 years ago
- Assorted tools and utility functions, mainly for doing NLP with Python☆23Sep 12, 2025Updated 9 months ago
- Text generation with entities as context☆30Jun 13, 2018Updated 8 years ago
- Lawma: A lightly fine-tuned Llama model for legal classification tasks.☆31Sep 14, 2024Updated last year
- 🧬 an evolving design philosophy (masquerading as a color scheme)☆11Dec 8, 2025Updated 6 months ago
- A multi-frame-inpainting script for stable diffusion webui☆11Apr 7, 2023Updated 3 years ago