Ongoing research training transformer language models at scale, including: BERT & GPT-2
☆18Feb 17, 2023Updated 3 years ago
Alternatives and similar repositories for Megatron-LM
Users that are interested in Megatron-LM are comparing it to the libraries listed below.
- Topic supervised non-negative matrix factorization with sparse matrices☆12Mar 24, 2020Updated 6 years ago
- Multiple correspondence analysis☆10Apr 2, 2015Updated 10 years ago
- Code for the anonymous submission "Cockpit: A Practical Debugging Tool for Training Deep Neural Networks"☆31Nov 24, 2020Updated 5 years ago
- Code for paper: Weakly- and Semi-supervised Evidence Extraction☆15Apr 12, 2021Updated 4 years ago
- Paper: "Predicting Subjective Features from Questions on QA Websites using BERT"☆14May 22, 2022Updated 3 years ago
- ☆30May 30, 2023Updated 2 years ago
- Repository for the implementation and evaluation of DD-GloVe, a train-time debiasing algorithm to learn GloVe word embeddings by leveragi…☆13May 29, 2022Updated 3 years ago
- A set of pre-trained machine-learning models that predict (im-)politeness scores in texts.☆19Jan 2, 2025Updated last year
- Testing the performance of CNN and BERT embeddings on GLUE tasks☆15Mar 24, 2023Updated 3 years ago
- 💡Light Bulb is a tool to help you label, train, test and deploy machine learning models without any coding.☆25Feb 15, 2023Updated 3 years ago
- OffensEval2020 Shared Task☆17Apr 5, 2021Updated 4 years ago
- [Deprecated] An unofficial API for Quora.☆17Jan 17, 2017Updated 9 years ago
- Android Videokit - basic FFMPEG build for Android with x264 and libtheora support.☆22Jun 23, 2012Updated 13 years ago
- ☆20Aug 30, 2022Updated 3 years ago
- ☆10Oct 15, 2014Updated 11 years ago
- ☆19Dec 9, 2024Updated last year
- Product Quantization k-Nearest Neighbors☆21Jun 24, 2021Updated 4 years ago
- I have created a dataset of Image-Text-Pairs by using the cosine similarity of the CLIP embeddings of the image & its caption derived f…☆16Apr 22, 2021Updated 4 years ago
- ☆41Oct 3, 2024Updated last year
- An Interactive Tool for Natural Language Processing on Clinical Text☆23Aug 20, 2021Updated 4 years ago
- ☆14May 15, 2025Updated 10 months ago
- Interpretation of Isolation Forests☆21Jun 17, 2024Updated last year
- Code for "On Long-Tailed Phenomena in NMT".☆10Jan 10, 2021Updated 5 years ago
- Conway's Game of Life using experimental Scala.js WebAssembly backend☆15Apr 24, 2025Updated 11 months ago
- Lawma: A lightly fine-tuned Llama model for legal classification tasks.☆28Sep 14, 2024Updated last year
- 🧬 an evolving design philosophy (masquerading as a color scheme)☆11Dec 8, 2025Updated 3 months ago
- A multi-frame-inpainting script for stable diffusion webui☆11Apr 7, 2023Updated 2 years ago
- Official implementation of AnimateDiff.☆10Sep 27, 2023Updated 2 years ago
- Deep Learning for Coders with fastai and PyTorch: AI Applications Without a PhD - the book and the course☆16Jun 7, 2022Updated 3 years ago
- ☆54Jan 29, 2018Updated 8 years ago
- ☆86Dec 26, 2022Updated 3 years ago
- auto image cropping/composition methods☆16Oct 23, 2018Updated 7 years ago
- Highly concurrent and fast content processing for Mighty Inference Server☆10Feb 6, 2023Updated 3 years ago
- A network service that allows credit card payment for Sia storage.☆12May 11, 2025Updated 10 months ago
- Data and some code for the DopeLearning paper☆28Jun 4, 2016Updated 9 years ago
- A monolithic index that supports worst-case optimal joins (WCOJ) by providing all collation orders in a single redundancy eliminating dat…☆16Sep 18, 2025Updated 6 months ago
- MXNet implementation of AC-BLSTM☆23Apr 3, 2019Updated 6 years ago
- pialign - A Phrasal ITG Aligner☆24Apr 29, 2019Updated 6 years ago
- Prose Markup Language☆10Mar 31, 2023Updated 2 years ago