Ongoing research training transformer language models at scale, including: BERT
☆16Apr 25, 2019Updated 6 years ago
Alternatives and similar repositories for Megatron-LM
Users that are interested in Megatron-LM are comparing it to the libraries listed below.
- Question Answering with Interactive Text (QAit), code for EMNLP 2019 paper "Interactive Language Learning by Question Answering"☆44Sep 3, 2019Updated 6 years ago
- Code for Casual Indoor HDR Radiance Capture from Omnidirectional Images. BMVC 22☆13Dec 16, 2022Updated 3 years ago
- ☆14Nov 28, 2022Updated 3 years ago
- Dataset and pre-trained model of EMNLP-IJCNLP 2019 paper "TalkDown: A Corpus for Condescension Detection in Context."☆10Jan 26, 2020Updated 6 years ago
- ☆14Jul 10, 2021Updated 4 years ago
- ☆12Oct 10, 2021Updated 4 years ago
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…☆139Apr 8, 2026Updated last week
- An API wrapper for @convertio (convertio.co) written in Python.☆12Jun 30, 2025Updated 9 months ago
- ☆11Aug 10, 2021Updated 4 years ago
- ☆14Jun 18, 2023Updated 2 years ago
- Authors' implementation of EMNLP-IJCNLP 2019 paper "Answering Complex Open-domain Questions Through Iterative Query Generation"☆195Oct 29, 2019Updated 6 years ago
- Code to support the paper "Question and Answer Test-Train Overlap in Open-Domain Question Answering Datasets"☆65Aug 31, 2021Updated 4 years ago
- ☆15Oct 10, 2021Updated 4 years ago
- A Large-Scale Dataset for Paraphrased Reading Comprehension☆15Jul 16, 2023Updated 2 years ago
- SC-Adagrad, SC-RMSProp and RMSProp algorithms for training deep networks proposed in☆14Oct 5, 2018Updated 7 years ago
- A toolkit for neural language modeling using Tensorflow including basic models like RNNs and LSTMs as well as more advanced models.☆21Jan 31, 2019Updated 7 years ago
- ImageNet-12k subset of ImageNet-21k (fall11)☆22Jun 13, 2023Updated 2 years ago
- A project to collect all Tamil nouns☆12Dec 14, 2024Updated last year
- A collection of handy tools such as adding Key & BPM to your music library☆16Mar 8, 2023Updated 3 years ago
- ☆18Jan 3, 2025Updated last year
- Code to compute AnthroScore, a computational linguistic measure of anthropomorphism in text☆18Mar 31, 2025Updated last year
- Feel the Vibes☆13Feb 26, 2025Updated last year
- PyTorch implementation of context2vec from Melamud et al., CoNLL 2016☆19Sep 25, 2018Updated 7 years ago
- Word sense disambiguation test sets for NMT☆20Dec 3, 2020Updated 5 years ago
- ICLR 2019 paper: "textTOvec: DEEP CONTEXTUALIZED NEURAL AUTOREGRESSIVE TOPIC MODELS OF LANGUAGE WITH DISTRIBUTED COMPOSITIONAL PRIOR"☆25Dec 30, 2018Updated 7 years ago
- Experiments for recognising textual entailment☆14Oct 12, 2012Updated 13 years ago
- ☆14Jul 8, 2024Updated last year
- ☆12Mar 13, 2025Updated last year
- A Multi-Type Multi-Span Network for Reading Comprehension that Requires Discrete Reasoning☆89Nov 19, 2019Updated 6 years ago
- ☆14Feb 2, 2025Updated last year
- ☆11Nov 11, 2016Updated 9 years ago
- Here I show how to use Deep Learning for biological and biomedical Data Integration.☆11Sep 17, 2020Updated 5 years ago
- ☆17Jun 20, 2023Updated 2 years ago
- Exploring the relationships in the historical data of weather, wind-generated electricity and electricity demand. Based on the analysis, u…☆13Oct 12, 2021Updated 4 years ago
- The notebooks used to demonstrate the blog post about Interpretability in ML☆12Dec 7, 2019Updated 6 years ago
- PyTorch Implementation of "Learning Natural Language Inference with LSTM", 2016, S. Wang et al. (https://arxiv.org/pdf/1512.08849.pdf)☆19Dec 23, 2022Updated 3 years ago
- PyTorch implementation of HashedNets☆38Apr 21, 2023Updated 2 years ago
- ☆18Apr 26, 2025Updated 11 months ago
- Code for EMNLP 2019 Paper "Do NLP Models Know Numbers? Probing Numeracy in Embeddings."☆21Dec 15, 2019Updated 6 years ago