Ongoing research training transformer language models at scale, including: BERT
☆16Apr 25, 2019Updated 6 years ago
Alternatives and similar repositories for Megatron-LM
Users that are interested in Megatron-LM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Question Answering with Interactive Text (QAit), code for EMNLP 2019 paper "Interactive Language Learning by Question Answering"☆44Sep 3, 2019Updated 6 years ago
- ELECTRA MODEL NLP☆13Apr 8, 2020Updated 5 years ago
- This is a solution accelerator for creating personalized content recommendations based on user activity.☆13Mar 26, 2024Updated 2 years ago
- This code accompanies the paper "Bayesian Framework for Information-Theoretic Probing" published in EMNLP 2021.☆10Aug 23, 2021Updated 4 years ago
- ☆13Jul 10, 2021Updated 4 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Two-day level 300 Azure Synapse Analytics workshop☆11Mar 16, 2021Updated 5 years ago
- Code for Casual Indoor HDR Radiance Capture from Omnidirectional Images. BMVC 22☆13Dec 16, 2022Updated 3 years ago
- An API wrapper for @convertio (convertio.co) written in Python.☆12Jun 30, 2025Updated 8 months ago
- ☆11Aug 10, 2021Updated 4 years ago
- Authors' implementation of EMNLP-IJCNLP 2019 paper "Answering Complex Open-domain Questions Through Iterative Query Generation"☆195Oct 29, 2019Updated 6 years ago
- Patch-based inpainting Python library☆12Mar 5, 2023Updated 3 years ago
- Code to support the paper "Question and Answer Test-Train Overlap in Open-Domain Question Answering Datasets"☆65Aug 31, 2021Updated 4 years ago
- ☆15Oct 10, 2021Updated 4 years ago
- a small demo repo to show how I got neuralbeagle14-7b running locally on my 8GB GPU☆14Jan 29, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Pointer Networks Implementation in Keras☆11Aug 17, 2017Updated 8 years ago
- NMT with ssp☆11Oct 28, 2021Updated 4 years ago
- A Large-Scale Dataset for Paraphrased Reading Comprehension☆15Jul 16, 2023Updated 2 years ago
- ImageNet-12k subset of ImageNet-21k (fall11)☆22Jun 13, 2023Updated 2 years ago
- Private Preview: Responsible AI Tooling in Azure Machine Learning☆18Mar 28, 2022Updated 3 years ago
- Implementation of "Adversarial Text Generation without Reinforcement Learning"☆12Mar 13, 2019Updated 7 years ago
- The Shape of Data: Intrinsic Distance for Comparing Data Distributions☆12Sep 25, 2019Updated 6 years ago
- Code to compute AnthroScore, a computational linguistic measure of anthropomorphism in text☆18Mar 31, 2025Updated 11 months ago
- Feel the Vibes☆13Feb 26, 2025Updated last year
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- PyTorch implementation of context2vec from Melamud et al., CoNLL 2016☆19Sep 25, 2018Updated 7 years ago
- open book question answering☆15Dec 8, 2022Updated 3 years ago
- A project to translate the Voynich Manuscript into English☆11Jun 30, 2023Updated 2 years ago
- Word sense disambiguation test sets for NMT☆20Dec 3, 2020Updated 5 years ago
- ☆12Mar 13, 2025Updated last year
- Implementation of Relation Network and Recurrent Relational Network using PyTorch v1.3. Original papers: (RN) https://arxiv.org/abs/1706.…☆19Feb 4, 2022Updated 4 years ago
- A Multi-Type Multi-Span Network for Reading Comprehension that Requires Discrete Reasoning☆89Nov 19, 2019Updated 6 years ago
- ☆14Feb 2, 2025Updated last year
- ☆10Nov 11, 2016Updated 9 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Here I show how to use Deep Learning for biological and biomedical Data Integration.☆11Sep 17, 2020Updated 5 years ago
- Supporting code for the paper "Portuguese Language Models and Word Embeddings: Evaluating on Semantic Similarity Tasks".☆11Dec 8, 2022Updated 3 years ago
- This is a comprehensive guide on how you can automate your feature engineering process.☆11Jun 25, 2018Updated 7 years ago
- Exploring the relationships in the historical data of weather, wind generated electricity and electricity demand. Base on the analysis, u…☆13Oct 12, 2021Updated 4 years ago
- Evals is a framework for evaluating OpenAI models and an open-source registry of benchmarks.☆18Mar 23, 2023Updated 3 years ago
- ☆23May 5, 2022Updated 3 years ago
- A Chainlit App Used to Showcase: Async, Caching, Additional Chainlit Methods, and more!☆11Oct 1, 2024Updated last year