Ongoing research training transformer language models at scale, including: BERT
β16Apr 25, 2019Updated 7 years ago
Alternatives and similar repositories for Megatron-LM
Users that are interested in Megatron-LM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- πΈπΉπ₯ Band-in-a-Box/RealBand automation scriptsβ15May 23, 2022Updated 4 years ago
- A common protocol for AI agent toolsβ10Oct 21, 2024Updated last year
- NEM(NIS1) simple library of Python3β17Dec 27, 2018Updated 7 years ago
- ELECTRA MODEL NLPβ13Apr 8, 2020Updated 6 years ago
- LLM Assistent with Chat Integrationβ14Sep 5, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer β’ AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A list of Japanese verbs and adjectives.β23Oct 1, 2025Updated 8 months ago
- Code for Casual Indoor HDR Radiance Capture from Omnidirectional Images. BMVC 22β13Dec 16, 2022Updated 3 years ago
- Dataset and pre-trained model of EMNLP-IJCNLP 2019 paper "TalkDown: A Corpus for Condescension Detection in Context."β10Jan 26, 2020Updated 6 years ago
- β14Jul 10, 2021Updated 4 years ago
- Automatic Korean Hanja tagging tool powered by Hanjaro (hanjaro.juntong.or.kr)β19Feb 22, 2019Updated 7 years ago
- β17Jun 27, 2024Updated last year
- Python port to the normalizer in https://github.com/twitter/twitter-korean-textβ12Apr 26, 2016Updated 10 years ago
- Starter Code for the Course 2 project of the Udacity ML DevOps Nanodegree Programβ22Jun 20, 2024Updated last year
- β12Oct 10, 2021Updated 4 years ago
- AI Agents on DigitalOcean Gradient AI Platform β’ AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Code and data for SCED sentence cloze datasetβ12Dec 8, 2022Updated 3 years ago
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languagβ¦β143May 6, 2026Updated last month
- Skillset Challenge for the Apprenticeship Programβ22Jan 8, 2022Updated 4 years ago
- An API wrapper for @convertio (convertio.co) written in Python.β12Jun 30, 2025Updated 11 months ago
- β11Aug 10, 2021Updated 4 years ago
- β14Jun 18, 2023Updated 2 years ago
- Authors' implementation of EMNLP-IJCNLP 2019 paper "Answering Complex Open-domain Questions Through Iterative Query Generation"β195Oct 29, 2019Updated 6 years ago
- β10Jan 5, 2015Updated 11 years ago
- Pointer Networks Implementation in Kerasβ11Aug 17, 2017Updated 8 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- NMT with sspβ11Oct 28, 2021Updated 4 years ago
- Conference talk: from zero to your first LLM applicationβ17Jul 1, 2024Updated last year
- A toolkit for neural language modeling using Tensorflow including basic models like RNNs and LSTMs as well as more advanced models.β21Jan 31, 2019Updated 7 years ago
- ImageNet-12k subset of ImageNet-21k (fall11)β23Jun 13, 2023Updated 3 years ago
- Rust server that summarizes text with pre-trained modelsβ18Mar 8, 2023Updated 3 years ago
- Auto-Video maker handling many AI'sβ11Mar 18, 2024Updated 2 years ago
- upload a manim script and generate an animationβ11Mar 10, 2024Updated 2 years ago
- A project to collect all tamil nounsβ12Dec 14, 2024Updated last year
- Debiasing Methods in Natural Language Understanding Make Bias More Accessible:Β Code and Dataβ14Apr 24, 2022Updated 4 years ago
- GPUs on demand by Runpod - Special Offer Available β’ AdRun AI, ML, and HPC workloads on powerful cloud GPUsβwithout limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Pytorch implementation for our NeurIPS 2019 paper "TAB-VCR: Tags and Attributes based VCR Baselines" https://arxiv.org/abs/1910.14671β19May 6, 2021Updated 5 years ago
- Source Code for paper "Learning from Explanations with Neural Execution Tree", ICLR 2020β18Mar 29, 2021Updated 5 years ago
- A very basic C++ trading engine based on QuickFIX Engineβ26Jan 1, 2013Updated 13 years ago
- Implementation of "Adversarial Text Generation without Reinforcement Learning"β12Mar 13, 2019Updated 7 years ago
- The Shape of Data: Intrinsic Distance for Comparing Data Distributionsβ12Sep 25, 2019Updated 6 years ago
- β17Jan 3, 2025Updated last year
- β18Jun 1, 2021Updated 5 years ago