ngoyal2707 / Megatron-LM
Ongoing research training transformer language models at scale, including: BERT & GPT-2
☆18 Updated 2 years ago
Alternatives and similar repositories for Megatron-LM
Users who are interested in Megatron-LM are comparing it to the libraries listed below.
- URL downloader supporting checkpointing and continuous checksumming. ☆19 Updated last year
- A client library for LAION's effort to filter CommonCrawl with CLIP, building a large scale image-text dataset. ☆32 Updated 2 years ago
- This repo contains data and code for the paper "Reasoning over Public and Private Data in Retrieval-Based Systems." ☆46 Updated 11 months ago
- ☆11 Updated 3 years ago
- The collection of building blocks for building fine-tunable metric learning models ☆32 Updated 3 months ago
- MozoLM: A language model (LM) serving library ☆45 Updated this week
- Cortex-compatible model server for Python and TensorFlow ☆17 Updated 2 years ago
- Experiments with Hugging Face 🔬 🤗 ☆44 Updated 10 months ago
- ☆32 Updated 2 years ago
- 🤗 Disaggregators: Curated data labelers for in-depth analysis. ☆66 Updated 2 years ago
- This project shows how to derive the total number of training tokens from a large text dataset from 🤗 datasets with Apache Beam and Data… ☆27 Updated 2 years ago
- Using short models to classify long texts ☆21 Updated 2 years ago
- Detecting gibberish as a type of sentiment analysis with GPT2 ☆24 Updated 4 years ago
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he… ☆31 Updated last year
- Helper scripts and notes that were used while porting various nlp models ☆46 Updated 3 years ago
- Efficiently computing & storing token n-grams from large corpora ☆24 Updated 9 months ago
- Hugging Face and Pyserini interoperability ☆20 Updated 2 years ago
- Execute arbitrary SQL queries on 🤗 Datasets ☆32 Updated last year
- ☆90 Updated 3 years ago
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P… ☆34 Updated last year
- GPT-jax based on the official huggingface library ☆13 Updated 4 years ago
- One stop shop for all things carp ☆59 Updated 2 years ago
- Code for the paper "Code-Mixing on Sesame Street: Dawn of the Adversarial Polyglots" (NAACL-HLT 2021) ☆10 Updated 2 months ago
- Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP ☆58 Updated 2 years ago
- Implementation of N-Grammer in Flax ☆17 Updated 2 years ago
- Developing tools to automatically analyze datasets ☆74 Updated 8 months ago
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training" ☆24 Updated 2 weeks ago
- Common crawl pretrained sentencepiece tokenizers for English and Japanese for various vocabulary sizes. Also development environment for … ☆10 Updated 3 years ago
- This repository hosts the code to port NumPy model weights of BiT-ResNets to TensorFlow SavedModel format. ☆14 Updated 3 years ago
- ☆22 Updated 5 months ago