ngoyal2707 / Megatron-LM
Ongoing research training transformer language models at scale, including: BERT & GPT-2
☆18 Updated 2 years ago
Alternatives and similar repositories for Megatron-LM
Users who are interested in Megatron-LM are comparing it to the repositories listed below
- ☆11 Updated 2 years ago
- URL downloader supporting checkpointing and continuous checksumming. ☆19 Updated last year
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he… ☆31 Updated last year
- Resources related to EMNLP 2021 paper "FAME: Feature-Based Adversarial Meta-Embeddings for Robust Input Representations" ☆13 Updated 3 years ago
- Compute-optimal LLMs ☆10 Updated 2 years ago
- Code for the paper "Code-Mixing on Sesame Street: Dawn of the Adversarial Polyglots" (NAACL-HLT 2021) ☆10 Updated last month
- This repo contains data and code for the paper "Reasoning over Public and Private Data in Retrieval-Based Systems." ☆46 Updated 10 months ago
- Ranking of fine-tuned HF models as base models. ☆35 Updated 3 weeks ago
- Code for "CyberWallE at SemEval-2020 Task 11: An Analysis of Feature Engineering for Ensemble Models for Propaganda Detection" (V. Blasch… ☆9 Updated 4 years ago
- A package for fine tuning of pretrained NLP transformers using Semi Supervised Learning ☆14 Updated 3 years ago
- Code for our paper Resources and Evaluations for Multi-Distribution Dense Information Retrieval ☆14 Updated last year
- GPT-jax based on the official huggingface library ☆13 Updated 3 years ago
- code for paper "Accessing higher dimensions for unsupervised word translation" ☆21 Updated last year
- Using short models to classify long texts ☆21 Updated 2 years ago
- Open source library for few shot NLP ☆78 Updated last year
- ☆14 Updated 8 months ago
- Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP ☆58 Updated 2 years ago
- PyTorch implementation of GLOM ☆22 Updated 3 years ago
- A client library for LAION's effort to filter CommonCrawl with CLIP, building a large scale image-text dataset. ☆30 Updated 2 years ago
- ☆32 Updated 2 years ago
- All my experiments with the various transformers and various transformer frameworks available ☆14 Updated 4 years ago
- ☆78 Updated last year
- The collection of building blocks for building fine-tunable metric learning models ☆32 Updated last month
- This repo contains code for the paper "Psychologically-informed chain-of-thought prompts for metaphor understanding in large language mod… ☆14 Updated 2 years ago
- ☆22 Updated 3 years ago
- Highly specialized crate to parse and use `google/sentencepiece`'s precompiled_charsmap in `tokenizers` ☆18 Updated 3 years ago
- Code for Stage-wise Fine-tuning for Graph-to-Text Generation ☆26 Updated 2 years ago
- Transformers at any scale ☆41 Updated last year
- Submission to the inverse scaling prize ☆23 Updated last year
- ☆28 Updated 2 years ago