ngoyal2707 / Megatron-LM
Ongoing research training transformer language models at scale, including: BERT & GPT-2
☆18Updated 2 years ago
Alternatives and similar repositories for Megatron-LM:
Users that are interested in Megatron-LM are comparing it to the libraries listed below
- ☆11Updated 2 years ago
- URL downloader supporting checkpointing and continuous checksumming.☆19Updated last year
- This repo contains code for the paper "Psychologically-informed chain-of-thought prompts for metaphor understanding in large language mod…☆14Updated last year
- Compute-optimal LLMs☆11Updated 2 years ago
- Code for the paper "Code-Mixing on Sesame Street: Dawn of the Adversarial Polyglots" (NAACL-HLT 2021)☆10Updated 3 years ago
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he…☆31Updated last year
- Highly specialized crate to parse and use `google/sentencepiece` 's precompiled_charsmap in `tokenizers`☆18Updated 2 years ago
- Using short models to classify long texts☆21Updated 2 years ago
- Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP☆58Updated 2 years ago
- ☆23Updated 2 months ago
- Embedding Recycling for Language models☆38Updated last year
- Cortex-compatible model server for Python and TensorFlow☆17Updated 2 years ago
- Resources related to EMNLP 2021 paper "FAME: Feature-Based Adversarial Meta-Embeddings for Robust Input Representations"☆13Updated 3 years ago
- NLG Best Practices for Data-Efficient Modeling How to Train Production-Ready Models with Little Data☆10Updated 3 years ago
- [ICLR 2022] Pretraining Text Encoders with Adversarial Mixture of Training Signal Generators☆24Updated last year
- ☆32Updated last year
- FAMIE: A Fast Active Learning Framework for Multilingual Information Extraction☆24Updated 2 years ago
- ☆14Updated 6 months ago
- Code for Stage-wise Fine-tuning for Graph-to-Text Generation☆26Updated 2 years ago
- The collection of bulding blocks building fine-tunable metric learning models☆32Updated last week
- A file utility for accessing both local and remote files through a unified interface.☆40Updated last month
- TorchServe+Streamlit for easily serving your HuggingFace NER models☆33Updated 2 years ago
- Developing tools to automatically analyze datasets☆74Updated 5 months ago
- ☆15Updated last week
- A package for fine tuning of pretrained NLP transformers using Semi Supervised Learning☆15Updated 3 years ago
- ☆16Updated 11 months ago
- GPT-jax based on the official huggingface library☆13Updated 3 years ago
- Helper scripts and notes that were used while porting various nlp models☆46Updated 3 years ago
- Python tools for processing the stackexchange data dumps into a text dataset for Language Models☆81Updated last year
- A client library for LAION's effort to filter CommonCrawl with CLIP, building a large scale image-text dataset.☆33Updated 2 years ago