☆254Feb 25, 2020Updated 6 years ago
Alternatives and similar repositories for pg19
Users that are interested in pg19 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆99Jul 7, 2020Updated 5 years ago
- Official code repository of the paper Learning Associative Inference Using Fast Weight Memory by Schlag et al.☆29Feb 25, 2021Updated 5 years ago
- Neural Text Generation with Unlikelihood Training☆311Aug 31, 2021Updated 4 years ago
- PyTorch original implementation of Cross-lingual Language Model Pretraining.☆2,928Feb 14, 2023Updated 3 years ago
- Source-to-Source Debuggable Derivatives in Pure Python☆15Jan 23, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆20May 30, 2024Updated last year
- Commonsense Explanations Dataset and Code☆147Jun 16, 2025Updated 9 months ago
- Long Range Arena for Benchmarking Efficient Transformers☆788Dec 16, 2023Updated 2 years ago
- Code for the paper "Stack Attention: Improving the Ability of Transformers to Model Hierarchical Patterns"☆18Mar 15, 2024Updated 2 years ago
- ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators☆2,373Mar 23, 2024Updated 2 years ago
- [NeurIPS 2023] Sparse Modular Activation for Efficient Sequence Modeling☆40Dec 2, 2023Updated 2 years ago
- ☆220Jun 8, 2020Updated 5 years ago
- Efficient PScan implementation in PyTorch☆17Jan 2, 2024Updated 2 years ago
- Randomized Positional Encodings Boost Length Generalization of Transformers☆82Mar 14, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Tools to download and cleanup Common Crawl data☆1,040Apr 25, 2023Updated 2 years ago
- Transformer training code for sequential tasks☆609Sep 14, 2021Updated 4 years ago
- Crawl BookCorpus☆854Jul 14, 2023Updated 2 years ago
- PyTorch implementation for PaLM: A Hybrid Parser and Language Model.☆10Jan 7, 2020Updated 6 years ago
- Fast, general, and tested differentiable structured prediction in PyTorch☆1,125Apr 20, 2022Updated 3 years ago
- JAX-based neural network library☆3,210Mar 31, 2026Updated last week
- ☆22Aug 31, 2021Updated 4 years ago
- The official code of EMNLP 2022, "SCROLLS: Standardized CompaRison Over Long Language Sequences".☆69Jan 12, 2024Updated 2 years ago
- Boolean Question Answering with multi-task learning and uses large LM embeddings like BERT, RoBERTa☆18Aug 30, 2019Updated 6 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆13Aug 19, 2024Updated last year
- Expanding linear RNN state-transition matrix eigenvalues to include negatives improves state-tracking tasks and language modeling without…☆21Mar 15, 2025Updated last year
- Longformer: The Long-Document Transformer☆2,190Feb 8, 2023Updated 3 years ago
- Data and code for the paper "The Moral Integrity Corpus: A Benchmark for Ethical Dialogue Systems"☆21Jul 18, 2023Updated 2 years ago
- ☆12Jan 29, 2021Updated 5 years ago
- Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"☆6,502Jan 14, 2026Updated 2 months ago
- Author implementation of the paper "CommonsenseQA: A Question Answering Challenge Targeting Commonsense Knowledge"☆168Jul 25, 2024Updated last year
- jiant is an nlp toolkit☆1,675Jul 6, 2023Updated 2 years ago
- Code for NAACL paper☆21Aug 31, 2018Updated 7 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆260Jun 6, 2025Updated 10 months ago
- Reformer, the efficient Transformer, in Pytorch☆2,190Jun 21, 2023Updated 2 years ago
- Implementation of AAAI 21 paper: Nested Named Entity Recognition with Partially Observed TreeCRFs☆50May 11, 2021Updated 4 years ago
- ☆45Nov 3, 2019Updated 6 years ago
- New dataset☆311Aug 31, 2021Updated 4 years ago
- NeurIPS 2019 - Learning Data Manipulation for Augmentation and Weighting☆110Sep 5, 2020Updated 5 years ago
- Engineering the state of RNN language models (Mamba, RWKV, etc.)☆32May 25, 2024Updated last year