☆259Feb 25, 2020Updated 6 years ago
Alternatives and similar repositories for pg19
Users that are interested in pg19 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆99Jul 7, 2020Updated 5 years ago
- Official code repository of the paper Learning Associative Inference Using Fast Weight Memory by Schlag et al.☆30Feb 25, 2021Updated 5 years ago
- Neural Text Generation with Unlikelihood Training☆311Aug 31, 2021Updated 4 years ago
- PyTorch original implementation of Cross-lingual Language Model Pretraining.☆2,929Feb 14, 2023Updated 3 years ago
- Source-to-Source Debuggable Derivatives in Pure Python☆15Jan 23, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Commonsense Explanations Dataset and Code☆147Jun 16, 2025Updated last year
- Long Range Arena for Benchmarking Efficient Transformers☆788Dec 16, 2023Updated 2 years ago
- Code for the paper "Stack Attention: Improving the Ability of Transformers to Model Hierarchical Patterns"☆18Mar 15, 2024Updated 2 years ago
- ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators☆2,367Mar 23, 2024Updated 2 years ago
- [NeurIPS 2023] Sparse Modular Activation for Efficient Sequence Modeling☆40Dec 2, 2023Updated 2 years ago
- ☆220Jun 8, 2020Updated 6 years ago
- Efficient PScan implementation in PyTorch☆17Jan 2, 2024Updated 2 years ago
- Randomized Positional Encodings Boost Length Generalization of Transformers☆83Mar 14, 2024Updated 2 years ago
- Tools to download and cleanup Common Crawl data☆1,045Apr 25, 2023Updated 3 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Transformer training code for sequential tasks☆610Sep 14, 2021Updated 4 years ago
- Crawl BookCorpus☆862Jul 14, 2023Updated 2 years ago
- PyTorch implementation for PaLM: A Hybrid Parser and Language Model.☆10Jan 7, 2020Updated 6 years ago
- Fast, general, and tested differentiable structured prediction in PyTorch☆1,131Apr 20, 2022Updated 4 years ago
- JAX-based neural network library☆3,242Jun 2, 2026Updated 3 weeks ago
- ☆22Aug 31, 2021Updated 4 years ago
- The official code of EMNLP 2022, "SCROLLS: Standardized CompaRison Over Long Language Sequences".☆69Jan 12, 2024Updated 2 years ago
- Boolean Question Answering with multi-task learning and uses large LM embeddings like BERT, RoBERTa☆18Aug 30, 2019Updated 6 years ago
- ☆13Aug 19, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Expanding linear RNN state-transition matrix eigenvalues to include negatives improves state-tracking tasks and language modeling without…☆22Mar 15, 2025Updated last year
- Longformer: The Long-Document Transformer☆2,196Feb 8, 2023Updated 3 years ago
- Data and code for the paper "The Moral Integrity Corpus: A Benchmark for Ethical Dialogue Systems"☆21Jul 18, 2023Updated 2 years ago
- ☆12Jan 29, 2021Updated 5 years ago
- Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"☆6,528Jan 14, 2026Updated 5 months ago
- Author implementation of the paper "CommonsenseQA: A Question Answering Challenge Targeting Commonsense Knowledge"☆169Jul 25, 2024Updated last year
- jiant is an nlp toolkit☆1,676Jul 6, 2023Updated 2 years ago
- Code for NAACL paper☆21Aug 31, 2018Updated 7 years ago
- ☆260Jun 6, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Reformer, the efficient Transformer, in Pytorch☆2,193Jun 21, 2023Updated 3 years ago
- Implementation of AAAI 21 paper: Nested Named Entity Recognition with Partially Observed TreeCRFs☆50May 11, 2021Updated 5 years ago
- ☆45Nov 3, 2019Updated 6 years ago
- New dataset☆311Aug 31, 2021Updated 4 years ago
- NeurIPS 2019 - Learning Data Manipulation for Augmentation and Weighting☆110Sep 5, 2020Updated 5 years ago
- Engineering the state of RNN language models (Mamba, RWKV, etc.)☆32May 25, 2024Updated 2 years ago
- ☆12Jun 5, 2024Updated 2 years ago