☆255Feb 25, 2020Updated 6 years ago
Alternatives and similar repositories for pg19
Users that are interested in pg19 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆99Jul 7, 2020Updated 5 years ago
- Official code repository of the paper Learning Associative Inference Using Fast Weight Memory by Schlag et al.☆29Feb 25, 2021Updated 5 years ago
- Neural Text Generation with Unlikelihood Training☆311Aug 31, 2021Updated 4 years ago
- PyTorch original implementation of Cross-lingual Language Model Pretraining.☆2,932Feb 14, 2023Updated 3 years ago
- Source-to-Source Debuggable Derivatives in Pure Python☆15Jan 23, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆20May 30, 2024Updated last year
- Commonsense Explanations Dataset and Code☆147Jun 16, 2025Updated 10 months ago
- Long Range Arena for Benchmarking Efficient Transformers☆788Dec 16, 2023Updated 2 years ago
- Code for the paper "Stack Attention: Improving the Ability of Transformers to Model Hierarchical Patterns"☆18Mar 15, 2024Updated 2 years ago
- ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators☆2,372Mar 23, 2024Updated 2 years ago
- [NeurIPS 2023] Sparse Modular Activation for Efficient Sequence Modeling☆40Dec 2, 2023Updated 2 years ago
- ☆220Jun 8, 2020Updated 5 years ago
- Efficient PScan implementation in PyTorch☆17Jan 2, 2024Updated 2 years ago
- Randomized Positional Encodings Boost Length Generalization of Transformers☆82Mar 14, 2024Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Tools to download and cleanup Common Crawl data☆1,042Apr 25, 2023Updated 3 years ago
- Transformer training code for sequential tasks☆610Sep 14, 2021Updated 4 years ago
- Crawl BookCorpus☆855Jul 14, 2023Updated 2 years ago
- PyTorch implementation for PaLM: A Hybrid Parser and Language Model.☆10Jan 7, 2020Updated 6 years ago
- Fast, general, and tested differentiable structured prediction in PyTorch☆1,128Apr 20, 2022Updated 4 years ago
- JAX-based neural network library☆3,226Updated this week
- ☆22Aug 31, 2021Updated 4 years ago
- The official code of EMNLP 2022, "SCROLLS: Standardized CompaRison Over Long Language Sequences".☆69Jan 12, 2024Updated 2 years ago
- Boolean Question Answering with multi-task learning and uses large LM embeddings like BERT, RoBERTa☆18Aug 30, 2019Updated 6 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- ☆13Aug 19, 2024Updated last year
- Expanding linear RNN state-transition matrix eigenvalues to include negatives improves state-tracking tasks and language modeling without…☆21Mar 15, 2025Updated last year
- Longformer: The Long-Document Transformer☆2,194Feb 8, 2023Updated 3 years ago
- Data and code for the paper "The Moral Integrity Corpus: A Benchmark for Ethical Dialogue Systems"☆21Jul 18, 2023Updated 2 years ago
- ☆12Jan 29, 2021Updated 5 years ago
- Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"☆6,508Jan 14, 2026Updated 3 months ago
- Author implementation of the paper "CommonsenseQA: A Question Answering Challenge Targeting Commonsense Knowledge"☆168Jul 25, 2024Updated last year
- jiant is an nlp toolkit☆1,676Jul 6, 2023Updated 2 years ago
- Code for NAACL paper☆21Aug 31, 2018Updated 7 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆260Jun 6, 2025Updated 10 months ago
- Implementation of AAAI 21 paper: Nested Named Entity Recognition with Partially Observed TreeCRFs☆50May 11, 2021Updated 4 years ago
- ☆45Nov 3, 2019Updated 6 years ago
- New dataset☆311Aug 31, 2021Updated 4 years ago
- NeurIPS 2019 - Learning Data Manipulation for Augmentation and Weighting☆110Sep 5, 2020Updated 5 years ago
- Engineering the state of RNN language models (Mamba, RWKV, etc.)☆32May 25, 2024Updated last year
- ☆12Jun 5, 2024Updated last year