Google's BigBird (Jax/Flax & PyTorch) @ 🤗Transformers
☆49Mar 20, 2023Updated 3 years ago
Alternatives and similar repositories for bigbird
Users that are interested in bigbird are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Transformers for Longer Sequences☆633Sep 1, 2022Updated 3 years ago
- Minimal implementation of Denoised Smoothing (https://arxiv.org/abs/2003.01908) in TensorFlow.☆20Aug 4, 2021Updated 4 years ago
- GC4LM: A Colossal (Biased) language model for German☆13May 2, 2021Updated 5 years ago
- ☆13May 30, 2022Updated 3 years ago
- This repository hosts the code to port NumPy model weights of BiT-ResNets to TensorFlow SavedModel format.☆14Dec 21, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- This repositary hosts my experiments for the project, I did with OffNote Labs.☆10Apr 12, 2021Updated 5 years ago
- Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP☆58Jul 28, 2022Updated 3 years ago
- Code for An Inductive Bias for Distances: Neural Nets that Respect the Triangle Inequality (ICLR 2020)☆11Mar 24, 2023Updated 3 years ago
- ☆21Jun 20, 2019Updated 6 years ago
- A multi-functional library for full-stack Deep Learning. Simplifies Model Building, API development, and Model Deployment.☆234Apr 6, 2026Updated last month
- (ACL-IJCNLP 2021) Convolutions and Self-Attention: Re-interpreting Relative Positions in Pre-trained Language Models.☆21Jul 13, 2022Updated 3 years ago
- Coding utilities for quantitative legal studies☆14Dec 7, 2025Updated 5 months ago
- ☆12Mar 4, 2025Updated last year
- [ICLR 2023] "Sparsity May Cry: Let Us Fail (Current) Sparse Neural Networks Together!" Shiwei Liu, Tianlong Chen, Zhenyu Zhang, Xuxi Chen…☆28Aug 29, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- 📄 Evidence Retrieval and Claim Verification for the FEVER shared task using Transformer Networks☆12Feb 21, 2020Updated 6 years ago
- A simple guestbook example for Container VMs on GCE. Uses Redis and Python/Flask.☆39May 7, 2015Updated 11 years ago
- ☆15Nov 22, 2023Updated 2 years ago
- ☆27Dec 12, 2024Updated last year
- [ICML2023] Instant Soup Cheap Pruning Ensembles in A Single Pass Can Draw Lottery Tickets from Large Models. Ajay Jaiswal, Shiwei Liu, Ti…☆11Nov 28, 2023Updated 2 years ago
- A baseline system for ContractNLI (https://stanfordnlp.github.io/contract-nli/)☆37Mar 10, 2023Updated 3 years ago
- A parameter-efficient compression model architecture for a variety of NLP tasks at BERT level performance at a fraction of the computatio…☆10Jan 25, 2026Updated 3 months ago
- Convert Transkribus PAGE-XML to standard PAGE-XML☆12Dec 10, 2025Updated 4 months ago
- ☆13Mar 27, 2020Updated 6 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆13Nov 27, 2025Updated 5 months ago
- ☆17Apr 30, 2025Updated last year
- Text summarization with python and transformer☆13Jun 17, 2023Updated 2 years ago
- ☆10Oct 15, 2019Updated 6 years ago
- Meta-learning learning rates with higher☆12Sep 27, 2019Updated 6 years ago
- GyroSPD: Vector-valued Distance and Gyrocalculus on the Space of Symmetric Positive Definite Matrices☆17Nov 8, 2021Updated 4 years ago
- texrex web page cleaning & ClaraX random walk crawler☆11Dec 13, 2021Updated 4 years ago
- Source code for Text Infilling, implemented with Texar.☆27Feb 18, 2019Updated 7 years ago
- Hands-on Python 3.x GUI Programming, Published by Packt☆13Jan 18, 2021Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- CNN ensemble for prostate cancer Gleason grading☆19Jan 28, 2026Updated 3 months ago
- Co:here-powered Slack App Starter Project☆13Apr 1, 2022Updated 4 years ago
- Survey on Machine Reading Comprehension☆147Jan 26, 2021Updated 5 years ago
- This is the implementation of the visual model mentioned in our paper 'Automated Radiology Report Generation using Conditioned Transforme…☆10Jul 25, 2024Updated last year
- code for "Natural Language to Code Translation with Execution"☆41Nov 2, 2022Updated 3 years ago
- Topic clustering library built on Transformer embeddings and cosine similarity metrics.Compatible with all BERT base transformers from hu…☆44Jun 11, 2021Updated 4 years ago
- Train small sequence models in your browser with WebGPU.☆34Dec 3, 2025Updated 5 months ago