Google's BigBird (Jax/Flax & PyTorch) @ 🤗Transformers
☆49Mar 20, 2023Updated 3 years ago
Alternatives and similar repositories for bigbird
Users that are interested in bigbird are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Transformers for Longer Sequences☆633Sep 1, 2022Updated 3 years ago
- Minimal implementation of Denoised Smoothing (https://arxiv.org/abs/2003.01908) in TensorFlow.☆20Aug 4, 2021Updated 4 years ago
- GC4LM: A Colossal (Biased) language model for German☆13May 2, 2021Updated 4 years ago
- ☆13May 30, 2022Updated 3 years ago
- This repository hosts code for converting the original MLP Mixer models (JAX) to TensorFlow.☆15Sep 29, 2021Updated 4 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP☆58Jul 28, 2022Updated 3 years ago
- ☆21Jun 20, 2019Updated 6 years ago
- A text truncation method, useful for instance in long text classification☆22Jun 22, 2022Updated 3 years ago
- A multi-functional library for full-stack Deep Learning. Simplifies Model Building, API development, and Model Deployment.☆233Apr 6, 2026Updated last week
- Deep Program Structure Modeling ThroughMulti-Relational Graph-based Learning☆10May 24, 2021Updated 4 years ago
- ☆24May 31, 2025Updated 10 months ago
- [WIP] Behold, semantic-search, built over sentence-transformers to make it easy for search engineers to evaluate, optimise and deploy mod…☆15Apr 21, 2023Updated 2 years ago
- Replication of AST Neural Network from Zhang J. et. al (2019) and application to software vulnerability detection☆12Jan 13, 2020Updated 6 years ago
- Coding utilities for quantitative legal studies☆14Dec 7, 2025Updated 4 months ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆20Apr 12, 2024Updated 2 years ago
- ☆12Mar 4, 2025Updated last year
- [ICLR 2023] "Sparsity May Cry: Let Us Fail (Current) Sparse Neural Networks Together!" Shiwei Liu, Tianlong Chen, Zhenyu Zhang, Xuxi Chen…☆28Aug 29, 2023Updated 2 years ago
- Text Classification Dataset for Turkish Language☆10Nov 16, 2021Updated 4 years ago
- 📄 Evidence Retrieval and Claim Verification for the FEVER shared task using Transformer Networks☆12Feb 21, 2020Updated 6 years ago
- A simple guestbook example for Container VMs on GCE. Uses Redis and Python/Flask.☆39May 7, 2015Updated 10 years ago
- ☆15Nov 22, 2023Updated 2 years ago
- Code for Learning Bregman Divergences☆13Oct 23, 2021Updated 4 years ago
- ☆27Dec 12, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- [ICML2023] Instant Soup Cheap Pruning Ensembles in A Single Pass Can Draw Lottery Tickets from Large Models. Ajay Jaiswal, Shiwei Liu, Ti…☆11Nov 28, 2023Updated 2 years ago
- A baseline system for ContractNLI (https://stanfordnlp.github.io/contract-nli/)☆37Mar 10, 2023Updated 3 years ago
- Code for reproducing results in Delayed Impact of Fair Machine Learning (Liu et al 2018)☆14Jul 23, 2022Updated 3 years ago
- A collection of scripts to preprocess ASR datasets and finetune language-specific Wav2Vec2 XLSR models☆30Apr 21, 2021Updated 4 years ago
- gradio bbox labeling tools☆11May 12, 2023Updated 2 years ago
- A parameter-efficient compression model architecture for a variety of NLP tasks at BERT level performance at a fraction of the computatio…☆10Jan 25, 2026Updated 2 months ago
- ☆13Mar 27, 2020Updated 6 years ago
- Text summarization with python and transformer☆13Jun 17, 2023Updated 2 years ago
- Neural question generation using transformers☆1,145Apr 5, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆10Oct 15, 2019Updated 6 years ago
- ☆13Mar 9, 2024Updated 2 years ago
- Source code for Text Infilling, implemented with Texar.☆27Feb 18, 2019Updated 7 years ago
- Co:here-powered Slack App Starter Project☆13Apr 1, 2022Updated 4 years ago
- Survey on Machine Reading Comprehension☆147Jan 26, 2021Updated 5 years ago
- This is the implementation of the visual model mentioned in our paper 'Automated Radiology Report Generation using Conditioned Transforme…☆10Jul 25, 2024Updated last year
- ☆13Feb 12, 2023Updated 3 years ago