Google's BigBird (Jax/Flax & PyTorch) @ π€Transformers
β49Mar 20, 2023Updated 3 years ago
Alternatives and similar repositories for bigbird
Users that are interested in bigbird are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Transformers for Longer Sequencesβ634Sep 1, 2022Updated 3 years ago
- Minimal implementation of Denoised Smoothing (https://arxiv.org/abs/2003.01908) in TensorFlow.β20Aug 4, 2021Updated 4 years ago
- GC4LM: A Colossal (Biased) language model for Germanβ13May 2, 2021Updated 5 years ago
- Tutorial to pretrain & fine-tune a π€ Flax T5 model on a TPUv3-8 with GCPβ58Jul 28, 2022Updated 3 years ago
- Code for An Inductive Bias for Distances: Neural Nets that Respect the Triangle Inequality (ICLR 2020)β11Mar 24, 2023Updated 3 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI β’ AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- β21Jun 20, 2019Updated 6 years ago
- A multi-functional library for full-stack Deep Learning. Simplifies Model Building, API development, and Model Deployment.β234Apr 6, 2026Updated 2 months ago
- β24May 13, 2026Updated last month
- [WIP] Behold, semantic-search, built over sentence-transformers to make it easy for search engineers to evaluate, optimise and deploy modβ¦β15Apr 21, 2023Updated 3 years ago
- (ACL-IJCNLP 2021) Convolutions and Self-Attention: Re-interpreting Relative Positions in Pre-trained Language Models.β21Jul 13, 2022Updated 3 years ago
- β12Mar 4, 2025Updated last year
- Text Classification Dataset for Turkish Languageβ10Nov 16, 2021Updated 4 years ago
- π Evidence Retrieval and Claim Verification for the FEVER shared task using Transformer Networksβ12Feb 21, 2020Updated 6 years ago
- A cute little python module for calculating different ranking metrics. Based entirely on the gist from @bwhite: https://gist.github.com/bβ¦β21Apr 12, 2023Updated 3 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer β’ AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- A baseline system for ContractNLI (https://stanfordnlp.github.io/contract-nli/)β36Mar 10, 2023Updated 3 years ago
- A collection of scripts to preprocess ASR datasets and finetune language-specific Wav2Vec2 XLSR modelsβ30Apr 21, 2021Updated 5 years ago
- A parameter-efficient compression model architecture for a variety of NLP tasks at BERT level performance at a fraction of the computatioβ¦β10Jan 25, 2026Updated 4 months ago
- Convert Transkribus PAGE-XML to standard PAGE-XMLβ12Dec 10, 2025Updated 6 months ago
- β13Mar 27, 2020Updated 6 years ago
- β18Apr 30, 2025Updated last year
- Text summarization with python and transformerβ13Jun 17, 2023Updated 3 years ago
- Neural question generation using transformersβ1,143Apr 5, 2024Updated 2 years ago
- β10Oct 15, 2019Updated 6 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer β’ AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- several algorithms for converting dependency structures into constituency structures.β10Feb 7, 2022Updated 4 years ago
- Meta-learning learning rates with higherβ12Sep 27, 2019Updated 6 years ago
- Curated list of resources for various topics, articles, tutorials, etc I've found useful.β22Jun 29, 2022Updated 3 years ago
- GyroSPD: Vector-valued Distance and Gyrocalculus on the Space of Symmetric Positive Definite Matricesβ17Nov 8, 2021Updated 4 years ago
- texrex web page cleaning & ClaraX random walk crawlerβ11Dec 13, 2021Updated 4 years ago
- Source code for Text Infilling, implemented with Texar.β27Feb 18, 2019Updated 7 years ago
- Code and data to support the paper "PAQ 65 Million Probably-Asked Questions andWhat You Can Do With Them"β211Aug 31, 2021Updated 4 years ago
- Co:here-powered Slack App Starter Projectβ13Apr 1, 2022Updated 4 years ago
- β13Feb 12, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- code for "Natural Language to Code Translation with Execution"β41Nov 2, 2022Updated 3 years ago
- Topic clustering library built on Transformer embeddings and cosine similarity metrics.Compatible with all BERT base transformers from huβ¦β44Jun 11, 2021Updated 5 years ago
- Train small sequence models in your browser with WebGPU.β34Dec 3, 2025Updated 6 months ago
- A fast, simple, multi-threaded string interning library.β18Jul 11, 2025Updated 11 months ago
- Format conversion and graphical representation of [Universal Dependencies](http://universaldependencies.org) trees.β12Sep 3, 2024Updated last year
- Multi-hop dense retrieval for question answeringβ218Oct 12, 2021Updated 4 years ago
- β46Apr 13, 2022Updated 4 years ago