Google's BigBird (Jax/Flax & PyTorch) @ 🤗 Transformers
★49 · Mar 20, 2023 · Updated 3 years ago
Alternatives and similar repositories for bigbird
Users who are interested in bigbird are comparing it to the libraries listed below.
- Transformers for Longer Sequences. ★633 · Sep 1, 2022 · Updated 3 years ago
- ★13 · May 30, 2022 · Updated 3 years ago
- This repository hosts the code to port NumPy model weights of BiT-ResNets to TensorFlow SavedModel format. ★14 · Dec 21, 2021 · Updated 4 years ago
- This repository hosts my experiments for the project I did with OffNote Labs. ★10 · Apr 12, 2021 · Updated 5 years ago
- This repository hosts code for converting the original MLP Mixer models (JAX) to TensorFlow. ★15 · Sep 29, 2021 · Updated 4 years ago
- Code for An Inductive Bias for Distances: Neural Nets that Respect the Triangle Inequality (ICLR 2020). ★11 · Mar 24, 2023 · Updated 3 years ago
- ★21 · Jun 20, 2019 · Updated 6 years ago
- A text truncation method, useful for instance in long text classification. ★22 · Jun 22, 2022 · Updated 3 years ago
- [WIP] Behold, semantic-search, built over sentence-transformers to make it easy for search engineers to evaluate, optimise and deploy mod… ★15 · Apr 21, 2023 · Updated 3 years ago
- (ACL-IJCNLP 2021) Convolutions and Self-Attention: Re-interpreting Relative Positions in Pre-trained Language Models. ★21 · Jul 13, 2022 · Updated 3 years ago
- Multi-task modelling extensions for huggingface transformers. ★21 · Mar 3, 2023 · Updated 3 years ago
- Coding utilities for quantitative legal studies. ★14 · Dec 7, 2025 · Updated 5 months ago
- ★12 · Mar 4, 2025 · Updated last year
- [ICLR 2023] "Sparsity May Cry: Let Us Fail (Current) Sparse Neural Networks Together!" Shiwei Liu, Tianlong Chen, Zhenyu Zhang, Xuxi Chen… ★28 · Aug 29, 2023 · Updated 2 years ago
- PyTorch Implementation for CS229 Course Project - "Grammatical Error Correction using Neural Networks". ★10 · Dec 16, 2017 · Updated 8 years ago
- Supporting code for "LLMs for your iPhone: Whole-Tensor 4 Bit Quantization". ★11 · Mar 31, 2024 · Updated 2 years ago
- ★15 · Nov 22, 2023 · Updated 2 years ago
- [ICML2023] Instant Soup: Cheap Pruning Ensembles in A Single Pass Can Draw Lottery Tickets from Large Models. Ajay Jaiswal, Shiwei Liu, Ti… ★11 · Nov 28, 2023 · Updated 2 years ago
- Convert Transkribus PAGE-XML to standard PAGE-XML. ★12 · Dec 10, 2025 · Updated 5 months ago
- ★13 · Mar 27, 2020 · Updated 6 years ago
- ★13 · Nov 27, 2025 · Updated 6 months ago
- ★17 · Apr 30, 2025 · Updated last year
- Text summarization with Python and transformer. ★13 · Jun 17, 2023 · Updated 2 years ago
- ★10 · Oct 15, 2019 · Updated 6 years ago
- Meta-learning learning rates with higher. ★12 · Sep 27, 2019 · Updated 6 years ago
- This is a TypeScript-based MCP server that implements a simple loom and makes it available for Claude to use. ★23 · Feb 17, 2026 · Updated 3 months ago
- Code and data to support the paper "PAQ: 65 Million Probably-Asked Questions and What You Can Do With Them". ★211 · Aug 31, 2021 · Updated 4 years ago
- Survey on Machine Reading Comprehension. ★147 · Jan 26, 2021 · Updated 5 years ago
- ★13 · Feb 12, 2023 · Updated 3 years ago
- Code for "Natural Language to Code Translation with Execution". ★41 · Nov 2, 2022 · Updated 3 years ago
- Topic clustering library built on Transformer embeddings and cosine similarity metrics. Compatible with all BERT base transformers from hu… ★44 · Jun 11, 2021 · Updated 4 years ago
- Train small sequence models in your browser with WebGPU. ★34 · Dec 3, 2025 · Updated 5 months ago
- A fast, simple, multi-threaded string interning library. ★18 · Jul 11, 2025 · Updated 10 months ago
- Course materials for Spring 2021 ETH Course, "Sequencing Legal DNA: NLP for Law and Political Economy". ★14 · Aug 18, 2021 · Updated 4 years ago
- Format conversion and graphical representation of [Universal Dependencies](http://universaldependencies.org) trees. ★12 · Sep 3, 2024 · Updated last year
- Multi-hop dense retrieval for question answering. ★218 · Oct 12, 2021 · Updated 4 years ago
- ★46 · Apr 13, 2022 · Updated 4 years ago
- BERT for Evidence Retrieval and Claim Verification. ★35 · Jun 2, 2020 · Updated 5 years ago
- Code to remove "noise" from hOCR output of Tesseract OCR. ★14 · Oct 24, 2016 · Updated 9 years ago