Google's BigBird (Jax/Flax & PyTorch) @ 🤗 Transformers
★49 · Mar 20, 2023 · Updated 2 years ago
Alternatives and similar repositories for bigbird
Users who are interested in bigbird compare it to the libraries listed below.
- Minimal implementation of Denoised Smoothing (https://arxiv.org/abs/2003.01908) in TensorFlow. ★20 · Aug 4, 2021 · Updated 4 years ago
- Transformers for Longer Sequences ★631 · Sep 1, 2022 · Updated 3 years ago
- This repository hosts the code to port NumPy model weights of BiT-ResNets to TensorFlow SavedModel format. ★14 · Dec 21, 2021 · Updated 4 years ago
- GC4LM: A Colossal (Biased) language model for German ★13 · May 2, 2021 · Updated 4 years ago
- This repository hosts code for converting the original MLP Mixer models (JAX) to TensorFlow. ★15 · Sep 29, 2021 · Updated 4 years ago
- [WIP] Behold, semantic-search, built over sentence-transformers to make it easy for search engineers to evaluate, optimise and deploy mod… ★15 · Apr 21, 2023 · Updated 2 years ago
- (ACL-IJCNLP 2021) Convolutions and Self-Attention: Re-interpreting Relative Positions in Pre-trained Language Models. ★21 · Jul 13, 2022 · Updated 3 years ago
- A cute little python module for calculating different ranking metrics. Based entirely on the gist from @bwhite: https://gist.github.com/b… ★21 · Apr 12, 2023 · Updated 2 years ago
- Official implementation of NeurIPS'21: Implicit SVD for Graph Representation Learning ★21 · Nov 4, 2021 · Updated 4 years ago
- Curated list of resources for various topics, articles, tutorials, etc I've found useful. ★21 · Jun 29, 2022 · Updated 3 years ago
- ★24 · May 31, 2025 · Updated 9 months ago
- ★29 · Nov 30, 2021 · Updated 4 years ago
- [ICLR 2023] "Sparsity May Cry: Let Us Fail (Current) Sparse Neural Networks Together!" Shiwei Liu, Tianlong Chen, Zhenyu Zhang, Xuxi Chen… ★28 · Aug 29, 2023 · Updated 2 years ago
- RoBERTa Marathi Language model trained from scratch during huggingface 🤗 x flax community week ★28 · Jul 18, 2021 · Updated 4 years ago
- ★17 · Feb 21, 2026 · Updated 2 weeks ago
- Addressing the problem of predicting crime occurrence based on historic records ★11 · Nov 27, 2019 · Updated 6 years ago
- A collection of scripts to preprocess ASR datasets and finetune language-specific Wav2Vec2 XLSR models ★30 · Apr 21, 2021 · Updated 4 years ago
- Neural question generation using transformers ★1,143 · Apr 5, 2024 · Updated last year
- code for "Natural Language to Code Translation with Execution" ★41 · Nov 2, 2022 · Updated 3 years ago
- This is the implementation of the visual model mentioned in our paper 'Automated Radiology Report Generation using Conditioned Transforme… ★10 · Jul 25, 2024 · Updated last year
- Implementation of calibrated precision and calibrated metrics ★14 · Apr 23, 2020 · Updated 5 years ago
- The purpose of this repository is for devs and non devs to carry out tests on the precompiled botanix artifacts. It contains an easy rpc … ★13 · Feb 23, 2026 · Updated 2 weeks ago
- Multi-hop dense retrieval for question answering ★219 · Oct 12, 2021 · Updated 4 years ago
- A baseline system for ContractNLI (https://stanfordnlp.github.io/contract-nli/) ★36 · Mar 10, 2023 · Updated 2 years ago
- This repository reproduces the results in the paper "How expressive are transformers in spectral domain for graphs?" (published in TMLR) ★12 · Jul 10, 2022 · Updated 3 years ago
- Newspaper Segmentation into images and text ★12 · Jan 11, 2019 · Updated 7 years ago
- Mitigating the Filter Bubble while Maintaining Relevance: Targeted Diversification with VAE-based Recommender Systems ★10 · Mar 15, 2023 · Updated 2 years ago
- Convert Transkribus PAGE-XML to standard PAGE-XML ★12 · Dec 10, 2025 · Updated 2 months ago
- TensorFlow 2 / Lite implementation of Ultra-Fast Structure-Aware Lane Detection ★12 · Aug 19, 2020 · Updated 5 years ago
- GPT-5 and Opus 4.1 implementations of one-shot coding examples ★17 · Feb 6, 2026 · Updated last month
- ★13 · Jun 18, 2025 · Updated 8 months ago
- "SCONE: A Novel Stochastic Sampling to Generate Contrastive Views and Hard Negative Samples for Recommendation", WSDM 2025 ★15 · Nov 25, 2025 · Updated 3 months ago
- Official repository for 'Risk of Bias in Chest Radiography Deep Learning Foundation Models' ★12 · Sep 27, 2023 · Updated 2 years ago
- ★11 · Jan 7, 2025 · Updated last year
- Supporting code for "LLMs for your iPhone: Whole-Tensor 4 Bit Quantization" ★11 · Mar 31, 2024 · Updated last year
- ★11 · Jul 20, 2021 · Updated 4 years ago
- Implementation of data dimensionality reduction algorithms SVD and CUR without using library functions. ★10 · Jul 24, 2017 · Updated 8 years ago
- Reinforcement Learning Recommender System suggesting relevant scientific services to appropriate researchers ★11 · Aug 29, 2024 · Updated last year
- Graphical intuition to MOSFET square-law ★12 · Jan 5, 2021 · Updated 5 years ago