quinte22 / bumblebee
bumble bee transformer
☆14 Updated 3 years ago
Alternatives and similar repositories for bumblebee:
Users who are interested in bumblebee are comparing it to the repositories listed below.
- PyTorch implementation of GLOM ☆21 Updated 2 years ago
- Anonymous ICLR Submission ☆14 Updated 5 years ago
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages ☆13 Updated 2 years ago
- Contrastive Language-Audio Pretraining ☆15 Updated 3 years ago
- Code for TokenManipulationGAN ☆7 Updated 4 years ago
- Enable RNNLM lattice rescoring with PyTorch [kaldi] ☆12 Updated 4 years ago
- TensorFlow Implementation of "Theory and Experiments on Vector Quantized Autoencoders" ☆14 Updated 5 years ago
- Local Attention - Flax module for Jax ☆20 Updated 3 years ago
- High performance PyTorch modules ☆18 Updated 2 years ago
- NLG Best Practices for Data-Efficient Modeling: How to Train Production-Ready Models with Little Data ☆10 Updated 3 years ago
- Code for the paper "InteL-VAEs: Adding Inductive Biases to Variational Auto-Encoders via Intermediary Latents" ☆19 Updated 3 years ago
- GASP! Dataset - Generating Abstracts of Scientific Papers from Abstracts of Cited Papers ☆9 Updated 4 years ago
- Combining encoder-based language models ☆11 Updated 3 years ago
- Implementation for NATv2. ☆23 Updated 4 years ago
- A simple implementation of a deep linear PyTorch module ☆19 Updated 4 years ago
- My explorations into editing the knowledge and memories of an attention network ☆34 Updated 2 years ago
- Implementation of N-Grammer, augmenting Transformers with latent n-grams, in PyTorch ☆72 Updated 2 years ago
- Repository with illustrations for cft-contest-2018 ☆12 Updated 6 years ago
- Siamese network for unsupervised speech representation learning ☆11 Updated 6 years ago
- Code for "MIM: Mutual Information Machine" paper. ☆16 Updated 2 years ago
- Online (real-time) decoder to be used with DeepSpeech2 model ☆24 Updated 4 years ago
- PyTorch Implementations of Various Vector Quantization Methods ☆27 Updated 3 years ago
- Official Implementation of "Transferring Inductive Biases Through Knowledge Distillation" ☆14 Updated 4 years ago
- Fine-Tuning Pre-trained Transformers into Decaying Fast Weights ☆19 Updated 2 years ago
- Usable implementation of Mogrifier, a circuit for enhancing LSTMs and potentially other networks, from DeepMind ☆17 Updated 8 months ago
- The History of Speech Recognition to the Year 2030 ☆12 Updated 3 years ago
- Source code for ACL 2020 paper "Learning Spoken Language Representations with Neural Lattice Language Modeling" ☆18 Updated 2 years ago