gunchagarg / differential-learning-rate-keras
Implementation of Differential Learning Rate in Keras
☆11Updated 5 years ago
Alternatives and similar repositories for differential-learning-rate-keras:
Users that are interested in differential-learning-rate-keras are comparing it to the libraries listed below
- Minimalistic TensorFlow2+ deep metric/similarity learning library with loss functions, miners, and utils as embedding projector.☆37Updated 2 years ago
- Exploring learning rates to improve model performance☆19Updated 5 years ago
- Radam+lookahead implemented by tensorflow☆11Updated 5 years ago
- Source code for "Training Generative Adversarial Networks Via Turing Test".☆13Updated 4 years ago
- Pytorch Code for S2IGAN☆41Updated 4 years ago
- ☆24Updated 3 years ago
- Collection of models and extensions for deployment in PyTorch☆24Updated 2 years ago
- bumble bee transformer☆14Updated 3 years ago
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages☆13Updated 2 years ago
- ☆26Updated 5 years ago
- Contrastive Language-Audio Pretraining☆15Updated 3 years ago
- Implementation of COCO-LM, Correcting and Contrasting Text Sequences for Language Model Pretraining, in Pytorch☆45Updated 4 years ago
- TorchServe+Streamlit for easily serving your HuggingFace NER models☆32Updated 2 years ago
- Large Scale BERT Distillation☆32Updated 2 years ago
- Converts TensorFlow checkpoints (with index, meta and data files) to PyTorch, HDF5 and JSON☆18Updated 4 years ago
- Implementation of Multistream Transformers in Pytorch☆53Updated 3 years ago
- ASR project with pytorch-lightning☆20Updated this week
- NMT model with BERT in tensorflow 2.0☆20Updated 5 years ago
- Comprehensive Python library for speech and voice.☆32Updated 2 years ago
- sequence tagging for NER for ULMFiT☆20Updated 4 years ago
- Lambda Networks implemented in PyTorch☆13Updated 4 years ago
- Enable RNNLM lattice rescoring with Pytorch [kaldi]☆12Updated 4 years ago
- ☆21Updated 5 years ago
- Speeech Recognition for Indic languages.☆12Updated 3 years ago
- Local Attention - Flax module for Jax☆20Updated 3 years ago
- A convolution-free, transformer-only version of the CycleGAN framework☆33Updated 3 years ago
- Adaptive embedding and softmax☆17Updated 3 years ago
- ☆44Updated 3 years ago
- Implementation of the DocLLM paper for Llama models.☆12Updated 4 months ago
- Online (real-time) decoder to be used with DeepSpeech2 model☆24Updated 5 years ago