SeanSdahl / PytorchDataloaderForTensorflow
This repository defines a Python class that loads data for the tf.keras.Model.fit_generator function by using a torch.utils.data.DataLoader object for image data. This way, transforms on the input image data can be applied using the PyTorch library while the data is still used to fit a tf.keras.Model.
☆11Updated last year
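The idea in the description can be sketched as a small generator adapter. This is a minimal illustration and not the repository's actual class: the function name `keras_batches`, the `channels_last` flag, and the batch shapes are assumptions made for the example, and a plain list of NumPy arrays stands in for a real `torch.utils.data.DataLoader` so the snippet runs without PyTorch or TensorFlow installed.

```python
import numpy as np


def keras_batches(loader, channels_last=True):
    """Yield (images, labels) NumPy batches from a PyTorch-style loader, forever.

    `loader` is any re-iterable object yielding (images, labels) pairs, for
    example a torch.utils.data.DataLoader (hypothetical usage, not tested here).
    Keras expects a training generator to loop indefinitely, so the loader is
    restarted after each full pass.
    """
    while True:
        for images, labels in loader:
            # CPU torch tensors expose .numpy(); other inputs go through np.asarray.
            images = images.numpy() if hasattr(images, "numpy") else np.asarray(images)
            labels = labels.numpy() if hasattr(labels, "numpy") else np.asarray(labels)
            if channels_last and images.ndim == 4:
                # PyTorch image batches are NCHW; Keras defaults to NHWC.
                images = np.transpose(images, (0, 2, 3, 1))
            yield images, labels


# Stand-in for a DataLoader: two batches of four 3x8x8 images (assumed shapes).
fake_loader = [
    (np.zeros((4, 3, 8, 8), dtype=np.float32), np.arange(4)) for _ in range(2)
]
gen = keras_batches(fake_loader)
first_images, first_labels = next(gen)
```

With a real model, a generator like this would typically be passed to `model.fit(gen, steps_per_epoch=len(loader), ...)`; recent TensorFlow 2 releases deprecate `fit_generator` in favor of `Model.fit`, which accepts generators directly.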
Alternatives and similar repositories for PytorchDataloaderForTensorflow
Users that are interested in PytorchDataloaderForTensorflow are comparing it to the libraries listed below
- ☆12Updated 6 years ago
- This sample includes a simple CNN classifier for music and an audio-folder dataloader, just like ImageFolder in torchvision.☆11Updated 7 years ago
- Library of TensorFlow layers for audio data processing and data augmentation☆20Updated 3 years ago
- ☆15Updated 6 years ago
- Researchers who published code, models (in some cases), and demo apps (in few cases) along with their SOTA paper☆12Updated 2 years ago
- Speech in Flax/JAX☆15Updated 3 years ago
- Enhancement of Audio Quality (Bit-Depth and Sampling-Rate) using Deep Learning.☆33Updated 5 years ago
- Easily turn large sets of audio urls to an audio dataset.☆21Updated 3 years ago
- A tool for assignment to a slice in TensorFlow☆20Updated 4 years ago
- Transcribing audio files using Hugging Face's implementation of Wav2Vec2 + "chain-linking" NLP tasks to combine speech-to-text with downs…☆32Updated 4 years ago
- Code base for WaveTransformer: A novel architecture for automated audio captioning☆44Updated 4 years ago
- docker for HF wav2vec2-sprint☆13Updated 4 years ago
- This repository hosts code for converting the original MLP Mixer models (JAX) to TensorFlow.☆15Updated 4 years ago
- Repository for fine-tuning Transformers 🤗 based seq2seq speech models in JAX/Flax.☆38Updated 2 years ago
- Collection of research papers on cough classification☆40Updated 5 years ago
- ☆76Updated 4 years ago
- A PyTorch Lightning Callback for pushing models to the Hugging Face Hub 🤗⚡️☆35Updated 3 years ago
- Code for our ACML and INTERSPEECH papers: "Speaker Diarization as a Fully Online Bandit Learning Problem in MiniVox".☆29Updated 4 years ago
- This repo contains the code for "Voice Disorder Analysis: A Transformer-based Approach", accepted at Interspeech 2024☆15Updated last year
- Discrete wavelet transform layers with fixed and trainable wavelets☆22Updated 3 years ago
- Tensorflow Audio Classification Models☆13Updated 2 years ago
- Contrastive Language-Audio Pretraining☆15Updated 4 years ago
- MaSS - Multilingual corpus of Sentence-aligned Spoken utterances☆50Updated last year
- REPeating Pattern Extraction Technique (REPET) in Python for audio source separation: original REPET, REPET extended, adaptive REPET, REP…☆33Updated last year
- Audio processing using deep neural networks. Speaker identification using voice embeddings.☆13Updated 3 years ago
- Zero-shot Audio Classification using Whisper☆79Updated 3 years ago
- Collaborative audio annotation tool☆18Updated 3 years ago
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages☆13Updated 3 years ago
- ☆18Updated 4 years ago
- This is an intuitive explanation of Representation Learning with Contrastive Predictive Coding using code provided by jefflai108 that use…☆10Updated 5 years ago