Vvkmnn / voiceAILinks
π Speech transcription and synthesis via Keras and Tensorflow.
β13Updated 7 years ago
Alternatives and similar repositories for voiceAI
Users that are interested in voiceAI are comparing it to the libraries listed below
Sorting:
- An AI-aided image segmentation ML-Module for Heartexlab/Label-Studio. Easy to deploy. Great to use.β11Updated 4 years ago
- Web app for keyword spotting using TensorflowJSβ72Updated 2 years ago
- How to run GPU accelerated Signal Processing in TensorFlowβ23Updated 6 years ago
- Web-based tool for straight-forward class annotation of audio filesβ11Updated 4 years ago
- Trains a convolutional autoencoder on Mel Spectrogram images for a list of songs, then displays the encoded latent features using t-SNE.β21Updated 8 years ago
- An end-to-end system which makes use of a recurrent encoder-decoder deep neural network to translate speech from the Hindi (Fourth most sβ¦β18Updated 5 years ago
- Audio Classification using Image Classificationβ48Updated 5 years ago
- A repo with scripts to test and play around with Facebook's recent llama models! π€β28Updated last year
- Real-time speech to text with specific language translation.β49Updated 4 years ago
- A PyTorch demo of the paper Voice Separation with an Unknown Number of Multiple Speakers using gradio and Nvidia NEMO ASR model.β36Updated last year
- Packages tfjs models for shipping with websitesβ14Updated 5 years ago
- Project to learn about speech recognition - both Speaker Diarization and other Speech Recognition applications.β48Updated 8 years ago
- generate & query embeddings from VTT files using openai & pinecone on Andrej Karpathy's's latest GPT tutorialβ19Updated last year
- Classification of WAV files from cats and dogsβ22Updated 7 years ago
- Audio processing using deep neural networks. Speaker identification using voice embeddings.β13Updated 2 years ago
- Text-to-Speech Synthesis by Generating Spectrograms using Generative Adversarial Networkβ10Updated 6 years ago
- This is the code and introduction for how to apply simple deep learning method on background removal.β33Updated 5 years ago
- Dockerfile for audiogrep and pocketsphinxβ12Updated 8 years ago
- Code for running a Magic card image generator APIβ17Updated 5 years ago
- gentle forced alignerβ11Updated last year
- Tutorial to run TensorFlow 2 on mobile devices: Android, iOS and Browserβ30Updated 2 years ago
- Simple text to phonemes converter for multiple languagesβ20Updated 2 years ago
- The library is useful for analyzing the emotions present in any audio file(call/music/recordings) into three classes namely positive, negβ¦β32Updated 8 years ago
- On-device voice activity detection (VAD) powered by deep learningβ219Updated this week
- Spatializing audio in 3D for immersive music experiences.β22Updated 2 years ago
- Generate embedding vectors from audio filesβ59Updated last month
- Real-time human emotion detection and analysis through voice and speech pattern processingβ28Updated 6 years ago
- Speech to Text with Hugging Face and Wav2vecΒ 2.0β35Updated 4 years ago
- Clone a voice in 5 seconds to generate arbitrary speech in real-timeβ53Updated 5 years ago
- Create modular, cross-browser, web audio pipelines to record and process audio in background threads. Comes with modules for VAD, ASR, reβ¦β47Updated 2 years ago