thetobysiu / DeepstoryView external linksLinks
Deepstory turns a text/generated text into a video where the character is animated to speak your story using his/her voice.
☆102Nov 23, 2025Updated 2 months ago
Alternatives and similar repositories for Deepstory
Users that are interested in Deepstory are comparing it to the libraries listed below
Sorting:
- DoyenTalker uses deep learning techniques to generate personalized avatar videos that speak user-provided text in a specified voice. The …☆14Sep 20, 2024Updated last year
- Speech to Facial Animation using GANs☆40Nov 3, 2021Updated 4 years ago
- SadTalker gradio_demo.py file with code section that allows you to set the eye blink and pose reference videos for the software to use wh…☆11Jun 20, 2023Updated 2 years ago
- 📺 NeuralAtlases - A Pytorch implementation of the paper "Layered Neural Atlases for Consistent Video Editing" (https://arxiv.org/abs/210…☆34Feb 17, 2023Updated 2 years ago
- An implementation of the Wav2Letter Speech-to-Text model using PyTorch.☆14Mar 8, 2023Updated 2 years ago
- Self-supervised neural network for music recommendations.☆18Jul 6, 2023Updated 2 years ago
- Fast and differentiable hidden Markov model in C++☆19Jan 20, 2023Updated 3 years ago
- ☆17Mar 23, 2025Updated 10 months ago
- ☆18Jan 18, 2024Updated 2 years ago
- Implementation of Denoising Diffusion Probabilistic Model in Pytorch☆18Jun 20, 2023Updated 2 years ago
- Using a single image and just 10 seconds of sample audio, our project enables you to create a video where it appears as if you're speakin…☆40Sep 13, 2023Updated 2 years ago
- Invite people to Google drive without your inconvenience☆15Sep 3, 2022Updated 3 years ago
- Unofficial Zippyshare CLI tools (download and search)☆14Apr 24, 2023Updated 2 years ago
- ☆43Jan 5, 2024Updated 2 years ago
- Avatar Generation For Characters and Game Assets Using Deep Fakes☆232Aug 18, 2024Updated last year
- Word2Wave: a framework for generating short audio samples from a text prompt using WaveGAN and COALA.☆119Dec 13, 2021Updated 4 years ago
- A simple voice conversion tool☆19Mar 10, 2022Updated 3 years ago
- Faster Talking Face Animation on Xeon CPU☆130Nov 14, 2023Updated 2 years ago
- Pytorch implementation of the TecoGan video super resolution model.☆17Mar 1, 2022Updated 3 years ago
- This repository is the implementation of the paper, "Score-balanced Loss for Multi-aspect Pronunciation Assessment" (Interspeech 2023).☆22Apr 29, 2024Updated last year
- Code for the Interspeech 2023 paper "A Joint Model for Pronunciation Assessment and Mispronunciation Detection and Diagnosis with Multi-t…☆25Nov 9, 2023Updated 2 years ago
- ☆22Feb 22, 2024Updated last year
- City of Light (COL) is a geospatially faithful, Unity-based digital twin of Paris enabling high-performance embodied simulation for AI an…☆43Feb 2, 2026Updated 2 weeks ago
- [ECCV 2022] PadInv: High-fidelity GAN Inversion with Padding Space☆87Dec 17, 2022Updated 3 years ago
- Neural Network modelling of guitar amplifiers and audio effects.☆28Jun 6, 2021Updated 4 years ago
- Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-gramma…☆21Jan 24, 2022Updated 4 years ago
- Mispronunciation Detection using a pretrained and finetuned wav2vec2 model for phoneme recognition and diagnosis and feedback using large…☆48May 6, 2024Updated last year
- Recognize speech from an audio file and convert it into animation FBX☆24Mar 7, 2022Updated 3 years ago
- PyTorch implementation for NED (CVPR 2022). It can be used to manipulate the facial emotions of actors in videos based on emotion labels …☆160Oct 6, 2022Updated 3 years ago
- A collection of utilities for handling IPA phones.☆26Sep 24, 2023Updated 2 years ago
- ☆10Feb 23, 2023Updated 2 years ago
- Vecna is a Python chatbot which recommends songs and movies depending upon your feelings☆11Jun 28, 2022Updated 3 years ago
- The model implementations for T5 encoder decoder soft prompt tuning for text generation.☆25Dec 5, 2022Updated 3 years ago
- A neural network based file sorter. Trains an autoencoder to sort images or audio based on the similarity of their encodings, or uses the…☆31Jun 24, 2023Updated 2 years ago
- SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields extraction, tokenization, prompting, batchi…☆35May 24, 2024Updated last year
- Official repo of Text-Free Learning of a Natural Language Interface for Pretrained Face Generators☆66Dec 13, 2023Updated 2 years ago
- Orchestrating AI for stunning lip-synced videos. Effortless workflow, exceptional results, all in one place.☆75Jun 19, 2025Updated 7 months ago
- Code and data for Teddy https://arxiv.org/abs/2001.05171.☆15Jun 21, 2022Updated 3 years ago
- ☆32Oct 28, 2023Updated 2 years ago