yl4467/singer

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/yl4467/singer)

yl4467 / singer

☆15

Alternatives and similar repositories for singer

Users that are interested in singer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

zhang-haojie / LetsTalk
View on GitHub
[IEEE TMM] Multimodal Diffusion Transformer with Memory Bank for Scalable Long-Duration Talking Video Generation
☆62May 8, 2026Updated 2 months ago
chenqi008 / V2C
View on GitHub
Pytorch implementation for “V2C: Visual Voice Cloning”
☆34Jan 28, 2023Updated 3 years ago
Ditzley / joint-gestures-and-face
View on GitHub
Code for the paper "Joint Co-Speech Gesture and Expressive Talking Face Generation using Diffusion with Adapters"
☆26Jan 7, 2025Updated last year
zds-potato / multilingual-phonetic-sv
View on GitHub
☆10Dec 22, 2023Updated 2 years ago
GigaAI-research / HumanDreamer-X
View on GitHub
☆23Jul 11, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
CVI-SZU / DEGSTalk
View on GitHub
[ICASSP'25] DEGSTalk: Decomposed Per-Embedding Gaussian Fields for Hair-Preserving Talking Face Synthesis
☆55Oct 25, 2025Updated 8 months ago
zhenghuatan / Audio-adversarial-examples
View on GitHub
Datasets of audio adversarial examples for deep speech recognition systems and Python code of a detection system
☆14May 6, 2023Updated 3 years ago
pratyushmaini / ssft
View on GitHub
[NeurIPS'22] Official Repository for Characterizing Datapoints via Second-Split Forgetting
☆16Aug 11, 2023Updated 2 years ago
MegEngine / awesome-megengine
View on GitHub
Awesome Resources about MegEngine
☆16Mar 2, 2023Updated 3 years ago
Guohanzhong / OSA-LCM
View on GitHub
☆25Dec 19, 2024Updated last year
KoMyeongJin / SpecDiff-GAN
View on GitHub
Adversarial Training of Denoising Diffusion Model Using Dual Discriminators for High-Fidelity Multi-Speaker TTS
☆40Aug 4, 2023Updated 2 years ago
EasyFL-AI / EasyFL
View on GitHub
An easy-to-use federated learning platform
☆26Aug 23, 2023Updated 2 years ago
MatthewTamYT / Breakout
View on GitHub
Breakout is a game created with Python 3, using the module PyGame. It is a ball game where you bounce the ball by moving the paddle. Elim…
☆18Jul 24, 2021Updated 4 years ago
aiming-lab / MJ-Video
View on GitHub
[NeurIPS'25 Spotlight] MJ-VIDEO: Fine-Grained Benchmarking and Rewarding Video Preferences in Video Generation
☆20Feb 23, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
zhliuworks / EyeLipCropper
View on GitHub
✂️ EyeLipCropper is a Python tool to crop eyes and mouth ROIs of the given video.
☆14Nov 28, 2021Updated 4 years ago
ucwxb / GraphAvatar
View on GitHub
[AAAI2025] GraphAvatar: Compact Head Avatars with GNN-Generated 3D Gaussians
☆38Apr 2, 2025Updated last year
SWJTU-3DVision / BoundaryFace
View on GitHub
☆26Feb 19, 2023Updated 3 years ago
corticph / MSTmodel
View on GitHub
Code for https://arxiv.org/abs/1712.00254
☆17Dec 6, 2017Updated 8 years ago
URRealHero / JudgeAnything
View on GitHub
☆17Jun 1, 2025Updated last year
yevvonlim / kai-presentation
View on GitHub
Claude Code skill for KAI presentation design in HTML
☆15Mar 20, 2026Updated 4 months ago
ahaliassos / usr2
View on GitHub
PyTorch implementation of USR 2.0 (ICLR 2026)
☆15Apr 3, 2026Updated 3 months ago
Intelligent-Microsystems-Lab / QuantizedSNNs
View on GitHub
This repository contains the models and training scripts used in the papers: "Quantizing Spiking Neural Networks with Integers" (ICONS 20…
☆13Oct 20, 2020Updated 5 years ago
UMass-Embodied-AGI / TalkCuts
View on GitHub
[NeurIPS 2025] TalkCuts: A Large-Scale Dataset for Multi-Shot Human Speech Video Generation
☆37Dec 14, 2025Updated 7 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
GATECH-EIC / FracTrain
View on GitHub
[NeurIPS 2020] "FracTrain: Fractionally Squeezing Bit Savings Both Temporally and Spatially for Efficient DNN Training" by Yonggan Fu, Ha…
☆10Feb 13, 2022Updated 4 years ago
yxdydgithub / difftalk_preprocess
View on GitHub
☆13May 11, 2024Updated 2 years ago
GalaxyCong / HPMDubbing_Vocoder
View on GitHub
16k Hz Vocoder (HiFiGAN Codes and Pretrained Models)
☆18Apr 3, 2023Updated 3 years ago
DiffPoseTalk / DiffPoseTalk
View on GitHub
DiffPoseTalk: Speech-Driven Stylistic 3D Facial Animation and Head Pose Generation via Diffusion Models
☆355Mar 11, 2025Updated last year
jzr99 / DNF-Avatar
View on GitHub
[ICCV 2025 Findings Oral] DNF-Avatar: Distilling Neural Fields for Real-time Animatable Avatar Relighting
☆39Nov 20, 2025Updated 8 months ago
jakc4103 / scale-adjusted-training
View on GitHub
PyTorch implementation of Towards Efficient Training for Neural Network Quantization
☆16Jan 16, 2020Updated 6 years ago
wsj-sjtu / SingingHead
View on GitHub
Official implentation of SingingHead: A Large-scale 4D Dataset for Singing Head Animation. (TMM 25)
☆65Feb 1, 2026Updated 5 months ago
xg-chu / ARTalk
View on GitHub
ARTalk generates realistic 3D head motions (lip sync, blinking, expressions, head poses) from audio in ⚡ real-time ⚡.
☆136May 19, 2026Updated 2 months ago
ZZDoog / ProDubber
View on GitHub
[CVPR 2025] Official implementation of paper "Prosody-Enhanced Acoustic Pre-training and Acoustic-Disentangled Prosody Adapting for Movie…
☆23Jun 6, 2025Updated last year
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
sungnyun / cav2vec
View on GitHub
(ICLR 2025) Multi-Task Corrupted Prediction for Learning Robust Audio-Visual Speech Representation
☆16Apr 29, 2025Updated last year
yhytoto12 / Behavior-SD
View on GitHub
Official Implementation of NAACL 2025 Paper: Behavior-SD: Behaviorally Aware Spoken Dialogue Generation with Large Language Models
☆18Apr 30, 2025Updated last year
Juzezhang / ViBES
View on GitHub
This repository contains the official implementation of "ViBES: A Conversational Agent with Behaviorally-Intelligent 3D Virtual Body".
☆33Updated this week
MinahilRaza / Brevitas_Fixed_Point
View on GitHub
Quantized Training for Convolutional Neural Networks using Xilinx Brevitas
☆12Mar 16, 2022Updated 4 years ago
Dorniwang / SpeakerVid-5M-Code
View on GitHub
The official SpeakerVid-5M data curation code.
☆82Jul 23, 2025Updated 11 months ago
lars76 / forced-alignment-chinese
View on GitHub
Mandarin Chinese audio datasets aligned with Montreal Forced Aligner
☆19Aug 13, 2024Updated last year
kaistmm / VoxMM
View on GitHub
☆23May 11, 2026Updated 2 months ago