Coder-jzq/ICASSP2025-IIICSS

☆11

Alternatives and similar repositories for ICASSP2025-IIICSS

Users who are interested in ICASSP2025-IIICSS are comparing it to the libraries listed below.

Coder-jzq / RADKA-CSS
☆17 · Mar 25, 2025 · Updated last year
walker-hyf / FCTalker
FCTalker: Fine and Coarse Grained Context Modeling for Expressive Conversational Speech Synthesis (Accepted by ISCSLP'2024)
☆26 · Feb 22, 2024 · Updated 2 years ago
GalaxyCong / HPMDubbing_Vocoder
16k Hz Vocoder (HiFiGAN Codes and Pretrained Models)
☆18 · Apr 3, 2023 · Updated 3 years ago
walker-hyf / NCSSD
Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024)
☆61 · Nov 1, 2024 · Updated last year
I2-Multimedia-Lab / UGRAN
[TIP2025] The implementation of "Uncertainty Guided Refinement for Fine-grained Salient Object Detection"
☆18 · Apr 20, 2025 · Updated last year
walker-hyf / ECSS
Emotion Rendering for Conversational Speech Synthesis with Heterogeneous Graph-Based Context Modeling (Accepted by AAAI'2024)
☆59 · Jun 20, 2024 · Updated 2 years ago
Jonathan-Greve / Xmake-template
Xmake C++23 project template. Using C++ modules, GitHub workflows for CI/CD (Windows and Ubuntu) and gtest for testing. Compiles with bot…
☆17 · Mar 11, 2024 · Updated 2 years ago
dragazo / rustex
Rust-style mutex type for C++
☆17 · Jan 12, 2024 · Updated 2 years ago
GMY628 / RIS-Fuse
☆19 · Dec 22, 2025 · Updated 7 months ago
AI-S2-Lab / MEIJU2025-baseline
☆24 · Oct 23, 2024 · Updated last year
SpeechEE / SpeechEE
☆11 · Aug 20, 2025 · Updated 11 months ago
lizhuangzi / SPU
Code for Semantic Point Cloud Upsampling (SPU), published in IEEE Transactions on Multimedia.
☆23 · Apr 23, 2022 · Updated 4 years ago
AntXinyuan / SSP
Semantic-decoupled Spatial Partition Guided Point-supervised Oriented Object Detection
☆13 · Jul 7, 2026 · Updated 2 weeks ago
Vincentqyw / light-field-TB
This is a simple test for light field ToolBox.
☆27 · Oct 15, 2017 · Updated 8 years ago
nlp-waseda / traveling-across-languages
Official repo and evaluation implementation of KnowRecall and VisRecall
☆10 · May 22, 2025 · Updated last year
Sreyan88 / RECAP
Code for ICASSP 2024 Paper: RECAP: Retrieval-Augmented Audio Captioning
☆16 · Jun 23, 2024 · Updated 2 years ago
lhmouse / poseidon
The Poseidon Server Framework
☆20 · Jul 16, 2026 · Updated last week
SJTU-IPADS / fisslock
A fast and scalable distributed lock service using programmable switches.
☆21 · Jul 30, 2024 · Updated last year
TsingZ0 / FedKTL
CVPR 2024 accepted paper, An Upload-Efficient Scheme for Transferring Knowledge From a Server-Side Pre-trained Generator to Clients in He…
☆68 · Mar 12, 2025 · Updated last year
ThugKd / ChattingRoom
Java chat room
☆30 · Jul 30, 2016 · Updated 9 years ago
Dhravya / DeepDubber
Dubs the video in another language, powered by Deepgram API and Google Translate.
☆16 · Apr 11, 2022 · Updated 4 years ago
SMILELab-FL / FedPETuning
☆71 · Jun 2, 2023 · Updated 3 years ago
FudanCVL / AVI-Bench
[ICML'26] Toward Human-like Audio-Visual Intelligence of Omni-MLLMs
☆16 · Jun 20, 2026 · Updated last month
JiazuoYu / Fines
Code for the paper "FineRS: Fine-grained Reasoning and Segmentation of Small Objects with Reinforcement Learning" (NeurIPS 2025).
☆15 · Jan 29, 2026 · Updated 5 months ago
google-deepmind / vocap
☆17 · Sep 5, 2025 · Updated 10 months ago
jaypipes / sqltoast
A SQL parser written in C++
☆32 · Oct 22, 2021 · Updated 4 years ago
Jiaxin-Ye / Emo-DNA
[ACM MM 2023] Official PyTorch implementation of "Emo-DNA: Emotion Decoupling and Alignment Learning for Cross-Corpus Speech Emotion Reco…
☆12 · Aug 4, 2023 · Updated 2 years ago
ryogrid / create_pg_super_document
create_pg_super_document is a project that generates documentation for all symbols in the PostgreSQL codebase, then utilizes these symbol…
☆31 · May 9, 2026 · Updated 2 months ago
ShiQiu0419 / pnp-3d
PnP-3D: A Plug-and-Play for 3D Point Clouds (TPAMI 2021)
☆45 · Feb 14, 2022 · Updated 4 years ago
aidayang / FunASR-OneClick
Real-time speech recognition edition of FunASR; recognizes sound from the microphone and sound played on the computer; voice-typing software for PC
☆19 · Sep 12, 2025 · Updated 10 months ago
ZZDoog / ProDubber
[CVPR 2025] Official implementation of paper "Prosody-Enhanced Acoustic Pre-training and Acoustic-Disentangled Prosody Adapting for Movie…
☆23 · Jun 6, 2025 · Updated last year
wonjune-kang / llm-speech-summarization
Prompting Large Language Models with Audio for General-Purpose Speech Summarization
☆20 · May 14, 2025 · Updated last year
Ryan-rsm-McKenzie / binary_io
A binary i/o library for C++, without the agonizing pain
☆36 · Aug 3, 2023 · Updated 2 years ago
VIPL-Audio-Visual-Speech-Understanding / deep-face-speechreading
Visual speech recognition with face inputs: code and models for F&G 2020 paper "Can We Read Speech Beyond the Lips? Rethinking RoI Select…
☆19 · Apr 12, 2021 · Updated 5 years ago
GSW-D / PCDreamerCode
Code for the paper PCDreamer
☆46 · Oct 15, 2025 · Updated 9 months ago
Fischer-Tom / unified-detection-and-pose-estimation
☆15 · Apr 9, 2026 · Updated 3 months ago
VisualAIKHU / Missing-AVQA
Official Repository for "Learning Trimodal Relation for Audio-Visual Question Answering with Missing Modality" (ECCV 2024)
☆16 · Oct 29, 2024 · Updated last year
DreamMr / EST
Expression Snippet Transformer for Robust Video-based Facial Expression Recognition
☆17 · Jan 27, 2024 · Updated 2 years ago
Haiyang0226 / Symphony
Code for the CVPR'26 paper Symphony
☆17 · Apr 7, 2026 · Updated 3 months ago