Grouping and Recognize speaker from an animation video. 从动漫中提取每一个说话人。
☆13May 8, 2024Updated last year
Alternatives and similar repositories for Speaker-Grouping
Users that are interested in Speaker-Grouping are comparing it to the libraries listed below
Sorting:
- CoV: Chain-of-View Prompting for Spatial Reasoning☆51Jan 23, 2026Updated last month
- ☆18Aug 1, 2025Updated 7 months ago
- Presets for FxSound equalizer software for boosting sound quality, volume, and bass☆11Nov 19, 2025Updated 3 months ago
- Modern normalizing flows in Python. Simple to use and easily extensible.☆12Feb 11, 2026Updated 3 weeks ago
- 3D Editing via Propagation of Image Prompts to Multi-View☆18Nov 30, 2025Updated 3 months ago
- OCRVerse: Towards Holistic OCR in End-to-End Vision-Language Models☆29Feb 4, 2026Updated last month
- Azure Machine Learning - MLOps Python SDKv2☆10Jul 24, 2023Updated 2 years ago
- ☆11Jan 3, 2026Updated 2 months ago
- A Bevy Procedural Slum Generator☆22Dec 15, 2025Updated 2 months ago
- OTP generation & validation library for Rust☆14Dec 4, 2025Updated 3 months ago
- Wrapper integrating aria2 (https://aria2.github.io/) into portage's FETCHCOMMAND for faster downloads (Python)☆10Updated this week
- A scalable data preprocessing framework built on PySpark for LLM training☆23Dec 9, 2025Updated 3 months ago
- Mihomo任意文件写,可通过写SSH密钥、cron任务等实现RCE☆13May 21, 2025Updated 9 months ago
- ☆18May 15, 2025Updated 9 months ago
- [ICLR 2026] RefAny3D: 3D Asset-Referenced Diffusion Models for Image Generation☆30Feb 5, 2026Updated last month
- 初音天气 / Miku Weather for Windows☆11Jul 7, 2024Updated last year
- The official implementation of paper "ColorFlow: Retrieval-Augmented Image Sequence Colorization"☆10Dec 24, 2024Updated last year
- My templates used in OI. All C++.☆11Jul 17, 2018Updated 7 years ago
- A lightweight, production-ready C++ library for LLM tokenization, fully compatible with HuggingFace tokenizer.json.☆24Jan 4, 2026Updated 2 months ago
- [CVPR 2024 Highlight] Scaffold-GS: Structured 3D Gaussians for View-Adaptive Rendering☆10Jul 29, 2024Updated last year
- [SIGGRAPH Asia 2025] "ASIA: Adaptive 3D Segmentation using Few Image Annotations ".☆23Feb 14, 2026Updated 3 weeks ago
- [ICML 2025] LaCache: Ladder-Shaped KV Caching for Efficient Long-Context Modeling of Large Language Models☆17Nov 4, 2025Updated 4 months ago
- Creating Your Divine Agent 😇☆10Jan 26, 2026Updated last month
- A Flexible Cache Architectural Simulator☆17Sep 16, 2025Updated 5 months ago
- ✨✨VITA: Towards Open-Source Interactive Omni Multimodal LLM☆11Jun 16, 2025Updated 8 months ago
- Color detection, Contour mapping, Detecting holes, Motion detection☆10Mar 20, 2014Updated 11 years ago
- Various test models in WNNX format. It can view with `pip install wnetron && wnetron`☆12Jun 22, 2022Updated 3 years ago
- NikSne's NixOS dotfiles☆10Feb 24, 2026Updated last week
- A tool to bridge the gap between asdf and Nix users.☆15Sep 6, 2025Updated 6 months ago
- pre-commit hooks for running JuliaFormatter.jl☆12Nov 30, 2025Updated 3 months ago
- ☆13Nov 12, 2018Updated 7 years ago
- Complete setup guide of Google Photos on WSA (unlimited backup in original quality, moving to another drive, etc).☆14Mar 21, 2025Updated 11 months ago
- ☆14Apr 3, 2024Updated last year
- ☆20Jun 3, 2025Updated 9 months ago
- ☆17Apr 17, 2025Updated 10 months ago
- EfficientSAM + YOLO World base model for use with Autodistill.☆10Feb 21, 2024Updated 2 years ago
- A plugin for bevy that allows you to easily load config files☆16Sep 4, 2024Updated last year
- [ACMMM 2024] An Inverse Partial Optimal Transport Framework for Music-guided Movie Trailer Generation☆16Mar 15, 2025Updated 11 months ago
- ☆11Jan 8, 2025Updated last year