LC1332 / Speaker-GroupingView external linksLinks
Grouping and Recognize speaker from an animation video. 从动漫中提取每一个说话人。
☆13May 8, 2024Updated last year
Alternatives and similar repositories for Speaker-Grouping
Users that are interested in Speaker-Grouping are comparing it to the libraries listed below
Sorting:
- CoV: Chain-of-View Prompting for Spatial Reasoning☆50Jan 23, 2026Updated 3 weeks ago
- Just for debug☆56Feb 15, 2024Updated 2 years ago
- Based on the Evol-character framework and OpenAI API, enabling fine-grained role-playing data generation 🎭🧩.☆31Feb 1, 2024Updated 2 years ago
- Presets for FxSound equalizer software for boosting sound quality, volume, and bass☆10Nov 19, 2025Updated 2 months ago
- ☆17Aug 1, 2025Updated 6 months ago
- Official code for the paper: InCharacter: Evaluating Personality Fidelity in Role-Playing Agents through Psychological Interviews (previo…☆91May 27, 2025Updated 8 months ago
- 3D Editing via Propagation of Image Prompts to Multi-View☆19Nov 30, 2025Updated 2 months ago
- ☆14Jul 26, 2025Updated 6 months ago
- Azure Machine Learning - MLOps Python SDKv2☆10Jul 24, 2023Updated 2 years ago
- ☆17Aug 5, 2025Updated 6 months ago
- Modern normalizing flows in Python. Simple to use and easily extensible.☆11Updated this week
- bad apple but its cargo compile output☆21Jan 4, 2026Updated last month
- OCRVerse: Towards Holistic OCR in End-to-End Vision-Language Models☆26Feb 4, 2026Updated last week
- A Bevy Procedural Slum Generator☆22Dec 15, 2025Updated 2 months ago
- Wrapper integrating aria2 (https://aria2.github.io/) into portage's FETCHCOMMAND for faster downloads (Python)☆10Updated this week
- Various test models in WNNX format. It can view with `pip install wnetron && wnetron`☆12Jun 22, 2022Updated 3 years ago
- ✨✨VITA: Towards Open-Source Interactive Omni Multimodal LLM☆11Jun 16, 2025Updated 8 months ago
- .CUR and .ANI cursor file support for Bevy☆14Feb 4, 2026Updated last week
- Creating Your Divine Agent 😇☆10Jan 26, 2026Updated 3 weeks ago
- A scalable data preprocessing framework built on PySpark for LLM training☆21Dec 9, 2025Updated 2 months ago
- 初音天气 / Miku Weather for Windows☆11Jul 7, 2024Updated last year
- [SIGGRAPH Asia 2025] "ASIA: Adaptive 3D Segmentation using Few Image Annotations ".☆22Jan 23, 2026Updated 3 weeks ago
- A simple bot tracking git commits/PRs/issues/branches☆12Feb 7, 2026Updated last week
- ☆11Jan 3, 2026Updated last month
- ☆17May 15, 2025Updated 9 months ago
- Rust based state and configuration using simple macros. Optionally Axum API, openapi with utoipa, and UI components.☆25Jan 19, 2026Updated 3 weeks ago
- Vector math and other CUDA helper functions for OptiX kernels☆10Oct 21, 2024Updated last year
- C# Wrapper for Speex Codec. Speex 编解码器的 C# 包装.☆11Oct 4, 2024Updated last year
- A Flexible Cache Architectural Simulator☆16Sep 16, 2025Updated 5 months ago
- [CVPR 2024 Highlight] Scaffold-GS: Structured 3D Gaussians for View-Adaptive Rendering☆10Jul 29, 2024Updated last year
- Complete setup guide of Google Photos on WSA (unlimited backup in original quality, moving to another drive, etc).☆12Mar 21, 2025Updated 10 months ago
- OTP generation & validation library for Rust☆14Dec 4, 2025Updated 2 months ago
- A lightweight, production-ready C++ library for LLM tokenization, fully compatible with HuggingFace tokenizer.json.☆22Jan 4, 2026Updated last month
- Mihomo任意文件写,可通过写SSH密钥、cron任务等实现RCE☆13May 21, 2025Updated 8 months ago
- Simple, reliable, and efficient distributed task queue in Rust☆35Feb 1, 2026Updated 2 weeks ago
- [ICML 2025] LaCache: Ladder-Shaped KV Caching for Efficient Long-Context Modeling of Large Language Models☆17Nov 4, 2025Updated 3 months ago
- Color detection, Contour mapping, Detecting holes, Motion detection☆10Mar 20, 2014Updated 11 years ago
- Regain device access if denied/disabled by other programs (esp. device control programs, ransomware)☆12Dec 13, 2019Updated 6 years ago
- Python solutions to coding questions in Leetcode☆13Sep 12, 2020Updated 5 years ago