Visualization tools for audio-only and multi-modal speaker diarization datasets
☆13Oct 27, 2023Updated 2 years ago
Alternatives and similar repositories for DiarizationVisualization
Users that are interested in DiarizationVisualization are comparing it to the libraries listed below.
- Both audio-only and audio-visual speaker diarization datasets are listed here.☆15Feb 22, 2023Updated 3 years ago
- Add an argument, label_smoothing, for torch.nn.CrossEntropyLoss()☆14Jan 13, 2021Updated 5 years ago
- Write and keep snippets for VSCode in a markdown file.☆15Jul 23, 2023Updated 2 years ago
- wenet_LLM_from_ASLP☆15Nov 26, 2024Updated last year
- [INTERSPEECH 2022] This dataset is designed for multi-modal speaker diarization and lip-speech synchronization in the wild.☆64Jan 24, 2024Updated 2 years ago
- This repository creates speaker diarization recipes to be used within the egs folder of kaldi.☆17Aug 12, 2024Updated last year
- Sudoku Solver☆16Aug 27, 2024Updated last year
- Unofficial LivePortrait Training Script [🚧 Under Construction]☆39Jan 28, 2025Updated last year
- Tower of Hanoi game: enter the number of layers, and the solution is drawn and animated automatically☆12May 18, 2017Updated 9 years ago
- ☆16Mar 23, 2021Updated 5 years ago
- Rescoring methods for end-to-end Automatic Speech Recognition☆27Sep 23, 2020Updated 5 years ago
- PHP-in-JS: a silly experiment☆15Nov 13, 2022Updated 3 years ago
- A fluent forward protocol implementation for Node.js☆12May 9, 2023Updated 3 years ago
- Application for viewing Rich Transcription Time Marked (RTTM) files in an interactive way☆48Apr 19, 2023Updated 3 years ago
- Smart, performant auditing of ActiveRecord models☆60Nov 29, 2018Updated 7 years ago
- ☆21Feb 15, 2022Updated 4 years ago
- SAM2 Track implementation with TensorRT & OnnxRuntime☆23May 23, 2025Updated 11 months ago
- ☆46Jan 22, 2024Updated 2 years ago
- The internet's fastest YouTube downloader made with FFmpeg.WASM.☆13Jul 22, 2023Updated 2 years ago
- ☆51Nov 24, 2022Updated 3 years ago
- Output image to a file, stream, canvas, console, buffer or any other destination☆17Jan 18, 2025Updated last year
- Emmet support for Atom☆10Apr 22, 2018Updated 8 years ago
- A small song with Remotion + Tune.JS☆10Feb 23, 2024Updated 2 years ago
- ☆12Jan 24, 2018Updated 8 years ago
- ☆53Jan 15, 2021Updated 5 years ago
- ☆68Sep 13, 2022Updated 3 years ago
- Clustering-based methods for overlapping diarization☆83Jan 12, 2024Updated 2 years ago
- Balanced Error Rate for Speaker Diarization☆33Feb 28, 2023Updated 3 years ago
- Preprocessing Scripts for Talking Face Generation☆97Jan 21, 2025Updated last year
- Zero-shot NER fine-tuning☆14Mar 17, 2025Updated last year
- Unsupervised Representation Learning by Invariance Propagation☆15Apr 7, 2021Updated 5 years ago
- ☆16May 13, 2026Updated last week
- Find out who said what in the video.☆145Jan 22, 2026Updated 3 months ago
- Code and datasets for the Salesforce AI Research paper on prompt leakage and multi-turn threats against LLMs☆22Nov 10, 2025Updated 6 months ago
- ☆11Dec 10, 2020Updated 5 years ago
- A beginner-friendly interface to fine-tune & run inference on open TTS models 🗣️☆29Feb 4, 2026Updated 3 months ago
- [ICASSP 2024] DiffDub: Person-generic visual dubbing using inpainting renderer with diffusion auto-encoder☆69Jul 21, 2024Updated last year
- Code release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices"☆22Jun 7, 2025Updated 11 months ago
- pytorch flow matching☆36May 11, 2025Updated last year