Application for viewing Rich Transcription Time Marked (RTTM) files in an interactive way
☆48Apr 19, 2023Updated 3 years ago
Alternatives and similar repositories for rttm-viewer
Users that are interested in rttm-viewer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- MicRank is a Learning to Rank neural channel selection framework where a DNN is trained to rank microphone channels.☆22Apr 8, 2021Updated 5 years ago
- Multipurpose Multi Speaker Mixture Signal Generator☆46Feb 6, 2025Updated last year
- ☆95Apr 24, 2025Updated last year
- Automatic Speech Recognition (ASR) system for the Samrómur speech corpus using Kaldi☆12Sep 30, 2022Updated 3 years ago
- ☆32Mar 11, 2022Updated 4 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Permutation invariant training in PyTorch☆13Oct 2, 2020Updated 5 years ago
- Clustering-based methods for overlapping diarization☆83Jan 12, 2024Updated 2 years ago
- 🎹 pyannote + 🗒 notebook = pyannotebook☆26Jun 12, 2023Updated 2 years ago
- Python package for combining diarization system outputs.☆94Oct 12, 2023Updated 2 years ago
- ☆15Jul 11, 2022Updated 3 years ago
- Spot the conversation: speaker diarisation in the wild☆164Jul 26, 2022Updated 3 years ago
- ☆37Mar 30, 2021Updated 5 years ago
- TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings☆40Oct 27, 2025Updated 6 months ago
- ☆46Jan 22, 2024Updated 2 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Visualization tools for audio-only and multi-modal speaker diarization dataset☆13Oct 27, 2023Updated 2 years ago
- A pytorch implementation of the paper "ANSD-MA-MSE: Adaptive Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding"☆62Sep 19, 2024Updated last year
- Simple Python package for fast DER computation☆35Jun 29, 2023Updated 2 years ago
- This repository creates speaker diarization recipes to be used within the egs folder of kaldi.☆17Aug 12, 2024Updated last year
- A python package to build AI-powered real-time audio applications☆1,974Feb 12, 2025Updated last year
- FEERCI: A Package for Fast non-parametric confidence intervals for Equal Error Rates☆12Mar 13, 2024Updated 2 years ago
- ☆11Nov 5, 2021Updated 4 years ago
- ☆41Oct 14, 2022Updated 3 years ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆155May 2, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Variational Bayes HMM over x-vectors diarization☆287Jan 15, 2024Updated 2 years ago
- Code for Speaker Change Detection in Broadcast TV using Bidirectional Long Short-Term Memory Networks☆66Jul 14, 2020Updated 5 years ago
- Properly handle position-dependent phones in a subword lexicon FST☆31Oct 26, 2020Updated 5 years ago
- This repo is for the SPL paper "Auto-Tuning Spectral Clustering for Speaker Diarization Using Normalized Maximum Eigengap"☆125Apr 8, 2022Updated 4 years ago
- Automatically setup the AISHELL-4 and MSDWild dataset for usage with pyannote-database (and pyannote-audio)☆15Oct 22, 2025Updated 6 months ago
- ☆54Oct 17, 2023Updated 2 years ago
- A JAX library for building lattice-based speech transducer models☆49Mar 2, 2026Updated 2 months ago
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.☆94Oct 18, 2023Updated 2 years ago
- PyTorch implementation of PLDA as described in https://ravisoji.com/assets/papers/ioffe2006probabilistic.pdf☆15Oct 16, 2020Updated 5 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- neural network based speaker embedder☆25Jan 7, 2023Updated 3 years ago
- An implementation of frequency-invariant beamformer☆14Sep 3, 2021Updated 4 years ago
- The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based …☆178Updated this week
- The project is associated with the recently-launched ICASSP 2022 Multi-channel Multi-party Meeting Transcription Challenge (M2MeT) to pro…☆138Jun 10, 2022Updated 3 years ago
- Write and keep snippets for VSCode in a markdown file.☆15Jul 23, 2023Updated 2 years ago
- Code release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices"☆22Jun 7, 2025Updated 11 months ago
- A toolkit for reproducible evaluation, diagnostic, and error analysis of speaker diarization systems☆248Updated this week