aiola-lab / whisper-ner
Official implementation of "WhisperNER: Unified Open Named Entity and Speech Recognition"
☆189Updated 2 months ago
Alternatives and similar repositories for whisper-ner
Users that are interested in whisper-ner are comparing it to the libraries listed below
Sorting:
- Joint speech-language model - respond directly to audio!☆369Updated 10 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆94Updated last year
- Action library for AI Agent☆214Updated last month
- 🐮📢 The first AI voice assistant that interrupts *you*☆145Updated 8 months ago
- Joint speech-language model - respond directly to audio!☆30Updated last year
- Generate ideal question-answers for testing RAG☆126Updated 2 months ago
- Radient turns many data types (not just text) into vectors for similarity search, RAG, regression analysis, and more.☆275Updated 2 months ago
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.☆55Updated last month
- Whisper with Medusa heads☆833Updated 2 weeks ago
- RAG Logger is an open-source logging tool designed specifically for Retrieval-Augmented Generation (RAG) applications. It serves as a lig…☆222Updated 4 months ago
- Hierarchical topic segmentation of meeting transcripts using embeddings and divisive clustering.☆53Updated 9 months ago
- Applying the ideas of Deepseek R1 to computer use☆213Updated 3 months ago
- ☆721Updated 3 weeks ago
- Fully neural approach for text chunking☆347Updated 2 weeks ago
- Collection of Open Source Speech Data☆157Updated 6 months ago
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.☆52Updated 5 months ago
- 💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.☆210Updated 6 months ago
- Speaker Diarization with Transformers☆64Updated 11 months ago
- ☆204Updated 11 months ago
- LlamaVoice is a llama-based large voice generation model, providing inference and training ability.☆233Updated 8 months ago
- Detect whether or not an audio file was generated by NotebookLM☆137Updated 5 months ago
- VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration☆166Updated 3 weeks ago
- Mistral7B playing DOOM☆131Updated 10 months ago
- Scripts to create your own moe models using mlx☆89Updated last year
- ☆101Updated 8 months ago
- ai for jq☆240Updated 7 months ago
- ☆156Updated last year
- Lightweight Nearest Neighbors with Flexible Backends☆273Updated 2 months ago
- This project collects GPU benchmarks from various cloud providers and compares them to fixed per token costs. Use our tool for efficient …☆221Updated 5 months ago
- Fast Streaming TTS with Orpheus + WebRTC (with FastRTC)☆274Updated last month