Official implementation of "WhisperNER: Unified Open Named Entity and Speech Recognition"
☆200Feb 25, 2025Updated last year
Alternatives and similar repositories for whisper-ner
Users that are interested in whisper-ner are comparing it to the libraries listed below
Sorting:
- ☆15Jan 25, 2026Updated last month
- Multi-task model for named-entity recognition, relation extraction, entity mention detection and coreference resolution.☆46Jun 26, 2024Updated last year
- Whisper with Medusa heads☆864Aug 6, 2025Updated 7 months ago
- Zero-shot Domain-sensitive Speech Recognition with Prompt-conditioning Fine-tuning (ASRU2023)☆27Oct 10, 2023Updated 2 years ago
- A curated list of awesome papers on contextualizing E2E ASR outputs☆80May 10, 2023Updated 2 years ago
- Code for ICASSP 2024 Paper: RECAP: Retrieval-Augmented Audio Captioning☆15Jun 23, 2024Updated last year
- 🍒 Dynamically inline assets into the DOM using Fetch Injection. Mirror of Fetch Inject on Codeberg.☆13May 26, 2024Updated last year
- ☆19Jun 3, 2024Updated last year
- GP-Tree: A Gaussian Process Classifier for Few-Shot Incremental Learning☆34May 12, 2022Updated 3 years ago
- Linux & Powershell scripts to easily set up and run the Qwen 3.5 series locally on Windows and Linux with llama.cpp.☆49Mar 2, 2026Updated 2 weeks ago
- A P2P blog and P2P Chat with no signalling server. Nothin' but RTC!☆16Nov 17, 2023Updated 2 years ago
- ☆12Feb 5, 2026Updated last month
- My first ever training of a piper tts voice☆16May 23, 2025Updated 9 months ago
- Code for the paper: How Much Context Does My Attention-Based ASR System Need?☆10Mar 8, 2026Updated last week
- A fork of Lyra (version 1) that supports a webassembly build. See https://github.com/mayitayew/soundstream-wasm for a more recent version…☆25Jul 19, 2022Updated 3 years ago
- Drax: Speech Recognition with Discrete Flow Matching☆75Oct 15, 2025Updated 5 months ago
- Task-based Agentic Framework using StrictJSON as the core☆461Feb 15, 2026Updated last month
- PANiC - PAraphrasing Noun-Compounds☆15Apr 6, 2018Updated 7 years ago
- Bulk unsubscribe from emails in your gmail account☆25Jun 15, 2024Updated last year
- Text Normalization utilities for normalizing text for TTS☆21Mar 4, 2026Updated 2 weeks ago
- Fully neural approach for text chunking☆407Oct 23, 2025Updated 4 months ago
- ☆25Dec 14, 2025Updated 3 months ago
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification☆11Aug 12, 2023Updated 2 years ago
- A graph data model of the human skeleton☆32Oct 6, 2021Updated 4 years ago
- Faster Learned Sparse Retrieval with Block-Max Pruning. ACM SIGIR 2024.☆35Jan 14, 2026Updated 2 months ago
- Things you can do with the token embeddings of an LLM☆1,453Dec 1, 2025Updated 3 months ago
- Detecting Facial Landmarks on 3D Models Based on Geometric Properties☆18Apr 29, 2024Updated last year
- Todo App API built using Node.JS with Express, Mongoose, Passport & Async\Await syntax. Android app: https://github.com/stavelmashally/an…☆10Jan 29, 2018Updated 8 years ago
- Wikimedia Enterprise - client SDK in Python☆20Nov 11, 2025Updated 4 months ago
- Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.☆4,055Jan 8, 2025Updated last year
- MeetEval - A meeting transcription evaluation toolkit☆147Jan 27, 2026Updated last month
- first base model for full-duplex conversational audio☆1,785Jan 5, 2025Updated last year
- ☆43Nov 10, 2025Updated 4 months ago
- ☆37May 20, 2022Updated 3 years ago
- [EMNLP Main '25] LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation☆148May 18, 2025Updated 10 months ago
- LangChain + LiteLLM that works☆50Sep 1, 2025Updated 6 months ago
- Inference code for the paper "Spirit-LM Interleaved Spoken and Written Language Model".☆929Oct 28, 2024Updated last year
- ☆86Jul 31, 2025Updated 7 months ago
- Code snippets and reproductions from JustAByte☆25Jan 25, 2026Updated last month