Official implementation of "WhisperNER: Unified Open Named Entity and Speech Recognition"
☆200Feb 25, 2025Updated last year
Alternatives and similar repositories for whisper-ner
Users that are interested in whisper-ner are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Whisper with Medusa heads☆862Aug 6, 2025Updated 8 months ago
- Zero-shot Domain-sensitive Speech Recognition with Prompt-conditioning Fine-tuning (ASRU2023)☆27Oct 10, 2023Updated 2 years ago
- Code for ICASSP 2024 Paper: RECAP: Retrieval-Augmented Audio Captioning☆15Jun 23, 2024Updated last year
- ☆19Jun 3, 2024Updated last year
- Linux & Powershell scripts to easily set up and run the Qwen 3.5 series locally on Windows and Linux with llama.cpp.☆50Mar 30, 2026Updated last week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Code for the paper: How Much Context Does My Attention-Based ASR System Need?☆11Mar 8, 2026Updated last month
- My first ever training of a piper tts voice☆16May 23, 2025Updated 10 months ago
- A fork of Lyra (version 1) that supports a webassembly build. See https://github.com/mayitayew/soundstream-wasm for a more recent version…☆25Jul 19, 2022Updated 3 years ago
- Drax: Speech Recognition with Discrete Flow Matching☆75Oct 15, 2025Updated 5 months ago
- Task-based Agentic Framework using StrictJSON as the core☆462Feb 15, 2026Updated last month
- Text Normalization utilities for normalizing text for TTS☆21Mar 4, 2026Updated last month
- Fully neural approach for text chunking☆406Oct 23, 2025Updated 5 months ago
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification☆11Aug 12, 2023Updated 2 years ago
- A graph data model of the human skeleton☆32Oct 6, 2021Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Faster Learned Sparse Retrieval with Block-Max Pruning. ACM SIGIR 2024.☆35Jan 14, 2026Updated 2 months ago
- Things you can do with the token embeddings of an LLM☆1,454Dec 1, 2025Updated 4 months ago
- [Interspeech 2024] Enhancing Dysarthric Speech Recognition for Unseen Speakers via Prototype-Based Adaptation☆13Nov 28, 2024Updated last year
- Python tools for WhisperKit: Model conversion, optimization and evaluation☆242Mar 25, 2026Updated 2 weeks ago
- Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.☆4,064Jan 8, 2025Updated last year
- MeetEval - A meeting transcription evaluation toolkit☆151Jan 27, 2026Updated 2 months ago
- Transcribing Speech with Multinomial Diffusion, training code and models.☆80Sep 27, 2023Updated 2 years ago
- first base model for full-duplex conversational audio☆1,784Jan 5, 2025Updated last year
- ☆37May 20, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- LangChain + LiteLLM that works☆48Sep 1, 2025Updated 7 months ago
- Inference code for the paper "Spirit-LM Interleaved Spoken and Written Language Model".☆929Oct 28, 2024Updated last year
- ☆87Jul 31, 2025Updated 8 months ago
- ☆43Nov 10, 2025Updated 4 months ago
- The creative suite for character-driven AI experiences.☆191Sep 6, 2024Updated last year
- open source audio and video transcription software☆486Feb 18, 2026Updated last month
- A comprehensive suite of tools, built to liberate science by making the creation, evaluation, and dissemination of research more transpar…☆245Aug 8, 2025Updated 8 months ago
- An Open Source text-to-speech system built by inverting Whisper.☆4,583Dec 14, 2025Updated 3 months ago
- Local realtime voice AI☆2,477Nov 26, 2025Updated 4 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Python module that creates a context map for AI code generation☆29Aug 14, 2024Updated last year
- Fast and accurate automatic speech recognition (ASR) for edge devices☆7,625Updated this week
- INTERSPEECH 23 - Refunction Whisper to recognize new tasks with adapters!☆42Sep 11, 2023Updated 2 years ago
- Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts) @ NAACL 2024☆3,029Mar 31, 2026Updated last week
- An AI memory layer with short- and long-term storage, semantic clustering, and optional memory decay for context-aware applications.☆685Mar 18, 2026Updated 3 weeks ago
- Code snippets and reproductions from JustAByte☆28Jan 25, 2026Updated 2 months ago
- Promting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation☆150Jan 16, 2024Updated 2 years ago