aiola-lab / whisper-nerLinks
Official implementation of "WhisperNER: Unified Open Named Entity and Speech Recognition"
☆189Updated 3 months ago
Alternatives and similar repositories for whisper-ner
Users that are interested in whisper-ner are comparing it to the libraries listed below
Sorting:
- Whisper with Medusa heads☆838Updated last week
- Joint speech-language model - respond directly to audio!☆369Updated 11 months ago
- Action library for AI Agent☆214Updated 2 months ago
- VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration☆170Updated last month
- Mistral7B playing DOOM☆132Updated 10 months ago
- ☆204Updated last year
- Collection of Open Source Speech Data☆157Updated 6 months ago
- ☆740Updated last month
- Hierarchical topic segmentation of meeting transcripts using embeddings and divisive clustering.☆52Updated 10 months ago
- Build Secure and Compliant AI agents and MCP Servers. YC W23☆140Updated this week
- whisper.cpp bindings for python☆96Updated last year
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.☆52Updated 5 months ago
- Scripts to create your own moe models using mlx☆89Updated last year
- ☆157Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆94Updated last year
- Radient turns many data types (not just text) into vectors for similarity search, RAG, regression analysis, and more.☆275Updated last week
- Lightweight Nearest Neighbors with Flexible Backends☆283Updated last week
- run paligemma in real time☆131Updated last year
- LLaVA server (llama.cpp).☆179Updated last year
- A ggml (C++) re-implementation of tortoise-tts☆184Updated 9 months ago
- Video+code lecture on building nanoGPT from scratch☆67Updated 11 months ago
- Joint speech-language model - respond directly to audio!☆30Updated last year
- Pivotal Token Search☆97Updated 3 weeks ago
- Efficient vector database for hundred millions of embeddings.☆206Updated last year
- ☆89Updated 8 months ago
- Examples for Cerebrium Serverless GPUs☆487Updated 2 weeks ago
- LlamaVoice is a llama-based large voice generation model, providing inference and training ability.☆233Updated 9 months ago
- Dead Simple LLM Abliteration☆218Updated 3 months ago
- ☆257Updated last year
- GRDN.AI app for garden optimization☆70Updated last year