aiola-lab / whisper-nerLinks
Official implementation of "WhisperNER: Unified Open Named Entity and Speech Recognition"
☆195Updated 5 months ago
Alternatives and similar repositories for whisper-ner
Users that are interested in whisper-ner are comparing it to the libraries listed below
Sorting:
- Joint speech-language model - respond directly to audio!☆370Updated last year
- Build Secure and Compliant AI agents and MCP Servers. YC W23☆147Updated 2 months ago
- ☆205Updated last year
- Whisper with Medusa heads☆850Updated this week
- Mistral7B playing DOOM☆133Updated last year
- 🐮📢 The first AI voice assistant that interrupts *you*☆149Updated 11 months ago
- Whisper realtime streaming for long speech-to-text transcription and translation☆120Updated last year
- On-device streaming text-to-speech engine powered by deep learning☆102Updated 2 weeks ago
- This project collects GPU benchmarks from various cloud providers and compares them to fixed per token costs. Use our tool for efficient …☆221Updated 7 months ago
- An API to transcribe audio with OpenAI's Whisper Large v3!☆296Updated 8 months ago
- Lightweight Nearest Neighbors with Flexible Backends☆296Updated 3 weeks ago
- Hierarchical topic segmentation of meeting transcripts using embeddings and divisive clustering.☆53Updated last year
- Applying the ideas of Deepseek R1 to computer use☆216Updated 6 months ago
- Radient turns many data types (not just text) into vectors for similarity search, RAG, regression analysis, and more.☆279Updated last week
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆96Updated last year
- Live-bending a foundation model’s output at neural network level.☆266Updated 4 months ago
- Fast Streaming TTS with Orpheus + WebRTC (with FastRTC)☆302Updated 3 months ago
- LLaVA server (llama.cpp).☆181Updated last year
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.☆53Updated 7 months ago
- Action library for AI Agent☆222Updated 4 months ago
- ☆102Updated 11 months ago
- The library for character-driven AI experiences.☆88Updated last year
- Hallucinations (Confabulations) Document-Based Benchmark for RAG. Includes human-verified questions and answers.☆197Updated this week
- Visualize the intermediate output of Mistral 7B☆367Updated 6 months ago
- ☆158Updated 2 years ago
- ☆116Updated 6 months ago
- ASR + diarization model server with speculative decoding☆62Updated last year
- The creative suite for character-driven AI experiences.☆186Updated 11 months ago
- ☆292Updated 4 months ago
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch☆101Updated 7 months ago