aiola-lab / whisper-nerLinks
Official implementation of "WhisperNER: Unified Open Named Entity and Speech Recognition"
☆191Updated 4 months ago
Alternatives and similar repositories for whisper-ner
Users that are interested in whisper-ner are comparing it to the libraries listed below
Sorting:
- ☆205Updated last year
- Joint speech-language model - respond directly to audio!☆370Updated 11 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆95Updated last year
- LlamaVoice is a llama-based large voice generation model, providing inference and training ability.☆233Updated 10 months ago
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.☆52Updated 6 months ago
- Mistral7B playing DOOM☆132Updated 11 months ago
- Action library for AI Agent☆216Updated 2 months ago
- An API to transcribe audio with OpenAI's Whisper Large v3!☆292Updated 7 months ago
- Whisper realtime streaming for long speech-to-text transcription and translation☆119Updated last year
- Open Audio Watermarking Tool☆209Updated last month
- Speaker Diarization with Transformers☆68Updated 2 weeks ago
- VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration☆174Updated 2 months ago
- Collection of Open Source Speech Data☆159Updated 7 months ago
- An implementation of the Nvidia's Parakeet models for Apple Silicon using MLX.☆296Updated last week
- LLaVA server (llama.cpp).☆180Updated last year
- Whisper with Medusa heads☆843Updated 3 weeks ago
- Applying the ideas of Deepseek R1 to computer use☆214Updated 4 months ago
- ☆754Updated 2 months ago
- Solving data for LLMs - Create quality synthetic datasets!☆149Updated 5 months ago
- ☆158Updated 2 years ago
- Omni SenseVoice: High-Speed Speech Recognition with words timestamps 🗣️🎯☆852Updated 3 months ago
- ☆238Updated 2 months ago
- Run GGML models with Kubernetes.☆173Updated last year
- Scripts to create your own moe models using mlx☆90Updated last year
- Open source inference code for Rev's model☆407Updated 2 months ago
- This project collects GPU benchmarks from various cloud providers and compares them to fixed per token costs. Use our tool for efficient …☆221Updated 6 months ago
- ☆296Updated last year
- A ggml (C++) re-implementation of tortoise-tts☆187Updated 10 months ago
- Joint speech-language model - respond directly to audio!☆30Updated last year
- ☆365Updated 9 months ago