☆32Mar 3, 2026Updated 3 weeks ago
Alternatives and similar repositories for modal-nvidia-asr
Users that are interested in modal-nvidia-asr are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Onset-and-Offset-Aware Sound Event Detection☆21Feb 10, 2025Updated last year
- EchoX: Towards Mitigating Acoustic-Semantic Gap via Echo Training for Speech-to-Speech LLMs☆47Sep 19, 2025Updated 6 months ago
- Bitcoin utilities and protocol library for interacting with the network☆15Oct 27, 2025Updated 5 months ago
- ICASSP2026 HumDial Challenge☆37Dec 13, 2025Updated 3 months ago
- ☆18May 27, 2025Updated 10 months ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆51Mar 20, 2026Updated last week
- ☆11Jan 20, 2025Updated last year
- ☆10Nov 14, 2025Updated 4 months ago
- ☆34Feb 26, 2026Updated last month
- ☆12Nov 12, 2024Updated last year
- [ICASSP'24] Emphasized Non-Target Speaker Knowledge in Knowledge Distillation for Speaker Verification☆16Mar 20, 2024Updated 2 years ago
- ☆15Apr 11, 2025Updated 11 months ago
- [ASRU 2025] Omni-R1: Do You Really Need Audio to Fine-Tune Your Audio LLM?☆44Nov 21, 2025Updated 4 months ago
- ☆11May 18, 2025Updated 10 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆22Dec 3, 2025Updated 3 months ago
- Bundle Pino logger runtime dependencies with Bun☆12Feb 27, 2025Updated last year
- Source code to accompany research paper on training multi token prediction language models using self-distillation.☆27Feb 21, 2026Updated last month
- 🏆 The 1st Place Solution for AICity2022 Challenge Track2: Natural Language-Based Vehicle Retrieval.☆12Jul 25, 2022Updated 3 years ago
- Production-tested commands, skills, and workflow patterns for Claude Code. Developed through 6+ months of daily use. Includes explore→pla…☆49Jan 14, 2026Updated 2 months ago
- Generate music videos starring yourself.☆11Apr 3, 2025Updated 11 months ago
- Hybrid-Anchor Rotation Detector for Oriented Object Detection (ICCV'25-SEA)☆16Aug 11, 2025Updated 7 months ago
- An elegant way to send message between Swift and WKWebView☆11Jun 26, 2025Updated 9 months ago
- A multi-agent AI chat running on Convex☆18Apr 4, 2025Updated 11 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Transfer learning approach to pronunciation scoring☆12Jan 17, 2024Updated 2 years ago
- Introduction about Open AI's latest release of real-time voice streaming. It demonstrates how to implement human-computer conversation us…☆25Oct 14, 2024Updated last year
- ☆35Updated this week
- ☆14Jan 5, 2025Updated last year
- official implementation of paper ExPO: Explainable Phonetic Trait-Oriented Network for Speaker Verification☆14Mar 14, 2025Updated last year
- ☆16Oct 26, 2025Updated 5 months ago
- A Medical / Clinical Note Taking Demo Application using Deepgram Voice Agent API☆14Jul 9, 2025Updated 8 months ago
- An AWS S3 file manager. It supports keyword search, upload, preview video and archive files into a zip then download it.☆11Mar 20, 2023Updated 3 years ago
- Calibration and Depth Map Generation for Active and Passive Stereo Vision Systems.☆17Jan 30, 2026Updated last month
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- ☆19Jun 9, 2023Updated 2 years ago
- 내 손 안의 작은 수어 통역가, HandTalker 👋🏻☆10Nov 15, 2023Updated 2 years ago
- A simplified carrier board for RPI CM4 module featuring reduced footprint.☆11Aug 3, 2021Updated 4 years ago
- This project showcases how to use fal's queue management system and proxy setup to create animated videos from static images.☆18Dec 9, 2025Updated 3 months ago
- MichiAI: A Low Latency, Full Duplex Speech LLM with zero coherence loss☆87Feb 6, 2026Updated last month
- [AAAI 2025] Empowering LLMs with Pseudo-Untrimmed Videos for Audio-Visual Temporal Understanding☆34Mar 21, 2025Updated last year
- This project is a proof-of-concept, trying to show surveillance of roads for the safety of motorcycle and bicycle riders can be done with…☆30Feb 7, 2021Updated 5 years ago