☆557May 20, 2026Updated this week
Alternatives and similar repositories for nemotron-january-2026
Users that are interested in nemotron-january-2026 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- EchoX: Towards Mitigating Acoustic-Semantic Gap via Echo Training for Speech-to-Speech LLMs☆47Sep 19, 2025Updated 8 months ago
- ☆37Mar 31, 2026Updated last month
- A low-level Pipecat debugger.☆105May 19, 2026Updated last week
- A highly compressive and high-quality neural audio codec for speech models.☆266Jan 23, 2026Updated 4 months ago
- Soprano: Instant, Ultra-Realistic Text-to-Speech☆1,231Jan 15, 2026Updated 4 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆1,395Jan 29, 2026Updated 3 months ago
- A TTS that fits in your CPU (and pocket)☆4,487Updated this week
- ☆13Oct 14, 2024Updated last year
- Onset-and-Offset-Aware Sound Event Detection☆21Feb 10, 2025Updated last year
- ☆313Jan 2, 2026Updated 4 months ago
- LongCat Audio Tokenizer and Detokenizer☆301May 9, 2026Updated 2 weeks ago
- PersonaPlex code.☆9,891Mar 2, 2026Updated 2 months ago
- A high quality and fast TTS repository☆511Dec 22, 2025Updated 5 months ago
- SillyInnkeeper is an open-source local character card manager for SillyTavern. It scans PNG cards, extracts metadata, generates previews,…☆51Jan 13, 2026Updated 4 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Attempt at cog wrapper for a SDXL CLIP Interrogator☆10May 16, 2024Updated 2 years ago
- ☆18Apr 30, 2025Updated last year
- Safely push a Cog model version by making sure it works and is backwards-compatible with previous versions.☆16Dec 4, 2025Updated 5 months ago
- The official repository TimeAudio, a comprehensive framework that incorporates fine-grained acoustic cues into LALMs with enhanced module…☆28Nov 18, 2025Updated 6 months ago
- AI narrator☆15Nov 24, 2023Updated 2 years ago
- Running LLMs against a sandbox airport to see if they can make the correct decisions in real time☆26Jul 22, 2025Updated 10 months ago
- ☆357Aug 28, 2025Updated 8 months ago
- ☆12Dec 11, 2024Updated last year
- This repository is dedicated to maintaining, updating, fixing bugs and keeping up to date my inpainting ComfyUI workflow, previously host…☆71Nov 4, 2025Updated 6 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- 📸 https://HeadShots.fun is an open-source SaaS platform that uses Replicate AI models and Stripe for payment processing.☆126May 10, 2025Updated last year
- [ACL 2026 Findings] CoV: Chain-of-View Prompting for Spatial Reasoning☆61Apr 7, 2026Updated last month
- Attempt at cog wrapper for nightmareai/real-esrgan for larger images☆16Sep 28, 2023Updated 2 years ago
- Cog wrapper for Controlnet QR Code Monster v2 For Realistic Vision v5.1☆18Sep 24, 2023Updated 2 years ago
- AndroidSubSystem4GNU/Linux☆44Dec 30, 2025Updated 4 months ago
- SDK for Daily's Video Component System (VCS)☆31May 18, 2026Updated last week
- ☆29Nov 4, 2025Updated 6 months ago
- ☆21May 7, 2026Updated 2 weeks ago
- Fun-Audio-Chat is a Large Audio Language Model built for natural, low-latency voice interactions.☆946Feb 27, 2026Updated 3 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆58Updated this week
- ☆24Feb 1, 2025Updated last year
- ☆45Aug 17, 2024Updated last year
- [KDD 2026] Voxlect: A Speech Foundation Model Benchmark for Modeling Dialects and Regional Languages Around the Globe☆32Aug 10, 2025Updated 9 months ago
- A real-time software for turn-taking, backchannel, and head-nodding prediction☆97May 20, 2026Updated last week
- An instruct text-to-speech solution based on LLaSA and CosyVoice2 developed by the ASLP lab and collaborators.☆243Feb 26, 2026Updated 3 months ago
- ☆13May 7, 2026Updated 2 weeks ago