Emotions recognition from audio and text files (only russian language)
☆82Jun 23, 2025Updated last year
Alternatives and similar repositories for Aniemore
Users that are interested in Aniemore are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆146May 21, 2025Updated last year
- whisper.cpp HTTP transcription server with OpenAI-like API in Docker☆32Apr 5, 2026Updated 2 months ago
- 🎵 muse: Music Separation☆11Feb 14, 2024Updated 2 years ago
- System for automatic pronominal resolution for Russian☆13Apr 3, 2020Updated 6 years ago
- Fast Russian Text normalization for TTS using only RegEx.☆32Jun 17, 2026Updated last week
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Russian phonetical transcription☆11May 20, 2026Updated last month
- Neural model for prediction of stress position in Russian words☆13Jun 22, 2025Updated last year
- Russian speech technology links☆400Mar 17, 2026Updated 3 months ago
- ☆13Aug 7, 2021Updated 4 years ago
- T5-based (russian) text normalization☆27Jan 25, 2024Updated 2 years ago
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax☆16Jun 16, 2024Updated 2 years ago
- Text to Speech with PyTorch (English and Mongolian)☆13May 3, 2020Updated 6 years ago
- ⚡ Blazing fast audio augmentation in Python, powered by GPU for high-efficiency processing in machine learning and audio analysis tasks.☆37May 8, 2026Updated last month
- ☆14Nov 22, 2022Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Foundational Model for Speech Recognition Tasks☆627Jun 18, 2026Updated last week
- BLSP-Emo: Towards Empathetic Large Speech-Language Models☆61Jun 7, 2024Updated 2 years ago
- Models and codes for INTERSPEECH 2023 paper DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model☆13Mar 30, 2025Updated last year
- A diffusion-based cross-lingual voice conversion model, as my bachelor's thesis☆45Jul 24, 2023Updated 2 years ago
- A simple command line tool to calculate WER for ASR.☆14Oct 14, 2024Updated last year
- ☆14Mar 26, 2024Updated 2 years ago
- This repository contains a fine-tuning script for the transcription task of Mistral's Voxtral model.☆27Jul 31, 2025Updated 10 months ago
- Llama-Mimi is a speech language model that uses a unified tokenizer (Mimi) and a single Transformer decoder (Llama) to jointly model sequ…☆31Sep 20, 2025Updated 9 months ago
- ASR & TTS joint training, asr, tts, machine speech chain☆16Oct 16, 2021Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.☆13Jun 2, 2023Updated 3 years ago
- ☆28Apr 17, 2023Updated 3 years ago
- ☆17Nov 15, 2018Updated 7 years ago
- WebRTC-based real-time audio streaming with Faster Whisper ASR integration for live speech-to-text transcription.☆13Sep 27, 2024Updated last year
- A Weakly Supervised Forced Alignment for disluent speech☆15Nov 12, 2023Updated 2 years ago
- A Genius workflow for Alfred 3☆11Nov 17, 2017Updated 8 years ago
- A Python library for computing the Mel-Cepstral Distance (Mel-Cepstral Distortion, MCD) between two inputs. This implementation is based …☆67Aug 24, 2025Updated 10 months ago
- ☆46Jun 11, 2025Updated last year
- Code release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices"☆23Jun 7, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Official release of StyleTalk dataset.☆75Jul 1, 2024Updated last year
- A stack based screen manager for Unity's UI.☆74Jun 11, 2026Updated 2 weeks ago
- Python клиент API распознавания и синтеза речи Облака ЦРТ☆11Dec 26, 2022Updated 3 years ago
- Web UI for seamless interaction with various Computer Vision tasks, featuring highly configurable visual elements.☆13Mar 3, 2025Updated last year
- ☆10Aug 15, 2023Updated 2 years ago
- ☆13Dec 7, 2022Updated 3 years ago
- Test Framework for few-shot open set KWS☆43Nov 8, 2024Updated last year