Emotions recognition from audio and text files (only russian language)
☆81Jun 23, 2025Updated 11 months ago
Alternatives and similar repositories for Aniemore
Users that are interested in Aniemore are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆146May 21, 2025Updated last year
- whisper.cpp HTTP transcription server with OpenAI-like API in Docker☆32Apr 5, 2026Updated 2 months ago
- 🎵 muse: Music Separation☆11Feb 14, 2024Updated 2 years ago
- Normalize Text in Russian☆29Nov 7, 2023Updated 2 years ago
- Neural model for prediction of stress position in Russian words☆13Jun 22, 2025Updated 11 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Russian speech technology links☆398Mar 17, 2026Updated 2 months ago
- ☆13Aug 7, 2021Updated 4 years ago
- T5-based (russian) text normalization☆27Jan 25, 2024Updated 2 years ago
- Implementation of CoBERT: Self-Supervised Speech Representation Learning Through Code Representation Learning☆48Nov 8, 2023Updated 2 years ago
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax☆16Jun 16, 2024Updated last year
- ⚡ Blazing fast audio augmentation in Python, powered by GPU for high-efficiency processing in machine learning and audio analysis tasks.☆37May 8, 2026Updated last month
- ☆13May 6, 2026Updated last month
- ☆14Nov 22, 2022Updated 3 years ago
- First Place Solution☆12Dec 19, 2022Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Foundational Model for Speech Recognition Tasks☆600Apr 15, 2026Updated last month
- BLSP-Emo: Towards Empathetic Large Speech-Language Models☆61Jun 7, 2024Updated 2 years ago
- Models and codes for INTERSPEECH 2023 paper DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model☆13Mar 30, 2025Updated last year
- A diffusion-based cross-lingual voice conversion model, as my bachelor's thesis☆45Jul 24, 2023Updated 2 years ago
- A simple command line tool to calculate WER for ASR.☆14Oct 14, 2024Updated last year
- Llama-Mimi is a speech language model that uses a unified tokenizer (Mimi) and a single Transformer decoder (Llama) to jointly model sequ…☆31Sep 20, 2025Updated 8 months ago
- ASR & TTS joint training, asr, tts, machine speech chain☆16Oct 16, 2021Updated 4 years ago
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.☆13Jun 2, 2023Updated 3 years ago
- ☆29Apr 17, 2023Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- WebRTC-based real-time audio streaming with Faster Whisper ASR integration for live speech-to-text transcription.☆13Sep 27, 2024Updated last year
- A Weakly Supervised Forced Alignment for disluent speech☆15Nov 12, 2023Updated 2 years ago
- A Genius workflow for Alfred 3☆11Nov 17, 2017Updated 8 years ago
- A Python library for computing the Mel-Cepstral Distance (Mel-Cepstral Distortion, MCD) between two inputs. This implementation is based …☆67Aug 24, 2025Updated 9 months ago
- ☆45Jun 11, 2025Updated 11 months ago
- Code release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices"☆23Jun 7, 2025Updated last year
- [ICASSP 2023] Mingling or Misalignment? Temporal Shift for Speech Emotion Recognition with Pre-trained Representations☆39Dec 18, 2023Updated 2 years ago
- Official release of StyleTalk dataset.☆74Jul 1, 2024Updated last year
- A stack based screen manager for Unity's UI.☆73Updated this week
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Python клиент API распознавания и синтеза речи Облака ЦРТ☆11Dec 26, 2022Updated 3 years ago
- This repo contains code with comparison of Pandas speedup libs, such as modin, dask, swifter, pandarallel and numba☆13May 24, 2020Updated 6 years ago
- Web UI for seamless interaction with various Computer Vision tasks, featuring highly configurable visual elements.☆13Mar 3, 2025Updated last year
- ☆10Aug 15, 2023Updated 2 years ago
- A python library to generate speech dataset from Youtube videos☆37Jun 7, 2024Updated 2 years ago
- ☆13Dec 7, 2022Updated 3 years ago
- Test Framework for few-shot open set KWS☆43Nov 8, 2024Updated last year