NhutP / VietSpeech
☆11 Apr 25, 2025 Updated 9 months ago
Alternatives and similar repositories for VietSpeech
Users that are interested in VietSpeech are comparing it to the libraries listed below
- ☆13 Apr 27, 2025 Updated 9 months ago
- Software Engineering Back End Microservices Project ☆15 Nov 20, 2024 Updated last year
- ☆13 Jul 23, 2025 Updated 6 months ago
- End to End Speech to Speech with Emotion System ☆15 Feb 6, 2025 Updated last year
- GStar Bootcamp - Assignment 1 ☆16 Sep 8, 2025 Updated 5 months ago
- ☆11 Jan 1, 2024 Updated 2 years ago
- EraX-VL-7B-V1 is the multimodal large language model developed by the EraX team, based on Qwen2-VL. ☆12 Dec 31, 2024 Updated last year
- A Python library for text normalization, specifically designed for Vietnamese and English text processing. This library provides comprehe… ☆13 Mar 30, 2025 Updated 10 months ago
- Integrate DuckDuckGo search seamlessly into your n8n workflows. Enhance automation with advanced searches and tailored queries. Try it no… ☆14 Mar 31, 2024 Updated last year
- Implementation of F5-TTS in MLX ☆15 Dec 13, 2024 Updated last year
- jupyter notebooks to fine tune whisper models on Vietnamese using Colab and/or Kaggle and/or AWS EC2 ☆19 Aug 15, 2025 Updated 6 months ago
- ☆12 Nov 1, 2023 Updated 2 years ago
- Kokoro Language Model Training Script for Russian (Ruslan Corpus) ☆34 Updated this week
- VietConizer: Vietnamese OCR with NVIDIA DALI ☆16 Jul 5, 2025 Updated 7 months ago
- A blog on various ML topics, and my thoughts. ☆25 Jan 14, 2026 Updated last month
- ☆43 Sep 3, 2025 Updated 5 months ago
- Research on training an LLM with DeepSeek & Kimi architecture ☆36 Sep 30, 2025 Updated 4 months ago
- This project combines the power of Retrieval-Augmented Generation (RAG) with AssemblyAI's transcription capabilities, enabling you to int… ☆22 Sep 30, 2025 Updated 4 months ago
- ☆27 Jun 12, 2025 Updated 8 months ago
- ☆24 May 19, 2024 Updated last year
- ☆34 Jul 8, 2025 Updated 7 months ago
- Image to Latex using Encoder-Decoder architecture ☆18 May 21, 2025 Updated 8 months ago
- An Enhanced Version of Piper especially for Vietnamese :) ☆26 Apr 24, 2025 Updated 9 months ago
- Awesome-LLM-Productization: a curated list of tools/tricks/news/regulations about AI and Large Language Model (LLM) productization ☆23 Jan 23, 2025 Updated last year
- ☆136 Apr 23, 2025 Updated 9 months ago
- finetune llm part for spark-tts model ☆120 Mar 25, 2025 Updated 10 months ago
- Convert LLaMA3.1-8B to DeepSeek R1 MLA & MoE (raw) ☆24 Mar 10, 2025 Updated 11 months ago
- This project involves automating the attendance system of RT Knits using Face Recognition. Due to Covid-19, people are obliged to wear ma… ☆22 Dec 20, 2021 Updated 4 years ago
- a fully open-source implementation of a GPT-4o-like speech-to-speech video understanding model. ☆37 Apr 7, 2025 Updated 10 months ago
- A repo of a modified version of Diffusion Transformer ☆46 Sep 14, 2025 Updated 5 months ago
- Scalable, cloud-native recommender system with end-to-end MLOps for building, training, and deploying models in research and production ☆53 Aug 5, 2025 Updated 6 months ago
- A database for modern, open-source TTS systems. ☆31 Feb 4, 2026 Updated last week
- Official implementation of paper: Shallow Flow Matching for Coarse-to-Fine Text-to-Speech Synthesis ☆50 Sep 20, 2025 Updated 4 months ago
- This project demonstrates a production-grade MLOps pipeline that deploys a YOLOv11-based face detection service on Google Kubernetes Engi… ☆38 Jun 9, 2025 Updated 8 months ago
- OpenFLAM: Framewise Language Audio Model ☆88 Jan 14, 2026 Updated last month
- Deep Learning Audio Course – AI Masters ☆36 Updated this week
- Baseline achieving 0.8 accuracy on the private test set in the ZaloAI Challenge 2023 Elementary Math Solving ☆24 May 1, 2024 Updated last year
- ☆56 Jan 18, 2025 Updated last year
- A lightweight, efficient variation of the StyleTTS 2 text‐to‐speech model. ☆52 May 22, 2025 Updated 8 months ago