scb-10x / typhoon2-audio
The repository of Typhoon2-Audio, Thai audio-language model that supports speech-in and speech-out
☆14Updated 2 months ago
Alternatives and similar repositories for typhoon2-audio:
Users that are interested in typhoon2-audio are comparing it to the libraries listed below
- ☆10Updated 3 months ago
- ☆36Updated 11 months ago
- WangChanGLM 🐘 - The Multilingual Instruction-Following Model☆94Updated last year
- Speech Emotion Recognition using PyTorch sponsored by AIS and VISTEC-DEPA AIResearch Institute Thailand.☆19Updated 3 years ago
- WangchanX Fine-tuning Pipeline☆45Updated 6 months ago
- KhanomTan TTS (ขนมตาล) is an open-source Thai text-to-speech model that supports multilingual speakers such as Thai, English, and others.☆32Updated 2 years ago
- ☆16Updated 7 months ago
- English-Thai Machine Translation Models☆28Updated 11 months ago
- Open TTS models, built for streaming on the edge☆39Updated last month
- Pretraining transformer based Thai language models☆121Updated last year
- It is fine-tune the GPT-Neo model for Thai language.☆12Updated 3 years ago
- Finetune wav2vec2-large-xlsr-53 with Thai Common Voice Corpus 7.0☆48Updated 3 years ago
- Tesseract OCR tools for read Thai National Document used TH Sarabun National Font trained and fine-tuned. Read README.md to see about my …☆25Updated 2 years ago
- Open Source Thai Text-to-speech library in Python☆39Updated last year
- F5-TTS-THAI เครื่องมือสร้างเสียงพูดจากข้อความด้วย Zero-Shot TTS ภาษาไทย☆37Updated this week
- scripts for cleaning and creating train/validation/test splits for Thai commonvoice☆11Updated 3 years ago
- Notebook and Scripts that showcase running quantized diffusion models on consumer GPUs☆38Updated 5 months ago
- Dataset for fake news detection in healthcare domain☆12Updated 2 years ago
- A lightweight Python library for running TTS models with a unified API.☆17Updated 2 months ago
- ☆24Updated last year
- Fix Thai PDF☆33Updated 2 months ago
- ☆38Updated 4 years ago
- Ichigo Whisper is a compact (22M parameters), open-source speech tokenizer for the Whisper-medium, designed to enhance performance on mul…☆15Updated 3 months ago
- ☆88Updated 2 weeks ago
- More than 43+ collections of Thai Natural Language Processing libraries. Update daily.☆27Updated 6 years ago
- A list of language models with permissive licenses such as MIT or Apache 2.0☆24Updated last month
- Dippy Synthetic Speech Subnet☆16Updated 3 weeks ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆62Updated 2 weeks ago
- Multi-Modal Language Modeling with Image, Audio and Text Integration, included multi-images and multi-audio in a single multiturn.☆17Updated last year
- ☆62Updated 9 months ago