MahtaFetrat / ManaTTS-Persian-Speech-Dataset
ManaTTS is the largest open Persian speech dataset with 100+ hours of transcribed audio. Includes data collection pipeline and tools. Suitable for Persian text-to-speech models.
☆25Updated 2 months ago
Alternatives and similar repositories for ManaTTS-Persian-Speech-Dataset:
Users who are interested in ManaTTS-Persian-Speech-Dataset are comparing it to the repositories listed below.
- A collection of inspiring lists, repos, datasets, models, tools and more for Persian speech-to-text (STT) and text-to-speech (TTS)…☆63Updated 4 months ago
- Persian text-to-speech Streamlit interface☆37Updated 4 months ago
- Open source crawler for Persian websites.☆19Updated last year
- ☆27Updated 2 years ago
- A tool for translating Persian text to IPA (International Phonetic Alphabet).☆64Updated 2 years ago
- ☆44Updated last year
- Persian ASR dataset☆39Updated last year
- Persian/Farsi text-to-speech (TTS) training using Coqui TTS☆144Updated 2 months ago
- Takes a number and converts it to Persian word form☆42Updated 6 years ago
- BERT-based Persian spell-checker☆15Updated last year
- Tacotron 2 - Persian☆33Updated 3 years ago
- Persian Grapheme-to-Phoneme (G2P) converter☆40Updated 9 months ago
- An accurate scraper for popular Persian websites, mostly intended to be used as a tool to create large corpora for Persian languag…☆34Updated 3 months ago
- ☆26Updated 10 months ago
- Persian BERT for Long-Range Sequences☆63Updated 3 years ago
- A grapheme-to-phoneme model using LSTM, implemented in PyTorch☆12Updated 2 years ago
- Fine-tune Wav2Vec2, an ASR model released by Facebook☆37Updated 3 years ago
- Tihu dictionary for Persian language☆12Updated 5 years ago
- ParsBench provides toolkits for benchmarking LLMs on Persian language tasks.☆66Updated 8 months ago
- ☆134Updated 6 years ago
- A Deep-Learning-Based Persian Speech Recognition System☆220Updated last year
- Code and Resources for "LLM-Powered Grapheme-to-Phoneme Conversion: Benchmark and Case Study", introducing methods to leverage LLMs for G…☆8Updated 2 months ago
- Code, datasets, and models designed to generate product catalogs using LLMs.☆33Updated 7 months ago
- A benchmark for evaluation and comparison of various NLP tasks in the Persian language.☆75Updated 3 years ago
- ☆20Updated 2 years ago
- Persian OCR dataset☆75Updated 2 years ago
- A collection of Persian stopwords - فهرست کلمات ایست فارسی☆59Updated 3 years ago
- 🌟 Cache-cool: A fast, flexible LLM caching proxy that reduces latency and API costs by caching repetitive calls to LLM services. 🔄 Su…☆25Updated 7 months ago
- Iranian/Persian Datasets. دیتاستهای فارسی و ایرانی☆114Updated this week
- PCoQA: Persian Conversational Question Answering Dataset☆20Updated 8 months ago