LazarusNLP / IndoT5
T5 Language Models for the Indonesian Language!
☆12Updated last year
Alternatives and similar repositories for IndoT5
Users who are interested in IndoT5 are comparing it to the repositories listed below.
- Multilingual Speech Recognition for Indonesian Languages☆66Updated 2 years ago
- NLP Datasets for Indonesian☆121Updated 2 years ago
- Experiment with OpenAI Whisper on Indonesian Languages☆13Updated 2 years ago
- Indonesian TTS (text-to-speech) using Coqui TTS☆80Updated 3 years ago
- Curriculum on Artificial Intelligence☆43Updated 5 years ago
- A simple, consistent and extendable toolkit for IndicTrans2. (Pypi: https://pypi.org/project/indictranstoolkit)☆37Updated 2 months ago
- ☆35Updated last year
- Finetune VITS and MMS using HuggingFace's tools☆164Updated last year
- Translate large datasets into any language with the Google translation API and multithreaded processing, no key required!☆72Updated 11 months ago
- Real-time Voice Activity Detection (VAD) with some example use cases, such as a simple voice bot and live (real-time) transcription☆98Updated last month
- The official implementation of CATT Arabic diacritization models.☆55Updated 2 months ago
- A blueprint for creating Pretraining and Fine-Tuning datasets for Indic languages☆112Updated 11 months ago
- Welcome to our repository! This repository hosts the data on "IndoCollex: A Testbed for Morphological Transformation of Indonesian Word …☆22Updated 4 years ago
- ☆56Updated 2 months ago
- Indonesian Language Models and their Usage☆161Updated 2 years ago
- Automatic Speech Recognition for Indonesian☆18Updated 4 years ago
- This repository contains a collection of raw data in the form of articles from various online media in Indonesia. (Raw dataset of Indonesian news art…☆42Updated 6 years ago
- ☆58Updated this week
- A framework for Arabic spelling correction using different seq2seq model architectures such as transformers and RNNs☆22Updated last year
- Arabic cleaning, normalization and segmentation library.☆71Updated last year
- Indonesian-AI Final Project aimed at providing in-depth insights into the 2024 election through social network analysis and sentiment ass…☆16Updated last year
- The largest public catalogue for Arabic NLP and speech datasets. There are +500 datasets annotated with more than 25 attributes.☆178Updated 3 months ago
- Sarjana is an open source desktop application which is used to assist in reading information materials, be it research papers or technica…☆24Updated last year
- Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface.☆338Updated 2 years ago
- ☆158Updated 2 years ago
- High-quality parallel resource on sentiment analysis for 10 low-resource Indonesian languages, English, and Indonesian (Outstanding Paper…☆104Updated 2 years ago
- A benchmark dataset for Indonesian text summarization.☆76Updated 6 years ago
- ☆42Updated 2 years ago
- مستودع الأوراق المسحية في معالجة اللغة العربية (أسبر) A Repository for survey and review papers in Arabic Natural Language processing (AN…☆81Updated last month
- ☆19Updated 2 months ago