cpfriend1721994 / docker-es-cococ-tokenizer
Dockerfile/docker-compose Elasticsearch with plugins elasticsearch-analysis-vietnamese and coccoc-tokenizer
☆9Updated 3 years ago
Alternatives and similar repositories for docker-es-cococ-tokenizer
Users that are interested in docker-es-cococ-tokenizer are comparing it to the libraries listed below
Sorting:
- Solution for Zalo AI Challenge 2022 - E2E Question Answering☆111Updated 2 years ago
- A Robustly Optimized BERT Pretraining Approach for Vietnamese☆32Updated 9 months ago
- SimeCSE_Vietnamese: Simple Contrastive Learning of Sentence Embeddings with Vietnamese☆20Updated 3 years ago
- Source code for Zalo AI 2021 submission☆140Updated 3 years ago
- Thư viện chuẩn hóa văn bản Tiếng Việt☆177Updated 2 weeks ago
- MTet: Multi-domain Translation for English and Vietnamese☆183Updated 2 years ago
- ☆25Updated 8 months ago
- Dịch máy giữa ngôn ngữ anh-viet☆51Updated 4 years ago
- A Vietnamese-English Neural Machine Translation System (INTERSPEECH 2022)☆128Updated 9 months ago
- ☆13Updated 2 years ago
- A project improves the quality and accuracy of the Vietnamese language.☆42Updated 6 months ago
- Sentiment classification for Vietnamese text using PhoBert☆98Updated 4 years ago
- Pre-trained Word2Vec syllable- and word-level embeddings for Vietnamese☆52Updated last year
- Xây dựng tập dữ liệu 500GB (20% done) văn bản tiếng Việt để huấn luyện mô hình ngôn ngữ lớn☆26Updated 2 years ago
- Solution for MC_OCR competition☆95Updated 2 years ago
- Thư viện xữ lý chữ số dành riêng cho Tiếng Việt.☆75Updated 3 months ago
- ☆46Updated last year
- Vietnamese self-supervised Wav2vec2 model☆62Updated 2 years ago
- Corpus tiếng việt☆358Updated 11 months ago
- ☆30Updated 2 years ago
- ☆69Updated 2 years ago
- Cải thiện Elasticsearch trong bài toán semantic search sử dụng phương pháp Sentence Embeddings☆25Updated 3 years ago
- top 1 Zalo AI challenge 2021 task hum to song☆109Updated 3 years ago
- Bud500: A Comprehensive Vietnamese ASR Dataset