cpfriend1721994 / docker-es-cococ-tokenizer
Dockerfile and docker-compose setup for Elasticsearch with the elasticsearch-analysis-vietnamese and coccoc-tokenizer plugins
☆9 Updated 2 years ago
Related projects
Alternatives and complementary repositories for docker-es-cococ-tokenizer
- Vietnamese text data preprocessing in 4 steps ☆11 Updated 3 years ago
- Solution for Zalo AI Challenge 2022 - E2E Question Answering ☆109 Updated last year
- Pre-trained Word2Vec models for Vietnamese ☆152 Updated 3 years ago
- Zalo AI Challenge voice gender classification (https://challenge.zalo.ai/) ☆129 Updated 6 years ago
- Vietnamese corpus ☆346 Updated 5 months ago
- Source code for Zalo AI 2021 submission ☆136 Updated 2 years ago
- Machine translation between English and Vietnamese ☆49 Updated 4 years ago
- An extension for bypassing Medium paywall. ☆112 Updated last year
- Repository to track the progress in Vietnamese Natural Language Processing, including the datasets and the current state-of-the-art for t… ☆346 Updated 2 years ago
- MTet: Multi-domain Translation for English and Vietnamese ☆176 Updated last year
- A number-processing library built specifically for Vietnamese. ☆75 Updated 2 months ago
- Python Vietnamese Core NLP Toolkit ☆245 Updated last month
- Large language model for Vietnamese speakers ☆60 Updated last year
- Top-1 solution for the Zalo AI Challenge 2021 hum-to-song task ☆108 Updated 2 years ago
- Vietnamese OCR ☆134 Updated 5 years ago
- Pre-trained Word2Vec syllable- and word-level embeddings for Vietnamese ☆50 Updated last year
- Vietnamese language model for spacy.io ☆103 Updated last year
- ☆25 Updated 2 months ago
- Sentiment classification for Vietnamese text using PhoBert ☆96 Updated 3 years ago
- My own Docker tutorial for self-learning ☆94 Updated 2 years ago
- PhoMT: A High-Quality and Large-Scale Benchmark Dataset for Vietnamese-English Machine Translation (EMNLP 2021) ☆39 Updated 3 months ago
- ☆24 Updated last year
- ☆46 Updated last year
- BARTpho: Pre-trained Sequence-to-Sequence Models for Vietnamese (INTERSPEECH 2022) ☆99 Updated 3 months ago
- Solution for MC_OCR competition ☆91 Updated last year
- Creating a chatbot from your Facebook data with GPT ☆23 Updated 2 years ago
- COVID-19 Named Entity Recognition for Vietnamese (NAACL 2021) ☆65 Updated 3 months ago
- Improving Elasticsearch for semantic search using sentence embeddings ☆24 Updated 3 years ago
- PhoBERT: Pre-trained language models for Vietnamese (EMNLP-2020 Findings) ☆661 Updated 3 months ago