huangjimmy / pg_cjk_parser
Postgres CJK Parser pg_cjk_parser is a fts (full text search) parser derived from the default parser in PostgreSQL 11. When a postgres database uses utf-8 encoding, this parser supports all the features of the default parser while splitting CJK (Chinese, Japanese, Korean) characters into 2-gram tokens. If the database's encoding is not utf-8, th…
☆35Updated last month
Related projects ⓘ
Alternatives and complementary repositories for pg_cjk_parser
- An offline command-line interface to zdic.net dictionary (漢典)☆48Updated 4 years ago
- ☆12Updated 2 years ago
- mecab 형태소 분석기와 mecab-ko-dic 사전을 이용한 PostgreSQL용 한국어 전문검색 full text search 확장모듈☆63Updated 10 months ago
- CJK computer science terms comparison / 中日韓電腦科學術語對照 / 日中韓のコンピュータ科学の用語対照 / 한·중·일 전산학 용어 대조☆128Updated 2 months ago
- PGroonga is a PostgreSQL extension to use Groonga as index. PGroonga makes PostgreSQL fast full text search platform for all languages!☆554Updated this week
- Multi-platform Mastodon client, inspired by Azurea (https://azurea.info)☆9Updated 4 years ago
- Statistics of Korean wiki sites☆21Updated 6 years ago
- A lightweight implementation of the Unicode Text Segmentation (UAX #29)☆57Updated last week
- font subset with google font's ML result☆40Updated last year
- Simple ⏱Time-based 🔑OTP Client☆13Updated 4 years ago
- Create beautiful interactive web application with Recative System.☆57Updated last year
- Small static site generator toolkit☆21Updated last year
- 📸 → 🖼️ with EXIF metadata☆42Updated this week
- Adobe Source Han font family converted to TrueType format☆39Updated 3 years ago
- A package for removing tracing parameters from URLs. This package supports automatically updating filtering rules from Adguard.☆14Updated 2 years ago
- Extension:AWS allows MediaWiki to use Amazon S3 (instead of the local directory) to store images.☆44Updated last week
- ☆10Updated 9 months ago
- ☆22Updated 10 months ago
- A optimizer for Chinese and Japanese typography on websites☆21Updated 7 months ago
- ✂️ Unicode-aware JS string splitting☆10Updated 9 months ago
- The pg_bigm module provides full text search capability in PostgreSQL. This module allows a user to create 2-gram (bigram) index for fast…☆71Updated 2 weeks ago
- pretty JSON schema definition language☆24Updated 8 years ago
- ISO 3166-1 country codes☆20Updated 8 years ago
- 寫嘢 - A chinese friendly zola theme.☆20Updated this week
- WIP: Docker Registry API on Cloudflare Workers☆29Updated last year
- hwp toolkit☆37Updated 3 years ago
- South Korea population data.☆11Updated 11 years ago
- Implementation of the HTCPCP Protocol☆14Updated 11 years ago
- (in alpha) 网页竖排。右起縱書。像古代一样。Make webs vertical lined layout, like traditional CJK writing method in east asian culture circle.(招日韩蒙越翻译)☆70Updated 2 years ago