lighttransport / jagger-python
Python binding for Jagger(C++ implementation of Pattern-based Japanese Morphological Analyzer)
β10Updated 10 months ago
Alternatives and similar repositories for jagger-python:
Users that are interested in jagger-python are comparing it to the libraries listed below
- Japanese synonym libraryβ53Updated 2 years ago
- π₯ Vaporetto is a fast and lightweight pointwise prediction based tokenizer. This is a Python wrapper for Vaporetto.β22Updated 4 months ago
- β25Updated 2 months ago
- This repository has implementations of data augmentation for NLP for Japanese.β64Updated last year
- Japanese-BPEEncoderβ41Updated 3 years ago
- Training and evaluation scripts for JGLUE, a Japanese language understanding benchmarkβ17Updated last week
- β83Updated last year
- Japanese tokenizer for Transformersβ79Updated last year
- JMultiWOZ: A Large-Scale Japanese Multi-Domain Task-Oriented Dialogue Dataset, LREC-COLING 2024β22Updated 10 months ago
- Funer is Rule based Named Entity Recognition tool.β22Updated 2 years ago
- β20Updated 4 years ago
- β15Updated 10 months ago
- β19Updated this week
- Finding all pairs of similar documents time- and memory-efficientlyβ58Updated 2 years ago
- Utility scripts for preprocessing Wikipedia texts for NLPβ75Updated 9 months ago
- An integrated Japanese analyzer based on foundation modelsβ131Updated 3 months ago
- γγΌγ·γ2γ‘γγγγγγ―γγΌγ«γγ¦δ½ζγγε―Ύθ©±γ³γΌγγΉβ95Updated 3 years ago
- β22Updated last year
- β15Updated last year
- Japanese Realistic Textual Entailment Corpus (NLP 2020, LREC 2020)β76Updated last year
- β47Updated last year
- Exploring Japanese SimCSEβ67Updated last year
- japanese sentence segmentation library for pythonβ70Updated last year
- The robust text processing pipeline framework enabling customizable, efficient, and metric-logged text preprocessing.β120Updated 3 months ago
- pythonηζ₯ζ¬θͺζε³ε½Ήε²δ»δΈγ·γΉγγ οΌASAοΌβ23Updated 2 years ago
- Repository for JSICKβ44Updated last year
- Mecab + NEologd + Docker + Python3β35Updated 2 years ago
- ζ₯ζ¬θͺT5γ’γγ«β114Updated 4 months ago
- RealPersonaChat: A Realistic Persona Chat Corpus with Interlocutors' Own Personalitiesβ51Updated 10 months ago
- Easily turn large English text datasets into Japanese text datasets using open LLMs.β17Updated last week