KoichiYasuoka / esuparLinks
Tokenizer POS-Tagger and Dependency-parser with BERT/RoBERTa/DeBERTa/GPT models for Japanese and other languages
☆52Updated 3 months ago
Alternatives and similar repositories for esupar
Users that are interested in esupar are comparing it to the libraries listed below
Sorting:
- GrammarTagger — A Neural Multilingual Grammar Profiler for Language Learning☆31Updated 4 years ago
- OpusFilter - Parallel corpus processing toolkit☆113Updated this week
- Tokenizer POS-tagger and Dependency-parser for Classical Chinese☆14Updated 5 months ago
- cLang-8 is a dataset for grammatical error correction.☆110Updated 3 years ago
- Repository to collect and categorize Grammatical Error Correction papers.☆121Updated 4 months ago
- Utility scripts for preprocessing Wikipedia texts for NLP☆78Updated last year
- X-SCITLDR: Cross-Lingual Extreme Summarization of Scholarly Documents (JCDL 2022)☆14Updated 3 years ago
- Code for paper "Kanbun-LM: Reading and Translating Classical Chinese in Japanese Method by Language Models"☆20Updated 2 years ago
- ☆61Updated 2 years ago
- Multilingual sentence alignment using sentence embeddings☆131Updated last year
- Code and models used in "MUSS Multilingual Unsupervised Sentence Simplification by Mining Paraphrases".☆100Updated 2 years ago
- ICU based universal language tokenizer☆33Updated 3 years ago
- An example usage of JParaCrawl pre-trained Neural Machine Translation (NMT) models.☆105Updated 4 years ago
- allennlp-light is a port of AllenNLP's core modules and nn portions into a standalone package with minimum dependencies☆55Updated 3 years ago
- A accurate multilingual word aligner based on LaBSE☆24Updated 2 years ago
- The Business Scene Dialogue corpus☆71Updated 4 years ago
- Kex is a python library for unsupervised keyword extraction from a document, providing an easy interface and benchmarks on 15 public data…☆54Updated 3 years ago
- STREUSLE: a corpus with comprehensive lexical semantic annotation (multiword expressions, supersenses)☆68Updated 3 weeks ago
- ☆32Updated 2 years ago
- ☆57Updated 2 years ago
- You can create datasets from Wikia/Wikipedia that can be used for entity recognition and Entity Linking. Dumps for ja-wiki and VTuber-wik…☆17Updated 4 years ago
- Repository for the paper "MultiNERD: A Multilingual, Multi-Genre and Fine-Grained Dataset for Named Entity Recognition (and Disambiguatio…☆45Updated last year
- TUFS Asian Language Parallel Corpus☆51Updated 2 years ago
- SciWING is a modern toolkit for scientific document processing from WING-NUS☆63Updated 2 years ago
- An easy-to-use API for analyzing INCEpTION annotation projects.☆17Updated 2 years ago
- A Language-consistent Open Relation Extraction Model.☆16Updated 2 years ago
- Scripts for document-level grammatical error correction.☆18Updated 4 years ago
- A tiny BERT for low-resource monolingual models☆31Updated 2 months ago
- mSimCSE: Multilingual SimCSE☆34Updated 3 years ago
- Code accompanying the submission "Structural Text Segmentation of Legal Documents" by Aumiller et al.☆98Updated 2 years ago