cl-tohoku/PheMT

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/cl-tohoku/PheMT)

cl-tohoku / PheMT

A phenomenon-wise evaluation dataset for Japanese-English machine translation robustness. The dataset is based on the MTNT dataset, with additional annotations of four linguistic phenomena; Proper Noun, Abbreviated Noun, Colloquial Expression, and Variant. COLING 2020.

☆19

Alternatives and similar repositories for PheMT

Users that are interested in PheMT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

tsuruoka-lab / BSD
View on GitHub
The Business Scene Dialogue corpus
☆75Nov 10, 2021Updated 4 years ago
mlpnlp / mlpnlp-nmt
View on GitHub
This is a sample code of "LSTM encoder-decoder with attention mechanism" mainly for understanding a recently developed machine translatio…
☆44Mar 14, 2019Updated 7 years ago
snakers4 / emoji-sentiment-dataset
View on GitHub
☆15May 16, 2019Updated 7 years ago
MorinoseiMorizo / jparacrawl-finetune
View on GitHub
An example usage of JParaCrawl pre-trained Neural Machine Translation (NMT) models.
☆105Apr 29, 2021Updated 5 years ago
languagetool-org / languagetool-website-2018
View on GitHub
OUTDATED, do not use anymore
☆12Aug 10, 2021Updated 4 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
KaniyamFoundation / all_tamil_nouns
View on GitHub
A project to collect all tamil nouns
☆12Dec 14, 2024Updated last year
ThaniThamizhAkarathiKalanjiyam / agarathi
View on GitHub
Open Sourced Tamil Dictionary
☆15Jul 25, 2021Updated 5 years ago
SoYoungCho / Korean-English-NMT
View on GitHub
Neural Machine Translation model for Capstone Project
☆11Apr 11, 2020Updated 6 years ago
IvanWang0730 / StyleAP
View on GitHub
Code and Data for Paper "Controlling Styles in Neural Machine Translation with Activation Prompt" (ACL 2023 Findings)
☆16Dec 20, 2022Updated 3 years ago
vaaaaanquish / dajare-detector
View on GitHub
Japanese joke detection
☆13Dec 11, 2020Updated 5 years ago
shunk031 / human-attention-map-for-text-classification
View on GitHub
Reimplementation of the paper `Human Attention Maps for Text Classification: Do Humans and Neural Networks Focus on the Same Words? (ACL2…
☆17Jul 10, 2020Updated 6 years ago
chakki-works / entitypedia
View on GitHub
Entitypedia is an Extended Named Entity Dictionary from Wikipedia.
☆13Dec 7, 2022Updated 3 years ago
nobu-g / cohesion-analysis
View on GitHub
Code for COLING 2020 Paper
☆13Feb 3, 2026Updated 5 months ago
hppRC / japanese-sentence-breaker
View on GitHub
🧨 Japanese Sentence Breaker 🧨
☆14Jun 6, 2021Updated 5 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
yuvalpinter / nytwit
View on GitHub
New York Times Word Innovation Types dataset
☆21Dec 1, 2020Updated 5 years ago
MuntashirAkon / SlobDict
View on GitHub
A modern, lightweight GTK 4 dictionary app for Linux
☆26Mar 20, 2026Updated 4 months ago
Deerjump / Scripticus
View on GitHub
A Discord Bot for the Legends of Idleon Discord Server
☆11Aug 15, 2022Updated 3 years ago
himkt / interest
View on GitHub
👀 Interest: Organizing papers+materials which you are interested in. Serverless application powered by GitHub pages + Google Spreadshee…
☆16Jan 7, 2023Updated 3 years ago
DHRI-Curriculum / command-line
View on GitHub
@DHRI-Curriculum Session on the command line, a means of interacting with your computer programmatically through text.
☆15May 8, 2024Updated 2 years ago
nttcslab-nlp / word_align
View on GitHub
A Supervised Word Alignment Method based on Cross-Language Span Prediction using Multilingual BERT
☆26Jan 27, 2021Updated 5 years ago
skittlesaur / cairometro
View on GitHub
Cairo Metro System
☆13Sep 7, 2023Updated 2 years ago
ku-nlp / bertknp
View on GitHub
A Japanese dependency parser based on BERT
☆23Oct 26, 2022Updated 3 years ago
hiroshi-manabe / japanese_verb_adj_list
View on GitHub
A list of Japanese verbs and adjectives.
☆23Oct 1, 2025Updated 9 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
jamesohortle / loanwords_gairaigo
View on GitHub
English loanwords in Japanese
☆19Oct 24, 2024Updated last year
ssun32 / CLIRMatrix
View on GitHub
☆18Jul 23, 2021Updated 5 years ago
samrawal / BiLSTM-CRF-Keras
View on GitHub
Easily-configurable implementation of BiLSTM-CRF in Keras for Named Entity Recognition
☆20Sep 23, 2019Updated 6 years ago
WladimirSidorenko / PotTS
View on GitHub
The Potsdam Twitter Sentiment Corpus
☆18Jan 15, 2020Updated 6 years ago
aboueleyes / quran-dl
View on GitHub
Download Quran using CLI
☆13Nov 16, 2023Updated 2 years ago
UKPLab / germeval2017-sentiment-detection
View on GitHub
Sentence Embeddings used in the GermEval-2017 Submission
☆13May 23, 2023Updated 3 years ago
matiasngf / pepito-tracker
View on GitHub
An app to show where is pepito the cat
☆10Sep 4, 2023Updated 2 years ago
chemicaltree / tetra
View on GitHub
☆10Sep 14, 2022Updated 3 years ago
teaspn / teaspn-sdk
View on GitHub
SDK for TEASPN, a framework and a protocol for integrated writing assistance environments
☆59Dec 9, 2022Updated 3 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
CyberJutsu / WebPentest
View on GitHub
☆10Nov 22, 2022Updated 3 years ago
moskomule / chika
View on GitHub
chika is a simple and easy config tool for hierarchical configurations.
☆20Jul 10, 2023Updated 3 years ago
ajb129 / KeyakiTreebank
View on GitHub
Keyaki Treebank Parsed Corpus
☆10May 15, 2019Updated 7 years ago
yahshibu / nested-ner-tacl2020-flair
View on GitHub
Implementation of Nested Named Entity Recognition using Flair
☆24Oct 29, 2021Updated 4 years ago
OFAI / million-post-corpus
View on GitHub
Annotated data set consisting of user comments posted to a German-language newspaper website
☆18Jun 28, 2018Updated 8 years ago
hyungjin-chung / VPS
View on GitHub
☆16Sep 11, 2025Updated 10 months ago
nlevin / figmadesign
View on GitHub
Some misc things for Figma's design team
☆13Mar 25, 2023Updated 3 years ago