💫 A spaCy package for Yohei Tamura's Rust tokenizations library
☆35Mar 27, 2026Updated 3 months ago
Alternatives and similar repositories for spacy-alignments
Users that are interested in spacy-alignments are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Robust and Fast tokenizations alignment library for Rust and Python https://tamuhey.github.io/tokenizations/☆195Oct 4, 2023Updated 2 years ago
- Code to train Sentence BERT Japanese model for Hugging Face Model Hub☆11Aug 8, 2021Updated 4 years ago
- 📟 Logging utilities for spaCy☆12Nov 3, 2023Updated 2 years ago
- ☆17May 31, 2023Updated 3 years ago
- Code for COLING 2020 Paper☆13Feb 3, 2026Updated 4 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Sidewall is a Python library for interacting with the Dimensions search API.☆17Sep 11, 2024Updated last year
- Indexing project where we index a portion of the web using spark, hadoop and cassandra.☆22Oct 30, 2019Updated 6 years ago
- ☆19Apr 21, 2026Updated 2 months ago
- AJIMEE-Bench (Advanced Japanese IME Evaluation Benchmark)☆21Jan 13, 2025Updated last year
- To be readable without enhancing english power.☆10Jul 22, 2020Updated 5 years ago
- ☆16Dec 10, 2022Updated 3 years ago
- SciDTB: Discourse Dependency TreeBank for Scientific Abstracts☆29Jul 17, 2018Updated 7 years ago
- Integrated model to calculate the effects of resilient foods in catastrophic events☆11May 20, 2025Updated last year
- Finance::YahooJapan - A Perl module that enables GnuCash to get quotes of Japanese stocks and mutual funds from Yahoo! Finance JAPAN.☆13Aug 29, 2024Updated last year
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- script to evaluate pre-trained Japanese word2vec model on Japanese similarity dataset☆12Nov 4, 2024Updated last year
- ☆12Nov 10, 2023Updated 2 years ago
- Robust and Fast tokenizations alignment library for Rust and Python https://tamuhey.github.io/tokenizations/☆30Jul 12, 2021Updated 4 years ago
- TensorFlow and Numpy implementation of sparsemax☆15Dec 22, 2019Updated 6 years ago
- ☆17Mar 22, 2025Updated last year
- Japanese semantic test suite (FraCaS counterpart and extensions)☆13Apr 21, 2026Updated 2 months ago
- textlint rule that found mismatch between date and weekday.☆12Jun 20, 2025Updated last year
- 【2024年版】BERTによるテキスト分類☆30Jul 8, 2024Updated last year
- Build and (re)start go web apps after saving/creating/deleting source files.☆10Jul 25, 2016Updated 9 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- IPAdic packaged for easy use from Python.☆24Oct 31, 2021Updated 4 years ago
- ☯️ AllenNLP training configurations for promising models on Named Entity Recognition. (BiLSTM-CRF, BiLSTM-CNN-CRF, BERT, BERT-CRF)☆15Nov 26, 2020Updated 5 years ago
- Extract and infer personal attributes from dialogue☆12Sep 6, 2022Updated 3 years ago
- Supporting code for the EMNLP 2019 paper "Answers Unite! Unsupervised Metrics for Reinforced Summarization Models"☆14Jun 12, 2023Updated 3 years ago
- ☆15Aug 3, 2021Updated 4 years ago
- ☆15Mar 15, 2022Updated 4 years ago
- This is the Github repo of "CODA-19: Using a Non-Expert Crowd to Annotate Research Aspects on 10,000+ Abstracts in the COVID-19 Open Rese…☆38Oct 7, 2021Updated 4 years ago
- A Python Implementation of GLAD☆24Jan 18, 2021Updated 5 years ago
- A Japanese Parser☆34Nov 1, 2023Updated 2 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- A paper list for box embeddings☆17Jun 9, 2021Updated 5 years ago
- DefSent: Sentence Embeddings using Definition Sentences☆23Aug 5, 2021Updated 4 years ago
- ☆10Jun 11, 2024Updated 2 years ago
- Scripts for creating a Japanese-English parallel corpus and training NMT models☆18Nov 9, 2021Updated 4 years ago
- The data and the PyTorch implementation for the models and experiments in the paper "Language Model Decoding as Likelihood–Utility Alignm…☆14Sep 7, 2023Updated 2 years ago
- Python script to transform the Mobile Detect JSON database into an UA-based mobile detection VCL subroutine easily integrable in any Varn…☆14Nov 13, 2023Updated 2 years ago
- Cython extension module for C++ geographiclib functions☆16Mar 6, 2022Updated 4 years ago