Caucasus languages focused multilingual and monolingual corpuses for Natural Language Processing(NLP)
☆37Nov 29, 2024Updated last year
Alternatives and similar repositories for Lingua-Corpus
Users that are interested in Lingua-Corpus are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- No Language Left Unlocked: scalable backtranslation of NLLB models☆14Aug 4, 2025Updated 9 months ago
- TypeScript library for the LibreTranslate API. With TypeScript type definitions. Can also be used with JavaScript.☆17Mar 13, 2025Updated last year
- LOW-RESOURCE NEURAL MACHINE TRANSLATION: A BENCHMARK FOR FIVE AFRICAN LANGUAGES☆16Jul 27, 2020Updated 5 years ago
- Source code for "N-ary Constituent Tree Parsing with Recursive Semi-Markov Model" published at ACL 2021☆10May 27, 2021Updated 4 years ago
- zero-vocab or low-vocab embeddings☆18Jul 17, 2022Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Argos Translate package index☆39Oct 26, 2025Updated 6 months ago
- Making More of Little Data: Improving Low-Resource Automatic Speech Recognition Using Data Augmentation☆18May 17, 2023Updated 2 years ago
- Python scripts and datasets of the "Extremely Low-Resource Neural Machine Translation: A Case Study of Cantonese" project☆16Oct 28, 2022Updated 3 years ago
- Unifying Cross-Lingual Semantic Role Labeling with Heterogeneous Linguistic Resources (NAACL-2021).☆17Nov 18, 2021Updated 4 years ago
- Unicode Standard tokenization routines and orthography profile segmentation☆41Mar 7, 2026Updated 2 months ago
- [COLING'22] Code for "Semantic Role Labeling as Dependency Parsing: Exploring Latent Tree Structures Inside Arguments".☆61Oct 8, 2023Updated 2 years ago
- Source code of ACL2022 "Headed-Span-Based Projective Dependency Parsing" and "Combining (second-order) graph-based and headed-span-based …☆16Jan 12, 2023Updated 3 years ago
- A Language-consistent Open Relation Extraction Model.☆16Mar 24, 2023Updated 3 years ago
- ☆21Feb 3, 2021Updated 5 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆12Feb 17, 2021Updated 5 years ago
- several algorithms for converting dependency structures into constituency structures.☆10Feb 7, 2022Updated 4 years ago
- A starter kit for evaluating benchmarks on the 🤗 Hub☆16Apr 8, 2026Updated last month
- Convolutional Neural Networks☆17Mar 19, 2015Updated 11 years ago
- This repository contains the code for the paper in Findings of EMNLP 2021: "EfficientBERT: Progressively Searching Multilayer Perceptron …☆33Jun 14, 2023Updated 2 years ago
- Preprocessing and analysis for training SNOMED-CT concept embeddings from CORD-19 corpus☆16Aug 4, 2023Updated 2 years ago
- Source stories from the African Storybook Project in Markdown format☆22Jan 25, 2026Updated 3 months ago
- Hosts text-to-speech corpus and speech synthesizers for African languages.☆18May 31, 2023Updated 2 years ago
- ☆13Jul 6, 2021Updated 4 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Bilingual sentence similarity classifier using Tensorflow☆24Sep 26, 2019Updated 6 years ago
- ☆12Apr 25, 2022Updated 4 years ago
- ☆13Nov 12, 2025Updated 5 months ago
- Code and models for the paper titled "Better Feature Integration for Named Entity Recognition", NAACL 2021.☆30Nov 5, 2021Updated 4 years ago
- Placeholder repository☆15Mar 16, 2022Updated 4 years ago
- Compare the phonetic inventory of two languages.☆16Sep 5, 2018Updated 7 years ago
- [ACL 2021] Learning Relation Alignment for Calibrated Cross-modal Retrieval☆34May 16, 2023Updated 2 years ago
- SemClinBr - a multi-institutional and multi-specialty semantically annotated corpus for Portuguese clinical NLP tasks☆34Mar 12, 2024Updated 2 years ago
- ☆17Sep 10, 2021Updated 4 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Zero-Shot Learning in Named Entity Recognition with Common Sense Knowledge☆17Nov 16, 2021Updated 4 years ago
- A lexicon compiler for non-suffixational morphologies☆13Jan 29, 2026Updated 3 months ago
- R course for DH master program in HSE☆10Jan 17, 2022Updated 4 years ago
- TensorFlowLiteNet allows to use TensorFlowLite from C#.☆11Apr 14, 2021Updated 5 years ago
- A python library for highly configurable transformers - easing model architecture search and experimentation.☆48Nov 30, 2021Updated 4 years ago
- CHOLAN: A Modular Approach for Neural Entity Linking on Wikipedia and Wikidata☆32Jan 21, 2022Updated 4 years ago
- Rationales for Sequential Predictions☆40Mar 10, 2022Updated 4 years ago