Caucasus-Rosetta/Lingua-Corpus

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Caucasus-Rosetta/Lingua-Corpus)

Caucasus-Rosetta / Lingua-Corpus

Caucasus languages focused multilingual and monolingual corpuses for Natural Language Processing(NLP)

☆37

Alternatives and similar repositories for Lingua-Corpus

Users that are interested in Lingua-Corpus are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ymoslem / OpenNMT-Web-Interface
View on GitHub
Machine Translation Web Interface for OpenNMT-py
☆26Dec 24, 2021Updated 4 years ago
LibreTranslate / nllu
View on GitHub
No Language Left Unlocked: scalable backtranslation of NLLB models
☆14Aug 4, 2025Updated 11 months ago
lawl / translate
View on GitHub
☆12Mar 27, 2022Updated 4 years ago
argosopentech / LibreTranslate-cpp
View on GitHub
LibreTranslate C++ bindings
☆19Aug 27, 2021Updated 4 years ago
akashAD98 / Multilingual-RAG
View on GitHub
multilingual RAG
☆15Feb 6, 2024Updated 2 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
surafelml / Afro-NMT
View on GitHub
LOW-RESOURCE NEURAL MACHINE TRANSLATION: A BENCHMARK FOR FIVE AFRICAN LANGUAGES
☆16Jul 27, 2020Updated 6 years ago
NP-NET-research / Recursive-Semi-Markov-Model
View on GitHub
Source code for "N-ary Constituent Tree Parsing with Recursive Semi-Markov Model" published at ACL 2021
☆10May 27, 2021Updated 5 years ago
tderflinger / libretranslate-ts
View on GitHub
TypeScript library for the LibreTranslate API. With TypeScript type definitions. Can also be used with JavaScript.
☆18Mar 13, 2025Updated last year
ChenghaoMou / embeddings
View on GitHub
zero-vocab or low-vocab embeddings
☆18Jul 17, 2022Updated 4 years ago
pawel-bujnowski / smiler
View on GitHub
SMiLER - Samsung MultiLingual Entity and Relation Extraction dataset
☆18Feb 11, 2021Updated 5 years ago
fbkarsdorp / alignment
View on GitHub
Simple Python library for doing (multiple) sequence alignment
☆17Jun 24, 2018Updated 8 years ago
SapienzaNLP / unify-srl
View on GitHub
Unifying Cross-Lingual Semantic Role Labeling with Heterogeneous Linguistic Resources (NAACL-2021).
☆17Nov 18, 2021Updated 4 years ago
rossellhayes / ipa
View on GitHub
🗣️ Convert between phonetic alphabets
☆11Feb 7, 2022Updated 4 years ago
yzhangcs / crfsrl
View on GitHub
[COLING'22] Code for "Semantic Role Labeling as Dependency Parsing: Exploring Latent Tree Structures Inside Arguments".
☆61Oct 8, 2023Updated 2 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
sustcsonglin / span-based-dependency-parsing
View on GitHub
Source code of ACL2022 "Headed-Span-Based Projective Dependency Parsing" and "Combining (second-order) graph-based and headed-span-based …
☆16Jan 12, 2023Updated 3 years ago
jacobkrantz / lstm-syllabify
View on GitHub
Breaks a word into syllables using an LSTM-based neural network.
☆20Aug 14, 2023Updated 2 years ago
LibreTranslate / argos-translate-files
View on GitHub
Translate files using Argos Translate
☆38Feb 24, 2026Updated 5 months ago
tomh5905 / LOREM
View on GitHub
A Language-consistent Open Relation Extraction Model.
☆16Mar 24, 2023Updated 3 years ago
Helsinki-NLP / OPUS-MT-testsets
View on GitHub
benchmarks for evaluating MT models
☆11Jun 26, 2024Updated 2 years ago
Bartelds / asr-augmentation
View on GitHub
Making More of Little Data: Improving Low-Resource Automatic Speech Recognition Using Data Augmentation
☆18May 17, 2023Updated 3 years ago
philschmid / fine-tune-GPT-2
View on GitHub
☆21Feb 3, 2021Updated 5 years ago
omelet25 / CNNCSharp
View on GitHub
Convolutional Neural Networks
☆17Mar 19, 2015Updated 11 years ago
unprodstudio / UnityTranslate
View on GitHub
A mod designed for free live translation, inspired by the QSMP.
☆41Updated this week
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
wenkokke / dep2con
View on GitHub
several algorithms for converting dependency structures into constituency structures.
☆10Feb 7, 2022Updated 4 years ago
huggingface / hf_benchmarks
View on GitHub
A starter kit for evaluating benchmarks on the 🤗 Hub
☆18Apr 8, 2026Updated 3 months ago
argosopentech / argospm-index
View on GitHub
Argos Translate package index
☆44Jun 27, 2026Updated last month
mmcdermott / structure_inducing_pre-training
View on GitHub
☆16Sep 6, 2022Updated 3 years ago
drgriffis / text-essence
View on GitHub
Preprocessing and analysis for training SNOMED-CT concept embeddings from CORD-19 corpus
☆16Aug 4, 2023Updated 2 years ago
agricolamz / lingglosses
View on GitHub
R package that helps to render interlinear glossed linguistic examples in html rmarkdown documents and then semi-automatically compiles t…
☆17Nov 18, 2025Updated 8 months ago
agricolamz / DS_for_DH
View on GitHub
R course for DH master program in HSE
☆10Jan 17, 2022Updated 4 years ago
global-asp / asp-source
View on GitHub
Source stories from the African Storybook Project in Markdown format
☆22Jan 25, 2026Updated 6 months ago
tapilab / aaai-2021-counterfactuals
View on GitHub
☆13Jul 6, 2021Updated 5 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
UBC-NLP / serengeti
View on GitHub
SERENGETI: Massively Multilingual Language Models for Africa
☆17Oct 26, 2023Updated 2 years ago
luismsgomes / mosestokenizer
View on GitHub
☆20Oct 22, 2021Updated 4 years ago
SYSTRAN / similarity
View on GitHub
Bilingual sentence similarity classifier using Tensorflow
☆24Sep 26, 2019Updated 6 years ago
yumeng5 / RoSTER
View on GitHub
[EMNLP 2021] Distantly-Supervised Named Entity Recognition with Noise-Robust Learning and Language Model Augmented Self-Training
☆65Nov 12, 2021Updated 4 years ago
sgepigon / pho-diff
View on GitHub
Compare the phonetic inventory of two languages.
☆16Sep 5, 2018Updated 7 years ago
eric11eca / NeuralLog
View on GitHub
A neural-symbolic joint reasoning approach for Natural Language Inference (NLI). Modeling NLI as inference path planning through a search…
☆17Jun 9, 2021Updated 5 years ago
lancopku / IAIS
View on GitHub
[ACL 2021] Learning Relation Alignment for Calibrated Cross-modal Retrieval
☆34May 16, 2023Updated 3 years ago