modelpredict/language-identification-survey

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/modelpredict/language-identification-survey)

modelpredict / language-identification-survey

Live survey of off-the-shelf language identification tools for python

☆27

Alternatives and similar repositories for language-identification-survey

Users that are interested in language-identification-survey are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

commoncrawl / ia-web-commons
View on GitHub
Web archiving utility library
☆11Updated this week
mingruimingrui / fast-mosestokenizer
View on GitHub
c++ mosestokenizer
☆18Mar 13, 2024Updated 2 years ago
sshleifer / backtranslated-imdb
View on GitHub
Backtranslations of IMDB movie reviews for Data Augmentation Purposes
☆10Apr 1, 2019Updated 7 years ago
continuum-llms / acad-gpt
View on GitHub
A Discord Bot for distilling papers, GitHub repos, Blogposts, and much more using the power of LLMs and vector search.
☆13May 3, 2023Updated 3 years ago
EnnengYang / Efficient-WEMoE
View on GitHub
Efficient and Effective Weight-Ensembling Mixture of Experts for Multi-Task Model Merging. Arxiv, 2024.
☆16Oct 28, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
neuml / staticvectors
View on GitHub
🔢 Work with static vector models
☆39Apr 21, 2025Updated last year
NX-AI / xlstm_scaling_laws
View on GitHub
Code and data to explore neural scaling laws of xLSTM and Transformer models.
☆23Apr 8, 2026Updated 3 months ago
ntunlp / Zero-Shot-Cross-Lingual-NER
View on GitHub
A repository for our AAAI-2020 Cross-lingual-NER paper. Code will be updated shortly.
☆47Dec 2, 2022Updated 3 years ago
dejan94it / cc_Rtools
View on GitHub
This plugin allows the Cheshire Cat to use tools written in R language
☆10Dec 23, 2024Updated last year
ELS-RD / anonymisation
View on GitHub
Anonymization of legal cases (Fr) based on Flair embeddings
☆89Dec 9, 2020Updated 5 years ago
Tikquuss / meta_XLM
View on GitHub
Cross-lingual Language Model (XLM) pretraining and Model-Agnostic Meta-Learning (MAML) for fast adaptation of deep networks
☆20Mar 26, 2021Updated 5 years ago
bicici / FDA
View on GitHub
Feature Decay Algorithms
☆11Mar 5, 2014Updated 12 years ago
anakin87 / llama2-haystack
View on GitHub
Using Llama2 with Haystack, the NLP/LLM framework.
☆16Jul 21, 2023Updated 3 years ago
nlp-stat-test / nlp-stat-test
View on GitHub
The NLPStatTest project
☆12Mar 12, 2022Updated 4 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
aboSamoor / pycld2
View on GitHub
☆179Mar 28, 2025Updated last year
ffex / rust-boy
View on GitHub
A Rust journey into Game Boy dev
☆18May 8, 2026Updated 2 months ago
utahnlp / DirectProbe
View on GitHub
☆21Oct 15, 2022Updated 3 years ago
avacaondata / nlpboost
View on GitHub
Python library for automatic training, optimization and comparison of Transformer models on most NLP tasks.
☆20May 6, 2023Updated 3 years ago
shyyhs / CourseraParallelCorpusMining
View on GitHub
Coursera Corpus Mining and Multistage Fine-Tuning for Improving Lectures Translation
☆15Aug 27, 2024Updated last year
cyrilou242 / learning-lightnr
View on GitHub
Generate multiple choice fill-in-the-blank questions from any article.
☆13Dec 8, 2022Updated 3 years ago
thevasudevgupta / transformers-adapters
View on GitHub
This repositary hosts my experiments for the project, I did with OffNote Labs.
☆10Apr 12, 2021Updated 5 years ago
yfqiu-nlp / swirl
View on GitHub
Materials for paper "Self-improving World Modelling with Latent Actions"
☆20Feb 5, 2026Updated 5 months ago
RLDiary / Wordle-GRPO
View on GitHub
A $100 Agent - Reinforcement tuning a language model to play the game of Wordle
☆18Jul 14, 2025Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
frozentoad9 / CMST
View on GitHub
Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages
☆13Oct 12, 2022Updated 3 years ago
shyamupa / biling-survey
View on GitHub
scripts and data for ACL 16 paper
☆14Jul 5, 2016Updated 10 years ago
Calysto / calysto_bash
View on GitHub
A Calysto Bash Kernel
☆18Apr 30, 2024Updated 2 years ago
dagnelies / pysos
View on GitHub
Python Simple Object Storage - provides a list and dictionary interface that seamlessly stores data in a file, like a simplified database…
☆60Jan 31, 2023Updated 3 years ago
GoFigure-LANL / VisHash
View on GitHub
Visual Hash for matching copies of visually similar images.
☆16Mar 17, 2025Updated last year
ruotianluo / lmdbdict
View on GitHub
A simple wrapper for lmdb. Support dict-like operations.
☆23Apr 20, 2023Updated 3 years ago
mmas / docker-scrapy-tor
View on GitHub
Scrapy environment with Tor for anonymous ip routing and Privoxy for http proxy
☆20Jul 5, 2016Updated 10 years ago
polm / ipadic-py
View on GitHub
IPAdic packaged for easy use from Python.
☆24Oct 31, 2021Updated 4 years ago
PythonBiellaGroup / ModernDataEngineering
View on GitHub
Modern Data Engineering Project
☆12Jun 3, 2022Updated 4 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
microsoft / factored-segmenter
View on GitHub
Unsupervised factor-based text tokenizer for natural-language processing applications
☆17Jul 24, 2020Updated 6 years ago
kamigaito / SLAHAN
View on GitHub
SLAHAN is an implementation of Kamigaito et al., 2020, "Syntactically Look-A-Head Attention Network for Sentence Compression", In Proc. o…
☆17Jan 27, 2021Updated 5 years ago
conradj / pocket-public-archive
View on GitHub
statically generated weekly digest of articles read in Pocket
☆10May 14, 2019Updated 7 years ago
yanshanjing / learning-from-imbalanced-classes
View on GitHub
Learning From Imbalanced Classes
☆14Aug 25, 2016Updated 9 years ago
buhrmi / nuxt-url-sync
View on GitHub
A nuxt module to expose Vuex state in the browser URL for easy sharing
☆12Aug 28, 2017Updated 8 years ago
UKPLab / iclr2024-model-merging
View on GitHub
This is the repository for "Model Merging by Uncertainty-Based Gradient Matching", ICLR 2024.
☆31May 15, 2024Updated 2 years ago
briankoser / web-typography-css
View on GitHub
A stylesheet based on Richard Rutter's book Web Typography.
☆10Dec 6, 2018Updated 7 years ago