Code for detecting the language of text in Python using fastText
☆13May 25, 2020Updated 5 years ago
Alternatives and similar repositories for language-identification
Users that are interested in language-identification are comparing it to the libraries listed below.
- Easy to download and parse version of the Smartdoc 2015 - Challenge 1 dataset.☆14Mar 5, 2018Updated 8 years ago
- ANYKS Spell-Checker☆19Jan 3, 2023Updated 3 years ago
- ☆12Nov 25, 2018Updated 7 years ago
- Lucene open-domain QA retrieval in python☆11Feb 18, 2021Updated 5 years ago
- Russian words synonyms and antonyms☆11Dec 7, 2021Updated 4 years ago
- Fast search index for SPLADE sparse retrieval models implemented in Python using Numpy and Numba☆37Oct 16, 2025Updated 5 months ago
- Package for word stress detection☆11Jan 27, 2023Updated 3 years ago
- A parallel evaluation data set of SAP software documentation with document structure annotation☆14Jul 30, 2025Updated 7 months ago
- Support material for the Predictive Analysis course of the Bachelor's degree in Analytics (Data Science) at ITBA☆14Mar 4, 2026Updated 3 weeks ago
- COMET for African languages☆11Jan 24, 2025Updated last year
- LaTeX snippets for Sublime Text.☆10Sep 20, 2017Updated 8 years ago
- Coursera Corpus Mining and Multistage Fine-Tuning for Improving Lectures Translation☆15Aug 27, 2024Updated last year
- I modified some code of K-BERT so that it can be fit to English datasets☆11Dec 15, 2022Updated 3 years ago
- Specialization of BERT architecture both for the Spanish language and the Twitter domain☆13Nov 6, 2020Updated 5 years ago
- TVRecap: A Dataset for Generating Stories with Character Descriptions☆21Jun 5, 2023Updated 2 years ago
- This repository contains all the tools we are working with related to Chequeabot's ecosystem.☆15May 27, 2025Updated 10 months ago
- ☆13Dec 7, 2022Updated 3 years ago
- Replication of "Regularizing and Optimizing LSTM Language Models" by Merity et al. (2017).☆12Sep 17, 2019Updated 6 years ago
- ☆24Apr 15, 2021Updated 4 years ago
- Reimplementation of paper using gzip + knn for text classification☆18Aug 1, 2023Updated 2 years ago
- Provinces and departments of Argentina rendered with Leafletjs.com☆17Feb 23, 2019Updated 7 years ago
- Python module providing a Cython implementation of the classic Louvain algorithm for graph clustering☆15Aug 8, 2019Updated 6 years ago
- Transformer Implementation for NMT using PyTorch Lightning (Korean to English)☆10Oct 19, 2020Updated 5 years ago
- FusionBrain Challenge 2.0: creating multimodal multitask model☆16Oct 29, 2022Updated 3 years ago
- ☆12Aug 9, 2021Updated 4 years ago
- A dataset of news headlines for detecting causalities☆14May 9, 2022Updated 3 years ago
- A collection of textual datasets in Hausa language and the corresponding translation in English language.☆16Mar 5, 2021Updated 5 years ago
- A PySimpleGUI based text and code editor☆14Oct 6, 2019Updated 6 years ago
- Code and Data release for "Improving Multilingual Translation by Representation and Gradient Regularization" (Yang et al. EMNLP 2021), an…☆13Aug 12, 2024Updated last year
- Jojajovai Guarani-Spanish Parallel Corpus☆19Jul 5, 2022Updated 3 years ago
- Rough codebase for exploring initialization strategies for new word embeddings in pretrained LMs☆19Dec 10, 2021Updated 4 years ago
- The official repository of the Eesen project☆12Jun 20, 2018Updated 7 years ago
- A curated list of awesome sentiment analysis studies, in which attitude corresponds to the text position conveyed by Subject towards othe…☆19Jan 19, 2025Updated last year
- Telegram bot with options to receive text and response with ChatGPT and also parse voice and response in the same language.☆18Mar 6, 2023Updated 3 years ago
- Deep learning for named entity recognition on CoNLL-2003☆10Dec 23, 2016Updated 9 years ago
- ☆17Jun 12, 2020Updated 5 years ago
- Hack and Tell @ Saarland University☆19Dec 11, 2017Updated 8 years ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Feb 15, 2024Updated 2 years ago
- ☆14Oct 3, 2025Updated 5 months ago