Natural language detection, Java bindings for CLD2
☆17Feb 26, 2026Updated 3 weeks ago
Alternatives and similar repositories for language-detection-cld2
Users who are interested in language-detection-cld2 are comparing it to the libraries listed below
- A neural dependency parser that does its best☆16Mar 6, 2026Updated 2 weeks ago
- Repository accompanying "An Open Dataset and Model for Language Identification" (Burchell et al., 2023)☆74Apr 1, 2025Updated 11 months ago
- Targeted language identifier, based on FastText and Hunspell.☆38Sep 4, 2025Updated 6 months ago
- The EHRI project's portal interface.☆15Mar 9, 2026Updated last week
- Load, build and explore Patstat using the Google Cloud Platform☆10Jan 19, 2019Updated 7 years ago
- Specification of a stand-off element for the TEI guidelines☆12Apr 29, 2021Updated 4 years ago
- Backend, IA-specific tools for crawling and processing the scholarly web. Content ends up in https://fatcat.wiki☆28Jul 31, 2024Updated last year
- ☆14Sep 6, 2023Updated 2 years ago
- Machine learning software for extracting astronomical entities from scholarly documents☆10Oct 31, 2022Updated 3 years ago
- WindSR Dataset contains more than 22,000 pairs of HR/LR wind speed images, which are processed using NASA's GEOS-5 Nature Run dataset…☆12Jan 18, 2024Updated 2 years ago
- Rust wrapper for the cld2 language detection library.☆16Nov 28, 2017Updated 8 years ago
- A browser extension providing Open Access bibliographical services☆18Dec 9, 2022Updated 3 years ago
- Get a computer to write regex for you. A front-end for grex (https://github.com/pemistahl/grex).☆12Sep 8, 2022Updated 3 years ago
- A radix tree implementation☆15Sep 22, 2022Updated 3 years ago
- A Knowledge Base for research software relying on large-scale text mining and curated knowledge sources☆17May 14, 2023Updated 2 years ago
- ISTEX button: a web extension that can dynamically insert, on the web page being viewed, a link to the full text of a document if the latter…☆11May 30, 2023Updated 2 years ago
- Quarkus extension that allows proper usage of Neo4j-OGM inside Quarkus.☆13Updated this week
- Rust port of TLSH☆14Oct 12, 2025Updated 5 months ago
- A small python library to parse and write TSV files generated by the WebAnno software.☆11Apr 14, 2025Updated 11 months ago
- Konversation is a tool to generate rich and diversified responses to the user of a voice application.☆12Mar 31, 2020Updated 5 years ago
- A utility to read and write PDFs with Python☆11Apr 28, 2022Updated 3 years ago
- ☆15Dec 18, 2023Updated 2 years ago
- A powerful, tagset-independent and theory-neutral meta model and API for storing, manipulating, and representing nearly all types of ling…☆15Mar 27, 2023Updated 2 years ago
- Meet Rustacean GPT, an experimental project transforming OpenAI's GPT into a helpful, autonomous software engineer to support senior deve…☆14May 10, 2023Updated 2 years ago
- Common NLP (text mining) tools for materials science and chemistry, for groups at Lawrence Berkeley National Lab (LBNL) and beyond.☆22Jul 8, 2022Updated 3 years ago
- A collection of Flink applications for working with Pravega streams☆12Dec 20, 2022Updated 3 years ago
- Concept Representation (Embedding) and Semantic Relatedness☆15Jul 3, 2019Updated 6 years ago
- An asynchronous concurrent pipeline for classifying Common Crawl based on fastText's pipeline.☆86Apr 21, 2021Updated 4 years ago
- A Go library for configuration management with JSON files in remote storage.☆14Jan 27, 2026Updated last month
- Repo for the paper publishing the superconductor database with 3D crystal structures.☆24Nov 21, 2024Updated last year
- Superconductors material dataset☆27Dec 5, 2023Updated 2 years ago
- LimeSoup is a package to parse HTML or XML papers from different publishers.☆20Jan 4, 2021Updated 5 years ago
- Grobid module for superconductor material and properties extraction☆22May 17, 2025Updated 10 months ago
- WiNER-fr is a free named entity corpus using French Wikinews texts.☆17Feb 12, 2021Updated 5 years ago
- A git pre-receive hook to prevent large files from being committed to your repository☆26Aug 6, 2017Updated 8 years ago
- Ted is a line oriented text editor and formatter☆12Jun 29, 2020Updated 5 years ago
- Node.js bindings for the tantivy search engine☆17Mar 21, 2019Updated 7 years ago
- This repo is a curated list of places I consider for weekends in Athens with my kid.☆11Dec 19, 2021Updated 4 years ago
- Development repository for the maven cookbook☆36Mar 2, 2026Updated 2 weeks ago