jvalhondo / spanish-names-surnamesLinks
Data set with Spanish names and surnames
☆33Updated 2 years ago
Alternatives and similar repositories for spanish-names-surnames
Users that are interested in spanish-names-surnames are comparing it to the libraries listed below
Sorting:
- The World Atlas of Language Structures☆61Updated 9 months ago
- The curation repository for the data behind Concepticon.☆39Updated this week
- Python package for stylometry☆63Updated 4 years ago
- Code and models for our CLEF-HIPE (Named Entity Processing on Historical Newspapers) submissions☆19Updated 2 years ago
- A multilingual parallel corpus created from translations of the Bible.☆182Updated 2 months ago
- A cloud-based, open-source system for writing and publishing dictionaries.☆93Updated last year
- Metaphor detection using NLP techniques, made in Python using NLTK☆18Updated 11 years ago
- A Knowledge Base for research software relying on large-scale text mining and curated knowledge sources☆16Updated 2 years ago
- Labeled segmentation for the document structure of printed books☆14Updated 7 years ago
- Use spaCy for NLP and output to the FoLiA XML format.☆12Updated last year
- Explore your own text collection with a topic model – without prior knowledge.☆63Updated 6 months ago
- Treex NLP framework☆32Updated 2 weeks ago
- Homebase of the IPTC EXTRA project about rule-based text categorization☆13Updated 8 years ago
- Custom French POS and lemmatizer based on Lefff for spacy☆68Updated 2 years ago
- Wrapper for DKPro Core to extract lingustic information from books.☆16Updated 3 years ago
- The Wikinflection Corpus, from the paper "Wikinflection Corpus: A (Better) Multilingual, Morpheme-Annotated Inflectional Corpus" (Metheni…☆12Updated last year
- End to end human text analysis package, specifically suited for social media and social scientific applications. It is written in Python …☆124Updated last month
- a python package for cleaning Gutenberg books and dataset☆34Updated 2 months ago
- Python for Linguists – a Gentle Introduction to Programming☆45Updated 9 years ago
- A list of resources for conservation, development, and documentation of endangered, minority, and low or under-resourced human languages.☆34Updated 2 years ago
- Miscellaneous scripts to gather and process data of wikis.☆21Updated 2 years ago
- The Broad Twitter Corpus, an NER dataset in English stratified for time, location, social media genre, socioeconomic factors (COLING 2016…☆68Updated 3 years ago
- Python 3 library for processing historical English☆67Updated 11 months ago
- A gold-standard dataset of software mentions in research publications.☆37Updated last year
- Analysis of gutenberg dataset☆45Updated 6 years ago
- UD Greek☆21Updated last month
- TextComplexityDE dataset consists of 1000 sentences in the German language with subjective complexity rating, collected from German learn…☆13Updated 3 years ago
- linguistics backend☆41Updated 2 years ago
- Lexicons for the Multilingual UCREL Semantic Analysis System☆43Updated last year
- Religious Hate Speech Detection for Arabic Tweets☆24Updated 6 years ago