AndyTheFactory / romanian-nlp-datasetsLinks
A list of Romanian NLP Datasets
☆49Updated 4 months ago
Alternatives and similar repositories for romanian-nlp-datasets
Users that are interested in romanian-nlp-datasets are comparing it to the libraries listed below
Sorting:
- This repo is the home of Romanian Transformers.☆104Updated 2 years ago
- Romanian WordNet (Data + API for Python)☆52Updated 4 years ago
- Romanian Named Entity Corpus (RONEC) version 2.0☆65Updated 2 years ago
- A list of Natural Language Processing Tools for Romanian☆31Updated 4 years ago
- The original transformer implementation from scratch. It contains informative comments on each block☆35Updated last year
- The Greek NLP toolkit for Python. Supports NER/DP/POS Tagging/Greeklish-to-Greek Transliteration. Visit the playground here: https://hugg…☆69Updated last week
- SpanMarker for Named Entity Recognition☆438Updated 6 months ago
- A novel dataset for emotion detection from Romanian text.☆20Updated 4 months ago
- Pre-trained models and language resources for Natural Language Processing in Polish☆345Updated last year
- A blueprint for creating Pretraining and Fine-Tuning datasets for Indic languages☆107Updated 9 months ago
- Curated list of open-access/open-source/off-the-shelf resources and tools developed with a particular focus on German☆490Updated 8 months ago
- Toolkit for attaching, training, saving and loading of new heads for transformer models☆282Updated 4 months ago
- A curated collection of resources and repositories for Natural Language Processing (NLP) tasks specific to Darija, the Moroccan Arabic di…☆85Updated last year
- Serbian LLM Eval.☆96Updated last year
- Well documented, unit tested, type checked and formatted implementation of a vanilla transformer - for educational purposes.☆253Updated last year
- Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts) @ NAACL 2024☆2,159Updated this week
- Happy Transformer makes it easy to fine-tune and perform inference with NLP Transformer models.☆536Updated 2 months ago
- A list of awesome open source projects in the machine learning field, who's developers are mainly based in Germany☆43Updated 10 months ago
- Toolkit to segment text into sentences or other semantic units in a robust, efficient and adaptable way.☆1,091Updated 3 weeks ago
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free☆232Updated 8 months ago
- Efficiently find the best-suited language model (LM) for your NLP task☆123Updated last week
- ☆673Updated 2 months ago
- Clustering sentence embeddings to extract message intent☆174Updated 3 years ago
- A collection of datasets and tasks for legal machine learning☆388Updated last year
- Named Entity Recognition (NER) Annotation tool for SpaCy. Generates Traning Data as a JSON which can be readily used.☆582Updated 4 months ago
- ☆42Updated last month
- A Python library for calculating a large variety of metrics from text☆341Updated 7 months ago
- A Greek edition of BERT pre-trained language model☆147Updated 11 months ago
- pre-trained Language Models☆306Updated 2 months ago
- Translation models for 22 scheduled languages of India☆331Updated 2 months ago