AndyTheFactory / romanian-nlp-datasetsLinks
A list of Romanian NLP Datasets
☆49Updated 4 months ago
Alternatives and similar repositories for romanian-nlp-datasets
Users that are interested in romanian-nlp-datasets are comparing it to the libraries listed below
Sorting:
- This repo is the home of Romanian Transformers.☆103Updated 2 years ago
- Romanian Semantic Textual Similarity Dataset☆16Updated 2 years ago
- A novel dataset for emotion detection from Romanian text.☆20Updated 4 months ago
- A list of Natural Language Processing Tools for Romanian☆31Updated 4 years ago
- Romanian WordNet (Data + API for Python)☆52Updated 4 years ago
- Romanian Named Entity Corpus (RONEC) version 2.0☆65Updated 2 years ago
- Efficiently find the best-suited language model (LM) for your NLP task☆124Updated 3 weeks ago
- SpanMarker for Named Entity Recognition☆434Updated 5 months ago
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free☆231Updated 7 months ago
- The robust European language model benchmark.☆106Updated this week
- Happy Transformer makes it easy to fine-tune and perform inference with NLP Transformer models.☆536Updated 2 months ago
- Toolkit for attaching, training, saving and loading of new heads for transformer models☆280Updated 3 months ago
- Romanian Word Embeddings. Here you can find pre-trained corpora of word embeddings. Current methods: CBOW, Skip-Gram, Fast-Text (from Gen…☆12Updated 2 months ago
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency☆164Updated 3 weeks ago
- Clustering sentence embeddings to extract message intent☆174Updated 3 years ago
- The original transformer implementation from scratch. It contains informative comments on each block☆35Updated last year
- Resources for Faculty of Mathematics and Computer Science, University of Bucharest.☆14Updated 10 months ago
- A Python library for calculating a large variety of metrics from text☆340Updated 6 months ago
- A very simple news crawler with a funny name☆389Updated last week
- Named Entity Recognition for Romanian, based on transformer models☆13Updated 3 years ago
- Neural based model for automatic diacritics restoration.☆25Updated 6 years ago
- A list of awesome open source projects in the machine learning field, who's developers are mainly based in Germany☆42Updated 9 months ago
- ☆205Updated last year
- Toolkit to segment text into sentences or other semantic units in a robust, efficient and adaptable way.☆1,071Updated this week
- The Greek NLP toolkit for Python. Supports NER/DP/POS Tagging/Greeklish-to-Greek Transliteration. Visit the playground here: https://hugg…☆68Updated 5 months ago
- ☆210Updated 11 months ago
- SKILLSPAN: Competences as Spans for Skill Extraction from Job Postings☆63Updated 4 months ago
- BERT classification model for processing texts longer than 512 tokens. Text is first divided into smaller chunks and after feeding them t…☆142Updated last year
- TweetNLP for all the NLP enthusiasts working on Twitter! The Python library tweetnlp provides a collection of useful tools to analyze/und…☆350Updated 2 months ago
- Fine-tune Mistral 7B to generate fashion style suggestions☆34Updated last year