Ermlab / polish-gec-datasets
Polish datasets for grammatical error correction
☆12Updated last year
Alternatives and similar repositories for polish-gec-datasets
Users who are interested in polish-gec-datasets compare it to the libraries listed below
- Embeddings: State-of-the-art Text Representations for Natural Language Processing tasks, an initial version of library focus on the Polis…☆36Updated last year
- Polish RoBERTA model trained on Polish literature, Wikipedia, and Oscar. The major assumption is that quality text will give a good mode…☆35Updated 4 years ago
- A Simple Bulk Labelling Tool☆592Updated 6 months ago
- Efficiently find the best-suited language model (LM) for your NLP task☆123Updated last week
- Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Da☆105Updated 3 months ago
- Just a bunch of useful embeddings for scikit-learn pipelines☆503Updated 3 months ago
- A library for detecting problematic data segments in structured and unstructured data with few lines of code.☆64Updated last year
- A list of awesome open source projects in the machine learning field, whose developers are mainly based in Germany☆43Updated 10 months ago
- Evaluation of Sentence Representations in Polish☆22Updated 2 years ago
- SpanMarker for Named Entity Recognition☆438Updated 6 months ago
- A versatile and powerful library designed to streamline the process of querying LLMs☆86Updated last month
- Generalist and Lightweight Model for Text Classification☆140Updated last month
- Production-ready data processing made easy and shareable☆354Updated last year
- A python package for benchmarking interpretability techniques on Transformers.☆213Updated 9 months ago
- Simple UI for debugging correlations of text embeddings☆288Updated last month
- Active Learning for Text Classification in Python☆618Updated this week
- A library for working with prompt templates locally or on the Hugging Face Hub.☆47Updated 4 months ago
- This is the repo for the container that holds the models for the text2vec-transformers module☆51Updated last week
- Baguetter is a flexible, efficient, and hackable search engine library implemented in Python. It's designed for quickly benchmarking, imp…☆185Updated 10 months ago
- Late Interaction Models Training & Retrieval☆504Updated this week
- This repository contains the code for dataset curation and finetuning of instruct variant of the Bilingual OpenHathi model. The resultin…☆23Updated last year
- FastFit ⚡ When LLMs are Unfit Use FastFit ⚡ Fast and Effective Text Classification with Many Classes☆208Updated 2 months ago
- A library to synthesize text datasets using Large Language Models (LLM)☆152Updated 2 years ago
- ☆78Updated last year
- ☆50Updated 2 years ago
- Generalist and Lightweight Model for Relation Extraction (Extract any relationship types from text)☆227Updated last month
- Notebooks for training universal 0-shot classifiers on many different tasks☆131Updated 6 months ago
- FSDL 2021 course project - Active Learning in NLP☆5Updated 7 months ago
- ☆99Updated last year
- Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 lines☆197Updated last year