Ermlab / polish-gec-datasetsLinks
Polish datsets for grammatical error correction
☆12Updated 2 years ago
Alternatives and similar repositories for polish-gec-datasets
Users that are interested in polish-gec-datasets are comparing it to the libraries listed below
Sorting:
- Polish RoBERTA model trained on Polish literature, Wikipedia, and Oscar. The major assumption is that quality text will give a good mode…☆35Updated 4 years ago
- Efficiently find the best-suited language model (LM) for your NLP task☆132Updated 4 months ago
- Embeddings: State-of-the-art Text Representations for Natural Language Processing tasks, an initial version of library focus on the Polis…☆36Updated 2 years ago
- ☆26Updated last month
- RoBERTa models for Polish☆89Updated 3 years ago
- A list of awesome open source projects in the machine learning field, who's developers are mainly based in Germany☆50Updated last year
- The robust European language model benchmark.☆142Updated this week
- Production-ready data processing made easy and shareable☆356Updated last year
- ☆78Updated last year
- SpanMarker for Named Entity Recognition☆462Updated 11 months ago
- Plug-and-play, zero-shot document AI.☆119Updated last week
- A python package for benchmarking interpretability techniques on Transformers.☆214Updated last year
- This is the repo for the container that holds the models for the text2vec-transformers module☆58Updated last month
- FastFit ⚡ When LLMs are Unfit Use FastFit ⚡ Fast and Effective Text Classification with Many Classes☆214Updated 2 months ago
- Generalist and Lightweight Model for Text Classification☆166Updated last week
- Evaluation of Sentence Representations in Polish☆23Updated 2 years ago
- Fine-tune ModernBERT on a large Dataset with Custom Tokenizer Training☆74Updated last month
- Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Da☆118Updated 8 months ago
- Pre-trained models and language resources for Natural Language Processing in Polish☆362Updated last year
- just a bunch of useful embeddings for scikit-learn pipelines☆520Updated 2 months ago
- A library to synthesize text datasets using Large Language Models (LLM)☆152Updated 2 years ago
- Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 lines☆196Updated last year
- ☆83Updated 3 years ago
- Simple UI for debugging correlations of text embeddings☆302Updated 6 months ago
- Notebooks for training universal 0-shot classifiers on many different tasks☆137Updated 11 months ago
- Optimus is a flexible and scalable framework built to train language models efficiently across diverse hardware configurations, including…☆67Updated last week
- ☆50Updated 3 years ago
- This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with enti…☆244Updated 2 years ago
- Custom fastapi server packaged as docker image for Huggingface inference endpoints deployment☆12Updated last year
- A versatile and powerful library designed to streamline the process of querying LLMs☆86Updated 5 months ago