Ermlab / polish-gec-datasetsLinks
Polish datsets for grammatical error correction
☆12Updated 2 years ago
Alternatives and similar repositories for polish-gec-datasets
Users that are interested in polish-gec-datasets are comparing it to the libraries listed below
Sorting:
- Polish RoBERTA model trained on Polish literature, Wikipedia, and Oscar. The major assumption is that quality text will give a good mode…☆35Updated 4 years ago
 - Infographic about the inner computations of a transformer model, training and inference☆86Updated last year
 - A list of awesome open source projects in the machine learning field, who's developers are mainly based in Germany☆48Updated last year
 - Embeddings: State-of-the-art Text Representations for Natural Language Processing tasks, an initial version of library focus on the Polis…☆36Updated last year
 - just a bunch of useful embeddings for scikit-learn pipelines☆518Updated last month
 - Production-ready data processing made easy and shareable☆353Updated last year
 - Efficiently find the best-suited language model (LM) for your NLP task☆127Updated 3 months ago
 - SpanMarker for Named Entity Recognition☆460Updated 9 months ago
 - A Simple Bulk Labelling Tool☆597Updated 3 months ago
 - A framework of open-source technologies to design real-time machine learning systems☆29Updated 2 years ago
 - Evaluation of Sentence Representations in Polish☆23Updated 2 years ago
 - A python package for benchmarking interpretability techniques on Transformers.☆212Updated last year
 - Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Da☆117Updated 7 months ago
 - Original transformer paper: Implementation of Vaswani, Ashish, et al. "Attention is all you need." Advances in neural information process…☆241Updated last year
 - Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 lines☆195Updated last year
 - Late Interaction Models Training & Retrieval☆632Updated this week
 - ☆50Updated 3 years ago
 - Starter pack for NeurIPS LLM Efficiency Challenge 2023.☆126Updated 2 years ago
 - Enterprise Scale NLP with Hugging Face & SageMaker Workshop series☆240Updated 2 years ago
 - A repository that showcases how you can use ZenML with Git☆71Updated 2 months ago
 - Open-Source Information Retrieval Courses @ TU Wien☆688Updated 2 years ago
 - A library for detecting problematic data segments in structured and unstructured data with few lines of code.☆64Updated last year
 - Web App for generating synthetic data☆48Updated last year
 - Deliver safe & effective language models☆545Updated last week
 - A library that integrates huggingface transformers with the world of fastai, giving fastai devs everything they need to train, evaluate, …☆296Updated last year
 - Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: …☆338Updated 2 years ago
 - ☆77Updated last year
 - A library to synthesize text datasets using Large Language Models (LLM)☆151Updated 2 years ago
 - This package features data-science related tasks for developing new recognizers for Presidio. It is used for the evaluation of the entire…☆240Updated last month
 - FastFit ⚡ When LLMs are Unfit Use FastFit ⚡ Fast and Effective Text Classification with Many Classes☆212Updated last month