Ermlab / polish-gec-datasets
Polish datasets for grammatical error correction
☆12 Updated last year
Alternatives and similar repositories for polish-gec-datasets:
Users who are interested in polish-gec-datasets are comparing it to the repositories listed below:
- Polish RoBERTa model trained on Polish literature, Wikipedia, and Oscar. The major assumption is that quality text will give a good mode… ☆34 Updated 3 years ago
- Embeddings: State-of-the-art Text Representations for Natural Language Processing tasks, an initial version of library focus on the Polis… ☆36 Updated last year
- Efficiently find the best-suited language model (LM) for your NLP task ☆114 Updated 2 weeks ago
- ☆50 Updated 2 years ago
- A versatile and powerful library designed to streamline the process of querying LLMs ☆77 Updated last month
- ☆64 Updated 8 months ago
- ☆79 Updated last month
- Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Da ☆91 Updated last month
- A list of awesome open source projects in the machine learning field, whose developers are mainly based in Germany ☆42 Updated 4 months ago
- This is the way: designing and compiling LEPISZCZE, a comprehensive NLP benchmark for Polish ☆13 Updated last year
- Polish BERT ☆70 Updated 4 years ago
- A Python package for benchmarking interpretability techniques on Transformers. ☆213 Updated 4 months ago
- ☆77 Updated 8 months ago
- Late Interaction Models Training & Retrieval ☆229 Updated last week
- A monolingual and cross-lingual meta-embedding generation and evaluation framework ☆80 Updated 2 years ago
- RAGElo is a set of tools that helps you select the best RAG-based LLM agents by using an Elo ranker ☆106 Updated last month
- Instruct-tune LLaMA on consumer hardware ☆21 Updated last year
- FastFit ⚡ When LLMs are Unfit Use FastFit ⚡ Fast and Effective Text Classification with Many Classes ☆182 Updated 3 months ago
- This is the repo for the container that holds the models for the text2vec-transformers module ☆43 Updated this week
- Benchmark study on LanceDB, an embedded vector DB, for full-text search and vector search ☆22 Updated last year
- (K3IM) Keras 3 Image Models ☆18 Updated 11 months ago
- HerBERT is a BERT-based Language Model trained on Polish Corpora using only MLM objective with dynamic masking of whole words. ☆67 Updated 2 years ago
- Unattended Lightweight Text Classifiers with LLM Embeddings ☆182 Updated 4 months ago
- Robust and fast topic models with sentence-transformers. ☆42 Updated 3 weeks ago
- Starter pack for NeurIPS LLM Efficiency Challenge 2023. ☆123 Updated last year
- Command Line Interface for Hugging Face Inference Endpoints ☆67 Updated 9 months ago
- Fastai community entry to 2020 Reproducibility Challenge ☆17 Updated 2 years ago
- Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 lines ☆196 Updated 8 months ago
- A library to synthesize text datasets using Large Language Models (LLM) ☆151 Updated 2 years ago
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models. ☆104 Updated 8 months ago