Ermlab / polish-gec-datasets
Polish datasets for grammatical error correction
☆12Updated last year
Alternatives and similar repositories for polish-gec-datasets
Users who are interested in polish-gec-datasets are comparing it to the libraries listed below.
- Polish RoBERTa model trained on Polish literature, Wikipedia, and Oscar. The major assumption is that quality text will give a good mode…☆35Updated 4 years ago
- just a bunch of useful embeddings for scikit-learn pipelines☆516Updated 2 weeks ago
- SpanMarker for Named Entity Recognition☆451Updated 7 months ago
- A Simple Bulk Labelling Tool☆596Updated last month
- Embeddings: State-of-the-art Text Representations for Natural Language Processing tasks, an initial version of library focus on the Polis…☆36Updated last year
- Efficiently find the best-suited language model (LM) for your NLP task☆127Updated last month
- A framework of open-source technologies to design real-time machine learning systems☆29Updated 2 years ago
- This is the repo for the container that holds the models for the text2vec-transformers module☆55Updated 3 weeks ago
- ☆69Updated 2 months ago
- Late Interaction Models Training & Retrieval☆532Updated this week
- Evaluation of Sentence Representations in Polish☆22Updated 2 years ago
- RoBERTa models for Polish☆88Updated 3 years ago
- ☆78Updated last year
- FastFit ⚡ When LLMs are Unfit Use FastFit ⚡ Fast and Effective Text Classification with Many Classes☆211Updated 3 months ago
- A list of awesome open source projects in the machine learning field, whose developers are mainly based in Germany☆46Updated 11 months ago
- ☆50Updated 3 years ago
- Neural Search☆332Updated last year
- 🚀 This repo is a showcase of how you can use models deployed on AWS SageMaker in your Haystack Retrieval Augmented Generative AI pipelin…☆13Updated 2 years ago
- Production-ready data processing made easy and shareable☆354Updated last year
- ☆102Updated 3 weeks ago
- Optimus is a flexible and scalable framework built to train language models efficiently across diverse hardware configurations, including…☆66Updated 2 months ago
- A component orchestration engine☆28Updated last year
- Baguetter is a flexible, efficient, and hackable search engine library implemented in Python. It's designed for quickly benchmarking, imp…☆189Updated last year
- Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 lines☆197Updated last year
- Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Da☆113Updated 5 months ago
- A library that integrates huggingface transformers with the world of fastai, giving fastai devs everything they need to train, evaluate, …☆295Updated last year
- Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: …☆337Updated 2 years ago
- Neural Search☆364Updated 5 months ago
- Pre-trained models and language resources for Natural Language Processing in Polish☆351Updated last year
- Simply, faster, sentence-transformers☆143Updated last year