CLARIN-PL / embeddings
Embeddings: State-of-the-art Text Representations for Natural Language Processing tasks, an initial version of library focus on the Polish Language
☆36Updated last year
Alternatives and similar repositories for embeddings:
Users that are interested in embeddings are comparing it to the libraries listed below
- This is the way: designing and compiling LEPISZCZE, a comprehensive NLP benchmark for Polish☆13Updated last year
- RoBERTa models for Polish☆86Updated 3 years ago
- Fine-tuning scripts for evaluating transformer-based models on KLEJ benchmark.☆26Updated last year
- A Python library aimed at dissecting and augmenting NER training data.☆58Updated last year
- Tool for named entity recognition for Polish based on deep learning.☆31Updated 2 years ago
- Late Interaction Models Training & Retrieval☆264Updated last week
- Datasets collection and preprocessings framework for NLP extreme multitask learning☆176Updated 2 months ago
- NLP with Rust for Python 🦀🐍☆61Updated 10 months ago
- Polish RoBERTA model trained on Polish literature, Wikipedia, and Oscar. The major assumption is that quality text will give a good mode…☆34Updated 3 years ago
- Evaluation of Sentence Representations in Polish☆22Updated 2 years ago
- Pre-train Static Word Embeddings☆51Updated 3 weeks ago
- Polish datsets for grammatical error correction☆12Updated last year
- FastFit ⚡ When LLMs are Unfit Use FastFit ⚡ Fast and Effective Text Classification with Many Classes☆191Updated 5 months ago
- A monolingual and cross-lingual meta-embedding generation and evaluation framework☆80Updated 2 years ago
- A python package for benchmarking interpretability techniques on Transformers.☆213Updated 6 months ago
- Robust and fast topic models with sentence-transformers.☆48Updated 2 weeks ago
- Bi-encoder entity linking architecture☆44Updated 6 months ago
- ☆67Updated 7 months ago
- Generalist and Lightweight Model for Text Classification☆110Updated this week
- Efficiently find the best-suited language model (LM) for your NLP task☆121Updated this week
- ☆11Updated 4 years ago
- ☆78Updated 2 years ago
- XTR/WARP is an extremely fast and accurate retrieval engine based on Stanford's ColBERTv2/PLAID and Google DeepMind's XTR.☆121Updated 5 months ago
- Code and data accompanying the paper "Approaching nested named entity recognition with parallel LSTM-CRFs."☆26Updated 2 years ago
- Chunk your text using gpt4o-mini more accurately☆44Updated 7 months ago
- 🤗 Disaggregators: Curated data labelers for in-depth analysis.☆65Updated 2 years ago
- Library that contains implementations of machine learning components in the hyperbolic space☆134Updated 11 months ago
- Weakly Supervised End-to-End Learning (NeurIPS 2021)☆156Updated 2 years ago
- Generalised Contrastive Learning. This is a Repository for Google Shopping Dataset and Benchmarks followed by our novel fine-grained cont…☆58Updated last month
- Source code and data for Like a Good Nearest Neighbor☆28Updated 2 months ago