seznam / czech-semantic-embedding-modelsLinks
☆26Updated last year
Alternatives and similar repositories for czech-semantic-embedding-models
Users that are interested in czech-semantic-embedding-models are comparing it to the libraries listed below
Sorting:
- Inference engine for GLiNER models, in Rust☆60Updated this week
- ☆50Updated 2 years ago
- German Alpaca Dataset (Cleaned + Translated)☆25Updated 2 years ago
- ☆38Updated last year
- RoBERTa models for Polish☆87Updated 3 years ago
- German Language Understanding Evaluation Benchmark @NAACL24☆10Updated this week
- Data and code: "Answering legal questions from laymen in German civil law system", Büttner & Habernal, EACL'24☆11Updated last year
- A collection of datasets for language model pretraining including scripts for downloading, preprocesssing, and sampling.☆59Updated 10 months ago
- 🌸 fastText + Bloom embeddings for compact, full-coverage vectors with spaCy☆313Updated 2 months ago
- German Text Embedding Clustering Benchmark☆17Updated last year
- FastFit ⚡ When LLMs are Unfit Use FastFit ⚡ Fast and Effective Text Classification with Many Classes☆207Updated last month
- Simply, faster, sentence-transformers☆143Updated 10 months ago
- Augmenty is an augmentation library based on spaCy for augmenting texts.☆156Updated last year
- ☆10Updated last year
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality …☆106Updated last year
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o…☆137Updated last month
- A Python library aimed at dissecting and augmenting NER training data.☆58Updated 2 years ago
- ✂️ Sentence segmentation with wtpsplit's state-of-the-art Segment any Text (SaT) models☆13Updated last week
- An integration of Qdrant ANN vector database backend with txtai☆24Updated 10 months ago
- Embeddings: State-of-the-art Text Representations for Natural Language Processing tasks, an initial version of library focus on the Polis…☆36Updated last year
- The robust European language model benchmark.☆106Updated this week
- Shared BERT model for 4 languages of Bulgarian, Czech, Polish and Russian. Slavic NER model.☆76Updated 3 years ago
- Evaluation of Sentence Representations in Polish☆22Updated 2 years ago
- Materials for "IT5: Large-scale Text-to-text Pretraining for Italian Language Understanding and Generation" 🇮🇹☆30Updated last year
- Generalist and Lightweight Model for Text Classification☆134Updated 2 weeks ago
- Trainable embedding transformation for confidence estimation, feature extraction, explainability and conversion from dense to sparse.☆22Updated 2 weeks ago
- 💬 Language Identification with Support for More Than 2000 Labels -- EMNLP 2023☆138Updated 3 weeks ago
- German small and large versions of GPT2.☆20Updated 3 years ago
- Polish data.☆11Updated 3 weeks ago
- Optimus is a flexible and scalable framework built to train language models efficiently across diverse hardware configurations, including…☆56Updated 2 months ago