piegu / language-models
Pre-trained language models
☆280 Updated 2 weeks ago
Related projects:
- Clustering sentence embeddings to extract message intent ☆166 Updated 2 years ago
- Set of vectorizers that extract keyphrases with part-of-speech patterns from a collection of text documents and convert them into a docum… ☆251 Updated 4 months ago
- SpanMarker for Named Entity Recognition ☆384 Updated last month
- A Natural Portuguese Language Benchmark (Napolab) for the evaluation of language models. ☆63 Updated 3 weeks ago
- A Python library for calculating a large variety of metrics from text ☆309 Updated this week
- This repository contains an easy and intuitive approach to few-shot classification using sentence-transformers or spaCy models, or zero-s… ☆208 Updated 3 months ago
- ☆323 Updated 9 months ago
- Code for training and evaluating T5 on Portuguese data. ☆84 Updated last year
- BERT classification model for processing texts longer than 512 tokens. Text is first divided into smaller chunks and after feeding them t… ☆113 Updated 3 months ago
- All the go-to functions you need to handle NLP use-cases, integrated in NLPretext ☆139 Updated 5 months ago
- List of resources and tools developed with focus on Portuguese. ☆226 Updated 2 months ago
- This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with enti… ☆240 Updated last year
- Portuguese translation of the GLUE benchmark and Scitail dataset ☆26 Updated 2 years ago
- ☆316 Updated 8 months ago
- Simply, faster, sentence-transformers ☆127 Updated 3 weeks ago
- 1 line for thousands of State of The Art NLP models in hundreds of languages. The fastest and most accurate way to solve text problems. ☆857 Updated this week
- FastFit ⚡ When LLMs are Unfit Use FastFit ⚡ Fast and Effective Text Classification with Many Classes ☆180 Updated last month
- Fine-Tuning Embedding for RAG with Synthetic Data ☆456 Updated last year
- ✨ Bootstrap annotation with zero- & few-shot learning via OpenAI GPT-3 ☆318 Updated last year
- Streamlit Named Entity Recognition (NER) annotation custom component ☆38 Updated last year
- ☆44 Updated last year
- Language model fine-tuning on NER with an easy interface and cross-domain evaluation. "T-NER: An All-Round Python Library for Transformer… ☆373 Updated last year
- Software that makes labeling PDFs easy. ☆383 Updated 4 months ago
- Easily embed, cluster and semantically label text datasets ☆434 Updated 5 months ago
- Creating class-based TF-IDF matrices ☆81 Updated last year
- 🏖TagEditor - Annotation tool for spaCy ☆185 Updated last year
- Lite & Super-fast re-ranking for your search & retrieval pipelines. Supports SoTA Listwise and Pairwise reranking based on LLMs and cro… ☆571 Updated 3 weeks ago
- Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: … ☆321 Updated last year
- Label data using HuggingFace's transformers and automatically get a prediction service ☆175 Updated last year
- Here you can find all the Tutorials for Haystack 📓 ☆252 Updated last week