UKPLab / sentence-transformers
State-of-the-Art Text Embeddings
★15,368 Updated this week
Related projects
Alternatives and complementary repositories for sentence-transformers
- BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.) ★6,952 Updated last year
- 🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (i… ★7,958 Updated this week
- 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX. ★135,166 Updated this week
- Unsupervised text tokenizer for Neural Network-based text generation. ★10,295 Updated 2 weeks ago
- An open-source NLP research library, built on PyTorch. ★11,759 Updated last year
- Facebook AI Research Sequence-to-Sequence Toolkit written in Python. ★30,552 Updated last month
- Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer" ★6,178 Updated 2 months ago
- A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training ★20,199 Updated 3 months ago
- Data augmentation for NLP ★4,454 Updated 4 months ago
- 🔥 Fast State-of-the-Art Tokenizers optimized for Research and Production ★9,052 Updated this week
- Leveraging BERT and c-TF-IDF to create easily interpretable topics. ★6,178 Updated this week
- Transformers for Information Retrieval, Text Classification, NER, QA, Language Modelling, Language Generation, T5, Multi-Modal, and Conve… ★4,109 Updated 5 months ago
- Train transformer language models with reinforcement learning. ★10,086 Updated this week
- Ongoing research training transformer models at scale ★10,595 Updated this week
- Minimal keyword extraction with BERT ★3,552 Updated 4 months ago
- A very simple framework for state-of-the-art Natural Language Processing (NLP) ★13,946 Updated this week
- [EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821 ★3,426 Updated last month
- A library for efficient similarity search and clustering of dense vectors. ★31,488 Updated this week
- 🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning. ★16,471 Updated this week
- Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities ★20,194 Updated last week
- Fast and memory-efficient exact attention ★14,279 Updated this week
- An annotated implementation of the Transformer paper. ★5,733 Updated 7 months ago
- XLNet: Generalized Autoregressive Pretraining for Language Understanding ★6,182 Updated last year
- Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models" ★10,776 Updated 3 months ago
- TensorFlow code and pre-trained models for BERT ★38,213 Updated 3 months ago
- Open Source Neural Machine Translation and (Large) Language Models in PyTorch ★6,773 Updated 4 months ago
- Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages ★7,296 Updated this week
- tiktoken is a fast BPE tokeniser for use with OpenAI's models. ★12,427 Updated last month
- This repository contains demos I made with the Transformers library by HuggingFace. ★9,500 Updated 3 weeks ago
- Repo for external large-scale work ★6,516 Updated 6 months ago