๐ ๏ธ Tools for Transformers compression using PyTorch Lightning โก
โ85Feb 1, 2026Updated last month
Alternatives and similar repositories for bert-squeeze
Users that are interested in bert-squeeze are comparing it to the libraries listed below
Sorting:
- Few-shot Named Entity Recognitionโ121Mar 30, 2022Updated 3 years ago
- Combining encoder-based language modelsโ11Nov 11, 2021Updated 4 years ago
- โ75Jul 2, 2021Updated 4 years ago
- Accurate word segmentation for hashtags and text, powered by Transformers and Beam Search. A scalable alternative to heuristic splitters โฆโ77Jan 8, 2026Updated last month
- LAReQA is a challenging benchmark for evaluating language agnostic answer retrieval from a multilingual candidate pool. This repository cโฆโ14May 19, 2020Updated 5 years ago
- โ15Dec 20, 2020Updated 5 years ago
- Prune a model while finetuning or training.โ406Jun 21, 2022Updated 3 years ago
- Simply, faster, sentence-transformersโ144Aug 27, 2024Updated last year
- Collection of NLP model explanations and accompanying analysis toolsโ144Jun 26, 2023Updated 2 years ago
- XtremeDistil framework for distilling/compressing massive multilingual neural network models to tiny and efficient models for AI at scaleโ157Dec 20, 2023Updated 2 years ago
- A pre-trained model with multi-exit transformer architecture.โ56Dec 10, 2022Updated 3 years ago
- A PyTorch-based model pruning toolkit for pre-trained language modelsโ388Aug 31, 2023Updated 2 years ago
- skweak: A software toolkit for weak supervision applied to NLP tasksโ926Sep 2, 2024Updated last year
- Matching Tabular Data to Knowledge Graphsโ20Apr 27, 2023Updated 2 years ago
- โ17Oct 27, 2020Updated 5 years ago
- Flexible components pairing ๐ค Transformers with Pytorch Lightningโ612Nov 21, 2022Updated 3 years ago
- ๐ธ fastText + Bloom embeddings for compact, full-coverage vectors with spaCyโ335Apr 25, 2025Updated 10 months ago
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawlerโ23Mar 21, 2021Updated 4 years ago
- Augmenty is an augmentation library based on spaCy for augmenting texts.โ157May 24, 2024Updated last year
- State of the art faster Transformer with Tensorflow 2.0 ( NLP, Computer Vision, Audio ).โ85Mar 16, 2023Updated 2 years ago
- Code release for Type-Aware Bi-Encoders for Open-Domain Entity Retrievalโ19Sep 24, 2022Updated 3 years ago
- Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/โฆโ28Apr 17, 2024Updated last year
- Model implementation for the contextual embeddings projectโ41Jun 2, 2025Updated 9 months ago
- A library to synthesize text datasets using Large Language Models (LLM)โ152Jan 17, 2023Updated 3 years ago
- Asent is a python library for performing efficient and transparent sentiment analysis using spaCy.โ120Oct 20, 2025Updated 4 months ago
- Explainable Zero-Shot Topic Extractionโ65Aug 19, 2024Updated last year
- Search Engines with Autoregressive Language modelsโ295Apr 4, 2023Updated 2 years ago
- This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' puโฆโ41Jan 5, 2022Updated 4 years ago
- Set of vectorizers that extract keyphrases with part-of-speech patterns from a collection of text documents and convert them into a documโฆโ267Nov 8, 2024Updated last year
- SQuARE: Software for question answering research.โ75Jun 25, 2024Updated last year
- Training & evaluation library for text-based neural re-ranking and dense retrieval models built with PyTorchโ265Jan 27, 2023Updated 3 years ago
- Efficient, scalable and enterprise-grade CPU/GPU inference server for ๐ค Hugging Face transformer models ๐โ1,688Oct 23, 2024Updated last year
- Summarization, translation, sentiment-analysis, text-generation and more at blazing speed using a T5 version implemented in ONNX.โ256Nov 2, 2022Updated 3 years ago
- [NAACL 2021] This is the code for our paper `Fine-Tuning Pre-trained Language Model with Weak Supervision: A Contrastive-Regularized Selfโฆโ206Aug 17, 2022Updated 3 years ago
- LUKE -- Language Understanding with Knowledge-based Embeddingsโ727Nov 19, 2023Updated 2 years ago
- DaCy: The State of the Art Danish NLP pipeline using SpaCyโ100Dec 26, 2024Updated last year
- Entity Disambiguation as text extraction (ACL 2022)โ182Apr 17, 2022Updated 3 years ago
- SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in cโฆโ359Feb 22, 2022Updated 4 years ago
- Nearly Inference Free Embeddings: make your RAG queries 500x fasterโ70Feb 20, 2026Updated last week