TakeLab / podium
Podium: a framework-agnostic Python NLP library for data loading and preprocessing
☆60Updated 3 years ago
Alternatives and similar repositories for podium
Users who are interested in podium are comparing it to the libraries listed below.
- ☆75Updated 4 years ago
- Create interactive textual heat maps for Jupyter notebooks☆196Updated last year
- XtremeDistil framework for distilling/compressing massive multilingual neural network models to tiny and efficient models for AI at scale☆157Updated 2 years ago
- Repository with code for MaChAmp: https://aclanthology.org/2021.eacl-demos.22/☆90Updated this week
- Examples for aligning, padding and batching sequence labeling data (NER) for use with pre-trained transformer models☆64Updated 3 years ago
- A library to synthesize text datasets using Large Language Models (LLM)☆152Updated 3 years ago
- LM Pretraining with PyTorch/TPU☆137Updated 6 years ago
- A PyTorch-based open-source framework that provides methods for improving the weakly annotated data and allows researchers to efficiently…☆108Updated last year
- Accelerated NLP pipelines for fast inference on CPU. Built with Transformers and ONNX runtime.☆127Updated 5 years ago
- ☆104Updated 5 years ago
- Statistics on multilingual datasets☆17Updated 3 years ago
- A 🤗-style implementation of BERT using lambda layers instead of self-attention☆69Updated 5 years ago
- An Interactive Tool for Scalable and Reproducible Error Analysis.☆109Updated 4 years ago
- Automatically detect errors in annotated corpora.☆48Updated 2 years ago
- Self-training with Weak Supervision (NAACL 2021)☆163Updated 2 years ago
- diagNNose is a Python library that facilitates a broad set of tools for analysing hidden activations of neural models.☆82Updated 2 years ago
- Load What You Need: Smaller Multilingual Transformers for PyTorch and TensorFlow 2.0.☆105Updated 3 years ago
- 🛠️ Tools for Transformers compression using PyTorch Lightning ⚡☆85Updated this week
- Robust and Fast tokenizations alignment library for Rust and Python https://tamuhey.github.io/tokenizations/☆193Updated 2 years ago
- A lightweight but powerful library to build token indices for NLP tasks, compatible with major Deep Learning frameworks like PyTorch and …☆51Updated last year
- classy is a simple-to-use library for building high-performance Machine Learning models in NLP.☆87Updated last month
- Hyperparameter Search for AllenNLP☆140Updated 11 months ago
- A Word Sense Disambiguation system integrating implicit and explicit external knowledge.☆69Updated 4 years ago
- Code for the Shortformer model, from the ACL 2021 paper by Ofir Press, Noah A. Smith and Mike Lewis.☆147Updated 4 years ago
- Generate BERT vocabularies and pretraining examples from Wikipedias☆17Updated 5 years ago
- Question-answers, collected from Google☆131Updated 4 years ago
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Do…☆81Updated last year
- Custom Natural Language Processing with big and small models 🌲🌱☆66Updated 4 years ago
- Sentence transformers models for SpaCy☆108Updated 2 years ago
- Code and data for the WSDM '19 paper "Crosslingual Document Embedding as Reduced-Rank Ridge Regression (Cr5)"☆30Updated 6 years ago