TakeLab / podium
Podium: a framework agnostic Python NLP library for data loading and preprocessing
β60Updated 2 years ago
Alternatives and similar repositories for podium:
Users that are interested in podium are comparing it to the libraries listed below
- Implementation, trained models and result data for the paper "Aspect-based Document Similarity for Research Papers" #COLING2020β62Updated 11 months ago
- β75Updated 3 years ago
- A π€-style implementation of BERT using lambda layers instead of self-attentionβ69Updated 4 years ago
- State of the art Semantic Sentence Embeddingsβ99Updated 2 years ago
- What are the best Systems? New Perspectives on NLP Benchmarkingβ13Updated 2 years ago
- A python tool for building large scale Wikipedia-based Information Retrieval datasetsβ46Updated 3 years ago
- Topic Inference with Zeroshot modelsβ61Updated last year
- Converter from UD-trees to BART representationβ36Updated last year
- A Word Sense Disambiguation system integrating implicit and explicit external knowledge.β68Updated 3 years ago
- Load What You Need: Smaller Multilingual Transformers for Pytorch and TensorFlow 2.0.β102Updated 2 years ago
- Code for the Shortformer model, from the ACL 2021 paper by Ofir Press, Noah A. Smith and Mike Lewis.β146Updated 3 years ago
- Tutorial to pretrain & fine-tune a π€ Flax T5 model on a TPUv3-8 with GCPβ58Updated 2 years ago
- Execute arbitrary SQL queries on π€ Datasetsβ32Updated last year
- spaCy + UDPipeβ161Updated 2 years ago
- Summary Explorer is a tool to visually explore the state-of-the-art in text summarization.β44Updated 10 months ago
- Generate BERT vocabularies and pretraining examples from Wikipediasβ18Updated 4 years ago
- Code for equipping pretrained language models (BART, GPT-2, XLNet) with commonsense knowledge for generating implicit knowledge statementβ¦β16Updated 3 years ago
- classy is a simple-to-use library for building high-performance Machine Learning models in NLP.β86Updated 2 months ago
- Factorization of the neural parameter space for zero-shot multi-lingual and multi-task transferβ39Updated 4 years ago
- GrammarTagger β A Neural Multilingual Grammar Profiler for Language Learningβ27Updated 3 years ago
- This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' puβ¦β40Updated 3 years ago
- Learning BPE embeddings by first learning a segmentation model and then training word2vecβ19Updated 2 years ago
- Repo for Aspire - A scientific document similarity model based on matching fine-grained aspects of scientific papers.β51Updated last year
- Automatically detect errors in annotated corpora.β47Updated last year
- β21Updated 3 years ago
- LM Pretraining with PyTorch/TPUβ134Updated 5 years ago
- An implementation of GrASP (Shnarch et. al., 2017)β21Updated 2 years ago
- CrossRE: A Cross-Domain Dataset for Relation Extraction (Findings of EMNLP 2022)β47Updated 7 months ago
- A Python library aimed at dissecting and augmenting NER training data.β58Updated last year
- β37Updated 2 years ago