timpal0l / sts-benchmark-swedish
☆11Updated 3 years ago
Related projects ⓘ
Alternatives and complementary repositories for sts-benchmark-swedish
- ☆73Updated 3 years ago
- A High-level Library for Named Entity Recognition in Python.☆22Updated 11 months ago
- Examples for aligning, padding and batching sequence labeling data (NER) for use with pre-trained transformer models☆65Updated last year
- Shared code for training sentence embeddings with Flax / JAX☆27Updated 3 years ago
- Codebase, data and models for the Keep it Simple paper at ACL2021☆36Updated last year
- A spaCy custom component that extracts and normalizes temporal expressions☆52Updated last year
- Open source library for few shot NLP☆77Updated last year
- A Python library aimed at dissecting and augmenting NER training data.☆56Updated last year
- A embed able annotation tool for end to end cross document co-reference☆41Updated last year
- This repository contains materials for the SIGIR 2022 tutorial on opinion summarization.☆34Updated 2 years ago
- Augmenty is an augmentation library based on spaCy for augmenting texts.☆151Updated 5 months ago
- A library to synthesize text datasets using Large Language Models (LLM)☆151Updated last year
- This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' pu…☆40Updated 2 years ago
- Data programming by demonstration for information extraction and span annotation☆35Updated 3 years ago
- 💫 SpaCy wrapper for ConceptNet 💫☆88Updated last year
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality …☆106Updated 8 months ago
- Summary Explorer is a tool to visually explore the state-of-the-art in text summarization.☆43Updated 6 months ago
- SPRINT Toolkit helps you evaluate diverse neural sparse models easily using a single click on any IR dataset.☆42Updated last year
- 🛠️ Tools for Transformers compression using PyTorch Lightning ⚡☆79Updated last week
- Generate BERT vocabularies and pretraining examples from Wikipedias☆18Updated 4 years ago
- Semantically Structured Sentence Embeddings☆67Updated last month
- ☆42Updated last year
- On Generating Extended Summaries of Long Documents☆77Updated 3 years ago
- Fine-tune transformers with pytorch-lightning☆44Updated 2 years ago
- Implementation of Marge, Pre-training via Paraphrasing, in Pytorch☆75Updated 3 years ago
- ☆83Updated 2 months ago
- [EMNLP-Findings 2020] Adapting BERT for Word Sense Disambiguation with Gloss Selection Objective and Example Sentences☆62Updated 6 months ago
- Mr. TyDi is a multi-lingual benchmark dataset built on TyDi, covering eleven typologically diverse languages.☆72Updated 2 years ago
- German small and large versions of GPT2.☆20Updated 2 years ago
- Source code and data for Like a Good Nearest Neighbor☆28Updated 9 months ago