epfl-dlab / Cr5
Code and data for the WSDM '19 paper "Crosslingual Document Embedding as Reduced-Rank Ridge Regression (Cr5)"
☆30Updated 5 years ago
Alternatives and similar repositories for Cr5:
Users that are interested in Cr5 are comparing it to the libraries listed below
- Word embedding approach based on a dynamic log-linear model☆54Updated 7 years ago
- numeric fused-head identification and resolution☆33Updated 5 years ago
- Frame-Semantic and PropBank Semantic Role Labeling with Syntactic Scaffolding.☆50Updated 3 years ago
- WordMoversEmbeddings(WME) is a simple code for generating the vector representation of sentence/document for text classification and clus…☆81Updated 6 years ago
- Code and data for ACL2016 article "Which argument is more convincing? Analyzing and predicting convincingness of Web arguments using bidi…☆28Updated 8 years ago
- UFSAC is a resource containing all WordNet Sense Annotated Corpora, and a Java library for manipulating them☆37Updated 2 years ago
- Getting interpretable dimensions in word embedding spaces.☆14Updated last year
- Multi-Annotator Competence Estimation tool☆63Updated 5 years ago
- Participant Kit for the TextGraphs-15 Shared Task on Explanation Regeneration☆19Updated 3 years ago
- Extended Wikilinks dataset description☆14Updated 7 years ago
- A embed able annotation tool for end to end cross document co-reference☆42Updated 2 years ago
- A collection of English tweets annotated in Universal Dependencies.☆39Updated 3 years ago
- Training Temporal Word Embeddings with a Compass☆64Updated 2 years ago
- This repository contains the code for applying One-Token Approximation to a pretrained language model using subword-level tokenization.☆11Updated 4 years ago
- ☆10Updated 8 years ago
- pair2vec: Compositional Word-Pair Embeddings for Cross-Sentence Inference☆62Updated 2 years ago
- Converter from UD-trees to BART representation☆36Updated last year
- The accompanying code for "Are We Modeling the Task or the Annotator? An Investigation of Annotator Bias in Natural Language Understandin…☆21Updated 5 years ago
- ☆54Updated 3 years ago
- Word Sense Induction with BERT MLM☆28Updated last year
- ☆33Updated 3 years ago
- This repository contains the code for the Form-Context Model and its Attentive Mimicking variant.☆31Updated 4 years ago
- Corpus and annotations for the CL-Aff Shared Task from the University of Pennsylvania☆19Updated 3 years ago
- Reversible tokenization in Python.☆60Updated 6 years ago
- ☆54Updated 9 years ago
- Code to compute topic coherence for several topic cardinalities and aggregate scores across them☆21Updated 2 months ago
- Data and all☆14Updated 5 years ago
- Twpipe is a pipeline toolkit that parses raw tweets into universal dependencies.☆28Updated 6 years ago
- Auxiliary GAN for WE post-specialisation☆23Updated 6 years ago
- Uncovering divergent linguistic information in word embeddings with lessons for intrinsic and extrinsic evaluation☆63Updated 6 years ago