jungmaier / dirichlet-smoothed-word-embeddings
Word embeddings from PPMI-weighted and Dirichlet-smoothed co-occurrence matrices
☆10Aug 3, 2020Updated 5 years ago
Alternatives and similar repositories for dirichlet-smoothed-word-embeddings
Users who are interested in dirichlet-smoothed-word-embeddings are comparing it to the libraries listed below.
- Text Corpus of African American Fiction and Poetry, from 1853-1923☆10Aug 5, 2020Updated 5 years ago
- Linear Attention for Efficient Bidirectional Sequence Modeling☆15May 13, 2025Updated 9 months ago
- State-of-the-art neural tagger and lemmatizer for ancient languages☆13Mar 9, 2025Updated 11 months ago
- ☆10Oct 2, 2024Updated last year
- Materials for the Text Analysis Pedagogy Institute course on Finding Word Meaning Through Context☆11Jul 21, 2023Updated 2 years ago
- Label shift estimation for transfer difficulty with Familiarity.☆10Feb 4, 2025Updated last year
- Poetry Corpora Annotated on Aesthetic Emotions☆12Aug 2, 2022Updated 3 years ago
- 0-Shot Tokenizer Transplant☆14May 16, 2025Updated 9 months ago
- ☆10Sep 13, 2022Updated 3 years ago
- Data and code: "Answering legal questions from laymen in German civil law system", Büttner & Habernal, EACL'24☆13Mar 2, 2024Updated last year
- HyPe: Better Pre-trained Language Model Fine-tuning with Hidden Representation Perturbation [ACL 2023]☆14Jul 11, 2023Updated 2 years ago
- ☆13Nov 28, 2025Updated 2 months ago
- Digitale Geisteswissenschaften rund um Graphentechnologien☆10Updated this week
- mPLM-Sim: Better Cross-Lingual Similarity and Transfer in Multilingual Pretrained Language Models☆11Jan 19, 2024Updated 2 years ago
- decontamination☆24Dec 3, 2025Updated 2 months ago
- Abstractive text summarization done with the help of LSTMs using encoder-decoder model which was able to achieve accuracy of 77.27% on t…☆10Sep 22, 2020Updated 5 years ago
- 🐝Apiary: The Data API for RRCHNM☆10Updated this week
- Collection of description of concepts, procedures, and simple XSLT files for text processing, e.g. simplify InDesign documents (.idml) to…☆12Jan 9, 2020Updated 6 years ago
- This repository provides the source code used to automatically generate the book summarization datasets described in the paper titled "Ec…☆11Apr 14, 2025Updated 10 months ago
- KnowMAN: Weakly Supervised Multinomial Adversarial Networks☆12Nov 9, 2021Updated 4 years ago
- ☆52Jun 6, 2023Updated 2 years ago
- ☆16Feb 18, 2023Updated 2 years ago
- 🔍 Multilingual Evaluation of English-Centric LLMs via Cross-Lingual Alignment☆11Apr 6, 2025Updated 10 months ago
- A RegEx GUI☆14Jan 13, 2021Updated 5 years ago
- ChunkNorris is a black belt in document chunking to feed your LLMs and RAG apps 🥋🔪☆22Feb 10, 2026Updated last week
- ☆10Oct 15, 2019Updated 6 years ago
- Support for linguistics-style examples in Org mode☆10Dec 9, 2022Updated 3 years ago
- [Konvens21] This repository contains the DFKI MobIE Corpus, a dataset of 3,232 German-language documents that have been annotated with fi…☆12Sep 17, 2024Updated last year
- [ICML'25] "Rethinking Addressing in Language Models via Contextualized Equivariant Positional Encoding" by Jiajun Zhu, Peihao Wang, Ruisi…☆14Jun 6, 2025Updated 8 months ago
- Code for ACL 2023 paper "Learning 'O' Helps for Learning More: Handling the Unlabeled Entity Problem for Class-incremental NER"☆11Jul 17, 2023Updated 2 years ago
- 🕸 GlotWeb: Web Indexing for Low-Resource Languages -- under construction.☆17Aug 13, 2025Updated 6 months ago
- Research into identifying and correcting incorrect labels in the CoNLL-2003 corpus.☆12May 11, 2021Updated 4 years ago
- ☆13Jun 16, 2021Updated 4 years ago
- Implementation of Cascaded Head-colliding Attention (ACL'2021)☆11Sep 16, 2021Updated 4 years ago
- ☆10Jun 8, 2024Updated last year
- collections of data science, machine learning and data visualization projects with pandas, sklearn, matplotlib, tensorflow2, Keras, vario…☆10Apr 4, 2022Updated 3 years ago
- IsoBN: Fine-Tuning BERT with Isotropic Batch Normalization☆12Nov 23, 2021Updated 4 years ago
- ☆21Dec 5, 2022Updated 3 years ago
- ☆14Apr 8, 2021Updated 4 years ago