Getting interpretable dimensions in word embedding spaces.
☆15Jul 6, 2023Updated 2 years ago
Alternatives and similar repositories for densray
Users that are interested in densray are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆13Apr 16, 2021Updated 5 years ago
- Extractive and Compressive Neural Summarization Based on Summary State Representations (NAACL 2019)☆16May 12, 2020Updated 6 years ago
- Analyzing mBERT's multilinguality in a small laboratory setting☆13Jun 12, 2023Updated 3 years ago
- ☆25Jan 22, 2024Updated 2 years ago
- Code for SPINE - Sparse Interpretable Neural Embeddings. Jhamtani H.*, Pruthi D.*, Subramanian A.*, Berg-Kirkpatrick T., Hovy E. AAAI 20…☆53Feb 4, 2020Updated 6 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- This repository includes the masking vocabulary used in the ICLR 2021 spotlight PMI-Masking paper☆14Aug 9, 2021Updated 4 years ago
- Python source code for EMNLP 2021 Findings paper: "Subword Mapping and Anchoring Across Languages".☆13Sep 17, 2021Updated 4 years ago
- Minimal code to train ELMo models in recent versions of TensorFlow☆14Apr 30, 2023Updated 3 years ago
- ACL22 paper: Imputing Out-of-Vocabulary Embeddings with LOVE Makes Language Models Robust with Little Cost☆42Nov 15, 2023Updated 2 years ago
- Source code accompanying the ICLR2020 publication 'Massively Multilingual Sparse Word Representations' https://openreview.net/forum?id=Hy…☆12Aug 15, 2023Updated 2 years ago
- Implementation of Cascaded Head-colliding Attention (ACL'2021)☆11Sep 16, 2021Updated 4 years ago
- An example of how to use spaCy for extremely large files without running into memory issues☆36Sep 17, 2022Updated 3 years ago
- ☆16May 6, 2021Updated 5 years ago
- Learning BPE embeddings by first learning a segmentation model and then training word2vec☆19Dec 18, 2022Updated 3 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆45Feb 11, 2026Updated 4 months ago
- benchmarks for LLM tokenizers☆18Mar 25, 2026Updated 2 months ago
- [NAACL 2024] A Framework aims to wisely initialize unseen subword embeddings in PLMs for efficient large-scale continued pretraining☆18Nov 26, 2023Updated 2 years ago
- Online Interpretable Word Embeddings☆37Nov 17, 2015Updated 10 years ago
- This repo supports various cross-lingual transfer learning & multilingual NLP models.☆92Sep 13, 2023Updated 2 years ago
- Universal Conceptual Cognitive Annotation (UCCA)☆22Jun 28, 2021Updated 4 years ago
- ☆40May 2, 2021Updated 5 years ago
- This repository contains the code for the Form-Context Model and its Attentive Mimicking variant.☆31May 11, 2020Updated 6 years ago
- Set-Equivariant Deep Learning Models☆22Dec 23, 2021Updated 4 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Automatically exported from code.google.com/p/transducersaurus☆11Apr 1, 2015Updated 11 years ago
- Code for the paper "Getting the most out of your tokenizer for pre-training and domain adaptation"☆22Feb 14, 2024Updated 2 years ago
- ☆20Mar 30, 2022Updated 4 years ago
- Bidirectional Recurrent Neural Network based sequence labeling for Medical Text.☆24May 22, 2016Updated 10 years ago
- Experimental code used in pre-training the KBIR and KeyBART models☆27Jul 8, 2022Updated 3 years ago
- FlexiTokens☆23Dec 27, 2025Updated 5 months ago
- ☆12Nov 17, 2018Updated 7 years ago
- Massively Multilingual Transfer for NER☆86Oct 7, 2021Updated 4 years ago
- Implementation of Nested Named Entity Recognition using Flair☆24Oct 29, 2021Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Temporary remove unused tokens during training to save ram and speed.☆23Jun 15, 2025Updated 11 months ago
- ☆14May 8, 2024Updated 2 years ago
- PyTorch implementation of NAACL 2021 paper "Multi-view Subword Regularization"☆26Jun 2, 2021Updated 5 years ago
- ☆12Sep 1, 2021Updated 4 years ago
- CLEARumor: ConvoLving ELMo against Rumors☆11Jul 25, 2024Updated last year
- ALTER: Auxiliary Text Rewriting Tool for Natural Language Generation☆17Dec 10, 2022Updated 3 years ago
- Staged Training for Transformer Language Models☆33Mar 31, 2022Updated 4 years ago