zero-vocab or low-vocab embeddings
☆18Jul 17, 2022Updated 3 years ago
Alternatives and similar repositories for embeddings
Users that are interested in embeddings are comparing it to the libraries listed below
- Source code for "N-ary Constituent Tree Parsing with Recursive Semi-Markov Model" published at ACL 2021☆10May 27, 2021Updated 4 years ago
- Normalize text string☆12Nov 6, 2018Updated 7 years ago
- Unifying Cross-Lingual Semantic Role Labeling with Heterogeneous Linguistic Resources (NAACL-2021).☆17Nov 18, 2021Updated 4 years ago
- SMiLER - Samsung MultiLingual Entity and Relation Extraction dataset☆18Feb 11, 2021Updated 5 years ago
- Source code of ACL2022 "Headed-Span-Based Projective Dependency Parsing" and "Combining (second-order) graph-based and headed-span-based …☆16Jan 12, 2023Updated 3 years ago
- A Language-consistent Open Relation Extraction Model.☆16Mar 24, 2023Updated 2 years ago
- This is a repository for the paper on testing inductive bias with scaled-down RoBERTa models.☆21Jan 10, 2022Updated 4 years ago
- Implementation of pQRNN in PyTorch☆46Oct 10, 2021Updated 4 years ago
- Accurate and Fast ALSH for Maximum Inner Product Search (KDD 2018)☆25Jul 8, 2021Updated 4 years ago
- Fast and versatile tokenizer for language models, compatible with SentencePiece, Tokenizers, Tiktoken and more. Supports BPE, Unigram and…☆44Oct 10, 2025Updated 4 months ago
- Multilingual and monolingual corpora focused on Caucasus languages for Natural Language Processing (NLP)☆36Nov 29, 2024Updated last year
- SemClinBr - a multi-institutional and multi-specialty semantically annotated corpus for Portuguese clinical NLP tasks☆31Mar 12, 2024Updated last year
- Code and models for the paper titled "Better Feature Integration for Named Entity Recognition", NAACL 2021.☆30Nov 5, 2021Updated 4 years ago
- AdamW optimizer for bfloat16 models in pytorch 🔥.☆39Jun 16, 2024Updated last year
- Java implementation of the EbMS 2.0 specification.☆10Feb 20, 2026Updated 2 weeks ago
- An interpreter for a small ML-ish language☆11Oct 6, 2017Updated 8 years ago
- open source knowledge for Syllabics font design and development☆10Nov 13, 2024Updated last year
- Handles OpenDocument files and translates them to HTML.☆10Oct 8, 2019Updated 6 years ago
- Statistical discontinuous constituent parsing☆11Feb 15, 2018Updated 8 years ago
- TSDG: An efficient index graph for graph-based nearest neighbor search☆10Jul 14, 2022Updated 3 years ago
- Narwhal is a keyword and KEY NARRATIVE manager that creates language-aware classes. Because Narwhal does not use NLP it avoids complexity…☆12Oct 16, 2018Updated 7 years ago
- A memory allocator that aims to eliminate dangling pointer vulnerabilities at a low overhead, using virtualisation via Dune. My Computer …☆10Nov 27, 2019Updated 6 years ago
- Redis distributed lock implementation for Python based on Pub/Sub messaging☆11Feb 14, 2026Updated 3 weeks ago
- The repo contains our code of "Semantic Mask for Transformer based End-to-End Speech Recognition"☆39Jun 9, 2020Updated 5 years ago
- Data and code for the preprint "In-Context Learning with Long-Context Models: An In-Depth Exploration"☆42Aug 20, 2024Updated last year
- Rationales for Sequential Predictions☆40Mar 10, 2022Updated 3 years ago
- SPRINT Toolkit helps you evaluate diverse neural sparse models easily using a single click on any IR dataset.☆47Jul 25, 2023Updated 2 years ago
- 🔠 Evolution of Language and Information Technology☆46May 3, 2024Updated last year
- LUNA: a Framework for Language Understanding and Naturalness Assessment.☆12Sep 9, 2023Updated 2 years ago
- Zurich Morphological Lexicon for German: a tool to extract a morphological lexicon from Wiktionary☆12Aug 10, 2023Updated 2 years ago
- TIS-100 Implementation in JavaScript☆10Sep 19, 2019Updated 6 years ago
- Temporal and Causal Relation extraction module for the Newsreader project.☆10Oct 26, 2015Updated 10 years ago
- ☆11Jul 12, 2021Updated 4 years ago
- ☆11Sep 8, 2017Updated 8 years ago
- Transcribe Greek text to Latin alphabet using the ISO 843:1997 standard (also known as ELOT 743:1987)☆13Oct 12, 2022Updated 3 years ago
- Seminar: intro to deep learning with tensorflow☆13Jun 27, 2017Updated 8 years ago
- Risk Minimization Algorithms in Structured Prediction (JMLR 2016)☆13Jan 26, 2017Updated 9 years ago
- Mixture of Experts (MoE) techniques for enhancing LLM performance through expert-driven prompt mapping and adapter combinations.☆12Feb 11, 2024Updated 2 years ago
- Dutch data.☆10Nov 12, 2025Updated 3 months ago