[EMNLP 2022] Language Model Pre-Training with Sparse Latent Typing
☆14Feb 10, 2023Updated 3 years ago
Alternatives and similar repositories for SparseLT
Users that are interested in SparseLT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for "SCL-RAI: Span-based Contrastive Learning with Retrieval Augmented Inference for Unlabeled Entity Problem in NER" @COLING-2022☆11Aug 20, 2022Updated 3 years ago
- The Codebase for Causal Distillation for Language Models (NAACL '22)☆26May 1, 2022Updated 4 years ago
- Serial Contrastive Knowledge Distillation for Continual Few-shot Relation Extraction, Findings of ACL 2023☆14May 12, 2023Updated 3 years ago
- ☆15Dec 5, 2019Updated 6 years ago
- An implementation of (Chambers and Jurafsky, 2008), using updated machine learning models, and different training data domains for an ind…☆14Dec 8, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Chain-of-thought 방식을 활용하여 llama2를 fine-tuning☆10Nov 18, 2023Updated 2 years ago
- Bias Correction of Learned Generative Models using Likelihood-Free Importance Weighting☆11Mar 24, 2023Updated 3 years ago
- The code repository for AMR guided joint information extraction model (NAACL-2021).☆37Apr 10, 2022Updated 4 years ago
- Distributed SDDMM Kernel☆12Jul 8, 2022Updated 3 years ago
- Aioli: A unified optimization framework for language model data mixing☆32Jan 17, 2025Updated last year
- Korean Abstract Meaning Representation (AMR) Corpus☆10Feb 27, 2022Updated 4 years ago
- 한국어 상호참조해결 (개체 후보 대상)☆10Aug 12, 2020Updated 5 years ago
- [ACL 2021 Findings] HySPA: Hybrid Span Generation for Scalable Text-to-Graph Extraction☆10Sep 16, 2021Updated 4 years ago
- ALIGNIE: Few-Shot Fine-Grained Entity Typing with Automatic Label Interpretation and Instance Generation☆20Dec 12, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Code publication to the paper "Normalized Attention Without Probability Cage"☆17Nov 9, 2021Updated 4 years ago
- Code for EMNLP 2021 paper: Improving Sequence-to-Sequence Pre-training via Sequence Span Rewriting☆17Nov 30, 2021Updated 4 years ago
- About Code release for "FlashBias: Fast Computation of Attention with Bias" (NeurIPS 2025), https://arxiv.org/abs/2505.12044☆28Nov 17, 2025Updated 6 months ago
- Code for running the experiments in Deep Subjecthood: Higher Order Grammatical Features in Multilingual BERT☆17Aug 15, 2023Updated 2 years ago
- ☆19Feb 14, 2023Updated 3 years ago
- sigma-MoE layer☆21Jan 5, 2024Updated 2 years ago
- Code for the paper "Factorising Meaning and Form for Intent-Preserving Paraphrasing", Tom Hosking & Mirella Lapata (ACL 2021)☆27Nov 8, 2023Updated 2 years ago
- MV-RAG combines retrieval with multi-view generation to create accurate 3D-consistent visuals. By retrieving reference images and text, i…☆24Nov 29, 2025Updated 5 months ago
- MATLAB/Octave generator of Hamming ECC coding. Output format is Verilog HDL.☆12Dec 27, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- 2019 국어경진대회 한국어 의존구문 분석 대상(문체부 장관상)☆15Oct 26, 2022Updated 3 years ago
- Archives for Triton Inference Server Practices☆15Feb 28, 2022Updated 4 years ago
- Source code for NeurIPS 2020 paper "Node Classification on Graphs with Few-Shot Novel Labels via Meta Transformed Network Embedding"☆10Nov 17, 2020Updated 5 years ago
- This is a repository for paper titled, PlaSma: Making Small Language Models Better Procedural Knowledge Models for (Counterfactual) Plann…☆14Nov 3, 2023Updated 2 years ago
- ☆23Sep 2, 2024Updated last year
- [ACL 2025] Outlier-Safe Pre-Training for Robust 4-Bit Quantization of Large Language Models☆39Nov 4, 2025Updated 6 months ago
- Baseline code for NAACL 2021 paper "Blow the Dog Whistle: A Chinese Dataset for Cant Understanding with Common Sense and World Knowledge"☆21Jul 6, 2021Updated 4 years ago
- Pytorch implementation of our paper accepted by IEEE TNNLS, 2022 -- Distilling a Powerful Student Model via Online Knowledge Distillation☆31Nov 11, 2021Updated 4 years ago
- A TensorFlow implementation of FlowQA☆15Nov 24, 2018Updated 7 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- ☆21Nov 3, 2021Updated 4 years ago
- This project is concerned with my participating in the RuNNE competition https://github.com/dialogue-evaluation/RuNNE☆13Jun 28, 2023Updated 2 years ago
- 2020-natural-language-processing-project☆10Dec 18, 2020Updated 5 years ago
- DocEE: A Large-Scale and Fine-grained Benchmark for Document-level Event Extraction☆40Apr 19, 2023Updated 3 years ago
- An (incomplete) overview of information extraction☆43Apr 28, 2022Updated 4 years ago
- A Kernel-Based View of Language Model Fine-Tuning https://arxiv.org/abs/2210.05643☆78Sep 4, 2023Updated 2 years ago
- The Mingled Structured Predictor☆29Mar 28, 2024Updated 2 years ago