Code for ACL 2023 paper titled "Lifting the Curse of Capacity Gap in Distilling Language Models"
☆29Jul 14, 2023Updated 2 years ago
Alternatives and similar repositories for MiniMoE
Users that are interested in MiniMoE are comparing it to the libraries listed below
Sorting:
- **ASCM4ABSA** - Our code and proposed data for NLPCC 2022 paper titled "Aspect-specific Context Modeling for Aspect-based Sentiment Analy…☆12Mar 26, 2023Updated 2 years ago
- Code and data for COLING 2022 paper titled "Structural Bias For Aspect Sentiment Triplet Extraction"☆26May 28, 2023Updated 2 years ago
- The code and preprocessed data for ACL 2021 paper titled "Exploiting Position Bias for Robust Aspect Sentiment Classification"☆27Aug 5, 2021Updated 4 years ago
- Code and dataset for paper "End-to-end Emotion-Cause Pair Extraction via Learning to Link"☆16Jan 12, 2022Updated 4 years ago
- Code for SIGIR 2019 paper titled "Syntax-Aware Aspect-Level Sentiment Classification with Proximity-Weighted Convolution Network"☆25Nov 21, 2023Updated 2 years ago
- Code for paper titled "Towards the Law of Capacity Gap in Distilling Language Models"☆102Jul 9, 2024Updated last year
- NLP 相关岗位 笔试面试资源汇总☆16Jun 17, 2021Updated 4 years ago
- decontamination☆25Dec 3, 2025Updated 2 months ago
- Code and dataset for EMNLP 2020 paper titled "A Multi-task Learning Framework for Opinion Triplet Extraction"☆51Aug 23, 2022Updated 3 years ago
- Meta Representation Transformation for Low-resource Cross-lingual Learning☆41May 5, 2021Updated 4 years ago
- codes for paper Combining Dynamic Local Context Focus and Dependency Cluster Attention for Aspect-level sentiment classification☆19Dec 10, 2021Updated 4 years ago
- Code of the COLING22 paper "uChecker: Masked Pretrained Language Models as Unsupervised Chinese Spelling Checkers"☆19Aug 17, 2022Updated 3 years ago
- MATCH-TUNING☆15Aug 6, 2022Updated 3 years ago
- [ICLR 2022] Code for paper "Exploring Extreme Parameter Compression for Pre-trained Language Models"(https://arxiv.org/abs/2205.10036)☆22May 24, 2023Updated 2 years ago
- SeqScore: Scoring for named entity recognition and other sequence labeling tasks☆23Dec 16, 2025Updated 2 months ago
- MasakhaNEWS: News Topic Classification for African Languages☆25May 12, 2024Updated last year
- Momentum Decoding: Open-ended Text Generation as Graph Exploration☆19Jan 27, 2023Updated 3 years ago
- SIB-200: A Simple, Inclusive, and Big Evaluation Dataset for Topic Classification in 200+ Languages and Dialects☆23Jan 26, 2025Updated last year
- Code for "BERTifying the Hidden Markov Model for Multi-Source Weakly Supervised Named Entity Recognition"☆32Jun 20, 2023Updated 2 years ago
- [CVPR-2023] The official dataset of Advancing Visual Grounding with Scene Knowledge: Benchmark and Method.☆33Jul 12, 2023Updated 2 years ago
- Staged Training for Transformer Language Models☆33Mar 31, 2022Updated 3 years ago
- Crosslingual Question Answering for African Languages☆30Sep 27, 2024Updated last year
- UDapter is a multilingual dependency parser that uses "contextual" adapters together with language-typology features for language-specifi…☆31Dec 5, 2022Updated 3 years ago
- ☆13Oct 11, 2024Updated last year
- Linear Attention for Efficient Bidirectional Sequence Modeling☆15May 13, 2025Updated 9 months ago
- 机器学习(Machine Learning)、深度学习(Deep Learning)、对抗神经网络(GAN),图神经网络(GNN),NLP,大数据相关的发展路书(roadmap), 并附海量源码(python,pytorch)带大家消化基本知识点,突破面试,完成从新手到合格…☆10Feb 25, 2020Updated 6 years ago
- A codebase for data crawling and preprocessing for TTS and ASR systems training.☆22Updated this week
- Source code for paper: Knowledge Inheritance for Pre-trained Language Models☆38Apr 24, 2022Updated 3 years ago
- ☆30Sep 27, 2021Updated 4 years ago
- Long-context pretrained encoder-decoder models☆96Oct 28, 2022Updated 3 years ago
- ☆40Oct 30, 2022Updated 3 years ago
- A collection of research on specialized medical LLMs for specific diseases and distinct medical specialties, organized by ICD-10 chapters…☆32Oct 10, 2025Updated 4 months ago
- KnowMAN: Weakly Supervised Multinomial Adversarial Networks☆12Nov 9, 2021Updated 4 years ago
- Meta-Learning for End-to-End ASR☆10Aug 8, 2020Updated 5 years ago
- A simple semi-supervised approach for creating huggingface data script loaders and upload to the hub.☆11Jun 23, 2024Updated last year
- 0-Shot Tokenizer Transplant☆14May 16, 2025Updated 9 months ago
- ☆10Sep 27, 2021Updated 4 years ago
- ☆12Jul 25, 2023Updated 2 years ago
- [CVPR 2024] Code and datasets for 'Learning Spatial Features from Audio-Visual Correspondence in Egocentric Videos'☆13Jun 16, 2024Updated last year