GeneZC/MiniMoE

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/GeneZC/MiniMoE)

GeneZC / MiniMoE

Code for ACL 2023 paper titled "Lifting the Curse of Capacity Gap in Distilling Language Models"

☆29

Alternatives and similar repositories for MiniMoE

Users that are interested in MiniMoE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

BD-MF / ASCM4ABSA
View on GitHub
**ASCM4ABSA** - Our code and proposed data for NLPCC 2022 paper titled "Aspect-specific Context Modeling for Aspect-based Sentiment Analy…
☆12Mar 26, 2023Updated 3 years ago
GeneZC / StructBias
View on GitHub
Code and data for COLING 2022 paper titled "Structural Bias For Aspect Sentiment Triplet Extraction"
☆26May 28, 2023Updated 3 years ago
GeneZC / PWCN
View on GitHub
Code for SIGIR 2019 paper titled "Syntax-Aware Aspect-Level Sentiment Classification with Proximity-Weighted Convolution Network"
☆25Nov 21, 2023Updated 2 years ago
shl5133 / E2EECPE
View on GitHub
Code and dataset for paper "End-to-end Emotion-Cause Pair Extraction via Learning to Link"
☆16Jan 12, 2022Updated 4 years ago
GeneZC / MiniMA
View on GitHub
Code for paper titled "Towards the Law of Capacity Gap in Distilling Language Models"
☆102Jul 9, 2024Updated 2 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
GeneZC / OTE-MTL
View on GitHub
Code and dataset for EMNLP 2020 paper titled "A Multi-task Learning Framework for Opinion Triplet Extraction"
☆50Aug 23, 2022Updated 3 years ago
stefan-it / xlm-v-experiments
View on GitHub
Experiments for XLM-V Transformers Integeration
☆13Feb 8, 2023Updated 3 years ago
XuMayi / DLCF-DCA
View on GitHub
codes for paper Combining Dynamic Local Context Focus and Dependency Cluster Attention for Aspect-level sentiment classification
☆19Dec 10, 2021Updated 4 years ago
lipiji / uChecker
View on GitHub
Code of the COLING22 paper "uChecker: Masked Pretrained Language Models as Unsupervised Chinese Spelling Checkers"
☆19Aug 17, 2022Updated 3 years ago
dadelani / africanlp-resources
View on GitHub
List of all the resources I developed in collaboration with LSV and Masakhane during my doctoral studies and beyond
☆13Aug 15, 2022Updated 3 years ago
tongshoujie / MATCH-TUNING
View on GitHub
MATCH-TUNING
☆15Aug 6, 2022Updated 3 years ago
twinkle0331 / Xcompression
View on GitHub
[ICLR 2022] Code for paper "Exploring Extreme Parameter Compression for Pre-trained Language Models"(https://arxiv.org/abs/2205.10036)
☆23May 24, 2023Updated 3 years ago
bltlab / seqscore
View on GitHub
SeqScore: Scoring for named entity recognition and other sequence labeling tasks
☆23Updated this week
masakhane-io / masakhane-news
View on GitHub
MasakhaNEWS: News Topic Classification for African Languages
☆26May 12, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
ivanmontero / autobot
View on GitHub
Implementation of the paper 'Sentence Bottleneck Autoencoders from Transformer Language Models'
☆17Mar 14, 2022Updated 4 years ago
dadelani / sib-200
View on GitHub
SIB-200: A Simple, Inclusive, and Big Evaluation Dataset for Topic Classification in 200+ Languages and Dialects
☆26May 20, 2026Updated 2 months ago
gmftbyGMFTBY / MomentumDecoding
View on GitHub
Momentum Decoding: Open-ended Text Generation as Graph Exploration
☆19Jan 27, 2023Updated 3 years ago
masakhane-io / africomet
View on GitHub
COMET for African languages
☆11Jan 24, 2025Updated last year
Yinghao-Li / CHMM-ALT
View on GitHub
Code for "BERTifying the Hidden Markov Model for Multi-Source Weakly Supervised Named Entity Recognition"
☆32Jun 20, 2023Updated 3 years ago
ARBML / dar
View on GitHub
A simple semi-supervised approach for creating huggingface data script loaders and upload to the hub.
☆11Jun 23, 2024Updated 2 years ago
lliai / AlphaTree-graphic-deep-neural-network
View on GitHub
机器学习(Machine Learning)、深度学习(Deep Learning)、对抗神经网络(GAN），图神经网络（GNN），NLP，大数据相关的发展路书(roadmap), 并附海量源码（python，pytorch）带大家消化基本知识点，突破面试，完成从新手到合格…
☆10Feb 25, 2020Updated 6 years ago
fbarez / neuroplasticity
View on GitHub
☆14Mar 31, 2024Updated 2 years ago
dguo98 / SeqMix
View on GitHub
Sequence-Level Mixed Sample Data Augmentation
☆23Mar 7, 2021Updated 5 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
oya163 / nepali-ner
View on GitHub
Named Entity Recognition in Nepali Language
☆10Jan 12, 2023Updated 3 years ago
masakhane-io / afriqa
View on GitHub
Crosslingual Question Answering for African Languages
☆31Sep 27, 2024Updated last year
dsridhar91 / hstm
View on GitHub
Code and data for "Heterogeneous Supervised Topic Models"
☆10Jun 27, 2022Updated 4 years ago
DAMO-NLP-SG / AdamergeX
View on GitHub
☆11Apr 2, 2024Updated 2 years ago
csong27 / auditing-text-generation
View on GitHub
Code for Auditing Data Provenance in Text-Generation Models (in KDD 2019)
☆10Jun 18, 2019Updated 7 years ago
allenai / staged-training
View on GitHub
Staged Training for Transformer Language Models
☆33Mar 31, 2022Updated 4 years ago
UKPLab / emnlp2021-prompt-ft-heuristics
View on GitHub
☆10Sep 27, 2021Updated 4 years ago
yangheng95 / metric-visualizer
View on GitHub
For easy metric logging and visualization
☆14Jan 31, 2025Updated last year
yangheng95 / BoostTextAugmentation
View on GitHub
☆14Aug 6, 2023Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
jonnypei / acl23-preadd
View on GitHub
☆12Jul 25, 2023Updated 2 years ago
wujwyi / CMC
View on GitHub
[NeurIPS 2024 poster] Cross-model Control: Improving Multiple Large Language Models in One-time Training
☆14Oct 25, 2024Updated last year
krypticmouse / matryoshka-representation-learning
View on GitHub
PyTorch implementation for MRL
☆23Feb 22, 2024Updated 2 years ago
Niger-Volta-LTI / yoruba-voice
View on GitHub
Repo & Project for the Imminent Research Grant code & tasks
☆12May 20, 2024Updated 2 years ago
SalesforceAIResearch / indict_code_gen
View on GitHub
INDICT: Code Generation with Internal Dialogues of Critiques for Both Security and Helpfulness
☆15Jun 2, 2026Updated last month
CommissarSilver / CVT
View on GitHub
This repository contains the replication package of our paper "Assessing the Security of GitHub Copilot’s Generated Code - A Targeted Rep…
☆10Nov 16, 2023Updated 2 years ago
haneul-yoo / HUE
View on GitHub
Hanja Understanding Evaluation Dataset
☆15May 2, 2022Updated 4 years ago