复现论文《Distilling Task-Specific Knowledge from BERT into Simple Neural Networks》
☆16Jun 13, 2021Updated 4 years ago
Alternatives and similar repositories for distill_BERT_into_RNN-CNN
Users that are interested in distill_BERT_into_RNN-CNN are comparing it to the libraries listed below
Sorting:
- Distilling Task-Specific Knowledge from BERT into Simple Neural Networks.☆15Aug 28, 2020Updated 5 years ago
- An implementation of ResNet with mixup and cutout regularizations and soft filter pruning.☆17Feb 23, 2026Updated 2 weeks ago
- Template Filling with Generative Transformers☆22Jun 8, 2021Updated 4 years ago
- An Industry Evaluation of Embedding-based Entity Alignment @ COLING'20☆26Nov 15, 2021Updated 4 years ago
- PyTorch implementation of Near-Lossless Post-Training Quantization of Deep Neural Networks via a Piecewise Linear Approximation☆23Feb 17, 2020Updated 6 years ago
- GPU implementation of Xnor network on inference level.☆22Aug 10, 2020Updated 5 years ago
- 基于Pytorch + BERT的抽取式机器阅读理解☆21Dec 8, 2022Updated 3 years ago
- knowledge distillation: 采用知识蒸馏,训练bert后指导textcnn☆19Apr 29, 2021Updated 4 years ago
- Various implementations and experimentation for deep neural network model compression☆24Sep 6, 2018Updated 7 years ago
- BERT distillation(基于BERT的蒸馏实验 )☆314Jul 30, 2020Updated 5 years ago
- Ancestral Gumbel-Top-k Sampling☆25Apr 11, 2020Updated 5 years ago
- Knowledge distillation in text classification with pytorch. 知识蒸馏,中文文本分类,教师模型BERT、XLNET,学生模型biLSTM。☆229Jul 27, 2022Updated 3 years ago
- A fork of BlenderProc used in the GRADE framework to generate environments and export some additional information for processing.☆10Mar 9, 2023Updated 3 years ago
- ☆11Apr 6, 2019Updated 6 years ago
- Source code for paper "Trajectory of Alternating Direction Method of Multipliers and Adaptive Acceleration" of NeurIPS 2019☆10Jan 25, 2024Updated 2 years ago
- Meta-Reinforcement Learning with Policy Residual Representation☆11Aug 15, 2019Updated 6 years ago
- Light Cube using PYNQ☆10Aug 4, 2018Updated 7 years ago
- 📄Source code variable naming using a seq2seq architecture☆10Mar 19, 2020Updated 5 years ago
- ☆10Jul 5, 2019Updated 6 years ago
- ☆112Oct 27, 2025Updated 4 months ago
- ☆14Aug 26, 2024Updated last year
- Code for our WACV 2021 paper "Exploiting the Redundancy in Convolutional Filters for Parameter Reduction"☆11Jan 6, 2021Updated 5 years ago
- ☆10Jan 5, 2021Updated 5 years ago
- ☆11Dec 11, 2023Updated 2 years ago
- Score and Distribution Matching Policy: Advanced accelerated Visuomotor Policies via matched distillation☆10May 9, 2025Updated 10 months ago
- Information Bottleneck in DNN with PyTorch☆15Jul 6, 2023Updated 2 years ago
- Code for our project CROWN (Conversational Passage Ranking by Reasoning over Word Networks)☆10Jan 11, 2024Updated 2 years ago
- ☆14Dec 14, 2023Updated 2 years ago
- Domain-Adaptive Multibranch Networks☆14Nov 7, 2020Updated 5 years ago
- Implementation for NATv2.☆23Feb 20, 2021Updated 5 years ago
- Official code release accompanying the paper "SimpleNeRF: Regularizing Sparse Input Neural Radiance Fields with Simpler Solutions"☆12Jun 7, 2025Updated 9 months ago
- ☆13Sep 2, 2023Updated 2 years ago
- Large-scale topic discovery with Sampled-MinHashing☆10Jul 3, 2019Updated 6 years ago
- Generates ffi-compatible layer for your rust code☆11Jul 4, 2020Updated 5 years ago
- 中文文本的向量表示方法(Sentence-BERT, CoSENT)的PyTorch简单实现,可以用于文本相似度计算。☆10Mar 27, 2022Updated 3 years ago
- [IROS 2021] ADD: A Fine-grained Dynamic Inference Architecture for Semantic Image Segmentation☆10May 3, 2022Updated 3 years ago
- Implementing DBSCAN using numpy and pytorch☆11Aug 21, 2020Updated 5 years ago
- python metric functions, such as MAP, NDCG, AUC...☆10Jul 25, 2014Updated 11 years ago
- 化工产品品质智能预测算法☆11Dec 10, 2018Updated 7 years ago