BERT distillation in practice, including BiLSTM distillation of BERT and TinyBert
☆13Apr 23, 2022Updated 4 years ago
Alternatives and similar repositories for knowledge_distillation
Users that are interested in knowledge_distillation are comparing it to the libraries listed below.
- bilibili-nlp☆30Sep 24, 2022Updated 3 years ago
- Fine-Grained Pixel-Text Alignment for Open-Vocabulary Semantic Segmentation☆15Mar 28, 2026Updated last month
- BERT distillation (distillation experiments based on BERT)☆317Jul 30, 2020Updated 5 years ago
- ☆23Jun 29, 2025Updated 10 months ago
- Source code for the ACL'2025 paper titled "Unveiling privacy risks in llm agent memory"☆30Dec 2, 2025Updated 5 months ago
- [CVPR2026 🌟] The first attempt at Marine Open Vocabulary Instance Segmentation☆48Updated this week
- ☆10Oct 23, 2021Updated 4 years ago
- Knowledge Distillation from BERT☆54Jan 7, 2019Updated 7 years ago
- Knowledge distillation in text classification with PyTorch. Knowledge distillation for Chinese text classification; teacher models BERT and XLNET, student model biLSTM.☆230Jul 27, 2022Updated 3 years ago
- [TIP] Exploring Effective Factors for Improving Visual In-Context Learning☆21Jul 2, 2025Updated 10 months ago
- Shanghai Covid 2022 Cases Visualization on Map☆18May 15, 2022Updated 3 years ago
- Minimal Jekyll theme for storytellers☆14Mar 8, 2024Updated 2 years ago
- Distilling Task-Specific Knowledge from BERT into Simple Neural Networks.☆15Aug 28, 2020Updated 5 years ago
- Multimodal Contrastive Representation Learning in Augmented Biomedical Knowledge Graphs☆32Dec 8, 2025Updated 4 months ago
- ☆13May 15, 2018Updated 7 years ago
- The code for Multi-Scale Hyperbolic Contrastive Learning for Cross-Subject EEG Emotion Recognition.☆21Jul 24, 2025Updated 9 months ago
- Examples for KubeEdge☆13Sep 29, 2020Updated 5 years ago
- Undergraduate thesis project: a chatbot for college students experiencing depression☆17Apr 8, 2021Updated 5 years ago
- Network intrusion detection on KDD-CUP99, built with Vue2 + Django + LogisticRegressionCV☆14Apr 6, 2022Updated 4 years ago
- ☆21Jul 11, 2019Updated 6 years ago
- Regularized MAML for time-series data prediction☆17Jun 8, 2022Updated 3 years ago
- Feature Structure Distillation with Centered Kernel Alignment in BERT Transferring official code☆11Jul 17, 2023Updated 2 years ago
- Golang implementations of various algorithms☆17Jan 8, 2016Updated 10 years ago
- Golang algorithm exercises☆18Apr 8, 2019Updated 7 years ago
- Fine-tuned BERT on SQuAd 2.0 Dataset. Applied Knowledge Distillation (KD) and fine-tuned DistilBERT (student) using BERT as the teacher m…☆26Feb 13, 2021Updated 5 years ago
- ☆70Apr 8, 2026Updated 3 weeks ago
- tensorflow2_knowledge_distilling_example☆12Apr 26, 2022Updated 4 years ago
- Reproduction of the paper "Distilling Task-Specific Knowledge from BERT into Simple Neural Networks"☆16Jun 13, 2021Updated 4 years ago
- This is the official GitHub repo for our paper: "BEEAR: Embedding-based Adversarial Removal of Safety Backdoors in Instruction-tuned Lang…☆22Jul 3, 2024Updated last year
- Making transparency normal!☆24Dec 18, 2023Updated 2 years ago
- A quick and easy-to-use Rust library for Excel file editing.☆32Apr 17, 2026Updated 2 weeks ago
- ☆15Oct 5, 2024Updated last year
- CVE-2024-43451 is a Windows NTLM vulnerability that allows an attacker to force authentication and capture NTLM hashes by using malicious…☆15Jan 21, 2025Updated last year
- [ICCV 2025] Enhancing Partially Relevant Video Retrieval with Hyperbolic Learning.☆55Updated this week
- knowledge distillation on BERT☆29Apr 11, 2020Updated 6 years ago
- LiPar: A Lightweight Parallel Learning Model for Practical In-Vehicle Network Intrusion Detection (arXiv:2311.08000v2)☆25Nov 22, 2025Updated 5 months ago
- ☆24Jul 25, 2024Updated last year
- The implementation for ACL 2022 paper☆20Aug 14, 2022Updated 3 years ago
- Distilling Task-Specific Knowledge from Teacher Model into BiLSTM☆31Dec 14, 2024Updated last year