☆96Nov 28, 2019Updated 6 years ago
Alternatives and similar repositories for TinyBERT
Users that are interested in TinyBERT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ECIR'21: Simplified TinyBERT: Knowledge Distillation for Document Retrieval☆17Apr 25, 2021Updated 5 years ago
- This is a repository for the ACL 2020 paper: "Let Me Choose: From Verbal Context to Font Selection"☆12Nov 21, 2022Updated 3 years ago
- Baseline Models for Argumentative Text Understanding for AI Debater (NLPCC2021)☆12May 21, 2021Updated 5 years ago
- Source code and data for the EDM 2022 paper☆12May 16, 2022Updated 4 years ago
- Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.☆3,160Jan 22, 2024Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Pushing the Limits of Zero-shot End-to-End Speech Translation☆25Dec 12, 2024Updated last year
- [ACL‘20] Highway Transformer: A Gated Transformer.☆33Dec 5, 2021Updated 4 years ago
- SDSC Summer Institute 2018 Teaching Material☆10Nov 25, 2022Updated 3 years ago
- Keyphrase Generation for Scientific Document Retrieval☆11Oct 2, 2020Updated 5 years ago
- Resources for our IJCAI 2020 paper, TopicKA: Generating Commonsense Knowledge-Aware Dialogue Responses Towards the Recommended Topic Fact☆12Nov 30, 2020Updated 5 years ago
- Official PyTorch code for UAI 2023 paper "Concurrent Misclassification and Out-of-Distribution Detection for Semantic Segmentation via En…☆12Nov 10, 2023Updated 2 years ago
- MAsked Sequence to Sequence (MASS) pre-training for language generation☆20Mar 18, 2019Updated 7 years ago
- ☆13Nov 16, 2020Updated 5 years ago
- Code for the AAAI 2024 Oral paper "OWQ: Outlier-Aware Weight Quantization for Efficient Fine-Tuning and Inference of Large Language Model…☆70Mar 7, 2024Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆12Oct 9, 2018Updated 7 years ago
- CIFAR10 ResNets implemented in JAX+Flax☆12Apr 6, 2022Updated 4 years ago
- Neural variational inference and learning in undirected graphical models http://www.stanford.edu/~kuleshov/papers/nips2017.pdf☆17Apr 25, 2018Updated 8 years ago
- Optimizing Hyperparameters with Conformal Quantile Regression☆11May 22, 2023Updated 3 years ago
- ☆13Apr 30, 2026Updated 3 weeks ago
- RelEx - A simple framework for Relation Extraction built on AllenNLP☆15Jun 17, 2020Updated 5 years ago
- Migrated to Codeberg☆38Mar 22, 2026Updated 2 months ago
- Chinese Prosodic Structure Prediction☆10May 18, 2019Updated 7 years ago
- A collection of scripts for the Stack Exchange network☆16Aug 14, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- LLM shell and document interogator☆14Jul 24, 2023Updated 2 years ago
- Custom decoders for Kaldi☆13Jun 5, 2019Updated 6 years ago
- ☆10Oct 15, 2019Updated 6 years ago
- [COLM'24] "Deductive Beam Search: Decoding Deducible Rationale for Chain-of-Thought Reasoning"☆21Jun 14, 2024Updated last year
- Official implementation of paper: Shallow Flow Matching for Coarse-to-Fine Text-to-Speech Synthesis☆53Sep 20, 2025Updated 8 months ago
- Greedy Bayesian Posterior Approximation with Deep Ensembles. A. Tiulpin and M. B. Blaschko. (2021)☆11Jul 18, 2022Updated 3 years ago
- JAX implementation of "Fine-Tuning Language Models with Just Forward Passes"☆19Jun 10, 2023Updated 2 years ago
- Consistent dialogue generation☆16Oct 26, 2022Updated 3 years ago
- ☆13Oct 12, 2020Updated 5 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- TF code for our CVPR2020 paper "Discriminative Multi-modality Speech Recognition"☆26Apr 27, 2022Updated 4 years ago
- Point of Concept: To help to automate the collection of evidence for SOC 2 Audits and etc.☆11May 13, 2024Updated 2 years ago
- ☆11Mar 12, 2019Updated 7 years ago
- OWASP Web Security Testing Guide RAG system with ChromaDB, MCP for Claude Code☆20Dec 11, 2025Updated 5 months ago
- Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers.☆19Jul 24, 2025Updated 10 months ago
- ☆12Dec 7, 2018Updated 7 years ago
- Code of the paper "Beyond calibration: estimating the grouping loss of modern neural networks" published in ICLR 2023.☆12Nov 21, 2023Updated 2 years ago