Knowledge Distillation Toolkit
☆88Jun 27, 2020Updated 5 years ago
Alternatives and similar repositories for aquvitae
Users that are interested in aquvitae are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for scaling Transformers☆26Dec 2, 2020Updated 5 years ago
- McKernel: A Library for Approximate Kernel Expansions in Log-linear Time.☆13Sep 3, 2022Updated 3 years ago
- PyTorch Flexible Hash Embeddings☆29Feb 4, 2020Updated 6 years ago
- A JAX implementation of stochastic addition.☆14Aug 15, 2022Updated 3 years ago
- The stand-alone training engine module for the ALOHA.eu project.☆15Oct 27, 2019Updated 6 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- A flexible variational inference LDA library.☆23Mar 15, 2019Updated 7 years ago
- python package for calculating famous measures in computational linguistics☆15Nov 5, 2024Updated last year
- A minimal implementation of a VAE with BinConcrete (relaxed Bernoulli) latent distribution in TensorFlow.☆22Feb 1, 2020Updated 6 years ago
- Research boilerplate for PyTorch.☆150May 31, 2023Updated 2 years ago
- ☆21Nov 16, 2018Updated 7 years ago
- High performance pytorch modules☆18Jan 14, 2023Updated 3 years ago
- Code for our ICLR Trustworthy ML 2020 workshop paper "Improved Image Wasserstein Attacks and Defenses"☆14Apr 28, 2020Updated 6 years ago
- A Python library for time series forecasting☆81Jun 11, 2025Updated 11 months ago
- ☆47May 22, 2017Updated 9 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆52Mar 7, 2019Updated 7 years ago
- ELECTRA MODEL NLP☆13Apr 8, 2020Updated 6 years ago
- Cross-modal Coherence Modeling for Caption Generation☆11Jul 24, 2020Updated 5 years ago
- ☆10Aug 25, 2018Updated 7 years ago
- Simple Structured Perceptron tagger in Python☆10May 30, 2017Updated 8 years ago
- Code for the UCL Statistical NLP course☆11Jan 19, 2015Updated 11 years ago
- Voice Conversion using Tacotron.☆11Dec 29, 2022Updated 3 years ago
- Train bilingual embeddings as described in our NAACL 2015 workshop paper "Bilingual Word Representations with Monolingual Quality in Mind…☆79Jun 15, 2019Updated 6 years ago
- Tools for extracting tables and results from Machine Learning papers☆439Nov 28, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆12Jan 9, 2019Updated 7 years ago
- Some small scale experiments for my blog posts 📝☆80Jun 22, 2022Updated 3 years ago
- The codes for recent knowledge distillation algorithms and benchmark results via TF2.0 low-level API☆112Apr 6, 2022Updated 4 years ago
- incremental symbol learning for natural language understanding☆10Jun 12, 2023Updated 2 years ago
- Code and data for SciPy 2018 talk on missing data☆21Jun 29, 2018Updated 7 years ago
- ☆31Apr 2, 2022Updated 4 years ago
- CS224S Course Project☆14Jun 9, 2014Updated 11 years ago
- Script for generating the rotowire-modified dataset (Iso et al; ACL 2019)☆12Sep 19, 2021Updated 4 years ago
- The official repo for "CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models"☆33Mar 26, 2026Updated 2 months ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Scripts to train a bidirectional LSTM with knowledge distillation from BERT☆159Nov 21, 2019Updated 6 years ago
- ☆15Sep 15, 2022Updated 3 years ago
- ☆17Jun 8, 2019Updated 6 years ago
- Flexible Reinforcement Learning Framework with PyTorch☆22Jul 17, 2020Updated 5 years ago
- A runtime shape checker and auto-annotator for tensor programs (pronounced "stanley")☆40Nov 9, 2019Updated 6 years ago
- [CVPR 2020] Dynamic Hierarchical Mimicking Towards Consistent Optimization Objectives☆84Jul 16, 2020Updated 5 years ago
- Repository for the paper "Optimal Subarchitecture Extraction for BERT"☆470Jun 22, 2022Updated 3 years ago