[NeurIPS 2022] DataMUX: Data Multiplexing for Neural Networks
☆60Nov 24, 2022Updated 3 years ago
Alternatives and similar repositories for DataMUX
Users that are interested in DataMUX are comparing it to the libraries listed below
Sorting:
- [ICLR 2022 Spotlight] Multi-Stage Episodic Control for Strategic Exploration in Text Games☆15Feb 8, 2026Updated 3 weeks ago
- [NAACL 2021] Reading and Acting while Blindfolded: The Need for Semantics in Text Game Agents☆11May 31, 2021Updated 4 years ago
- CNN Accelerator in Frequency Domain☆12Feb 22, 2020Updated 6 years ago
- Unifew: Unified Fewshot Learning Model☆18Sep 10, 2021Updated 4 years ago
- [DAC 2020] Analysis and Optimization of the Implicit Broadcasts in FPGA HLS to Improve Maximum Frequency☆32Feb 17, 2021Updated 5 years ago
- ☆46Apr 13, 2022Updated 3 years ago
- ☆21Dec 30, 2021Updated 4 years ago
- ☆13Oct 26, 2023Updated 2 years ago
- incremental symbol learning for natural language understanding☆10Jun 12, 2023Updated 2 years ago
- Benchmark framework of compute-in-memory based accelerators for deep neural network (inference engine focused)☆10Jun 1, 2021Updated 4 years ago
- Implementation of Hyena Hierarchy in JAX☆10Apr 30, 2023Updated 2 years ago
- sketching algorithms implemented in chapel and python☆10Jun 8, 2017Updated 8 years ago
- ☆10Aug 28, 2020Updated 5 years ago
- Guiding Attention for Self-Supervised Learning with Transformers☆12Feb 8, 2023Updated 3 years ago
- Aligntune : A Modular Toolkit for Post Training Alignment of LLMs☆35Feb 26, 2026Updated last week
- [ICLR 2022] Linking Emergent and Natural Languages via Corpus Transfer☆33Jun 2, 2024Updated last year
- Implementation of experiments in paper "Learning from Rules Generalizing Labeled Exemplars" to appear in ICLR2020 (https://openreview.net…☆50Feb 28, 2023Updated 3 years ago
- Adding new tasks to T0 without catastrophic forgetting☆33Oct 20, 2022Updated 3 years ago
- [EMNLP 2022] Language Model Pre-Training with Sparse Latent Typing☆14Feb 10, 2023Updated 3 years ago
- ☆11Feb 11, 2019Updated 7 years ago
- CCQA A New Web-Scale Question Answering Dataset for Model Pre-Training☆32Jul 20, 2022Updated 3 years ago
- ☆58May 4, 2022Updated 3 years ago
- ☆11Feb 16, 2023Updated 3 years ago
- Wraps the NVDLA project for Chipyard integration☆22Sep 2, 2025Updated 6 months ago
- Some CSS experiments for arXiv HTML documents converted via latexml☆20Feb 26, 2026Updated last week
- Adaptive floating-point based numerical format for resilient deep learning☆14Apr 11, 2022Updated 3 years ago
- [COLM 2024] Early Weight Averaging meets High Learning Rates for LLM Pre-training☆19Oct 12, 2024Updated last year
- Teaching Addition to Small Transformers☆17Nov 28, 2023Updated 2 years ago
- Rationales for Sequential Predictions☆40Mar 10, 2022Updated 3 years ago
- An ultra-lightweight JAX implementation of sparse Gaussian processes via pathwise sampling.☆22Mar 31, 2021Updated 4 years ago
- Applying "Load What You Need: Smaller Versions of Multilingual BERT" to LaBSE☆19Sep 22, 2021Updated 4 years ago
- Official implementation of paper Gradient Matching for Domain Generalization☆124Dec 14, 2021Updated 4 years ago
- ☆47Jan 11, 2021Updated 5 years ago
- ☆20Mar 22, 2024Updated last year
- Measuring if attention is explanation with ROAR☆22Mar 3, 2023Updated 3 years ago
- Portfolio REgret for Confidence SEquences☆21Jan 6, 2026Updated 2 months ago
- Source code for the Nature Machine Intelligence paper: When and how convolutional neural networks generalize to out-of-distribution categ…☆24Feb 26, 2022Updated 4 years ago
- The repository for the paper "When Do You Need Billions of Words of Pretraining Data?"☆21Nov 10, 2020Updated 5 years ago
- Provides the code for the paper "EBPC: Extended Bit-Plane Compression for Deep Neural Network Inference and Training Accelerators" by Luk…☆19Oct 6, 2019Updated 6 years ago