U-Sharma / NeuralScaleID
☆12Updated 4 years ago
Alternatives and similar repositories for NeuralScaleID:
Users that are interested in NeuralScaleID are comparing it to the libraries listed below
- ☆11Updated 3 years ago
- Analogous Safe-state Exploration (ASE) is an algorithm for provably safe and optimal exploration in MDPs with unknown, stochastic dynamic…☆11Updated 4 years ago
- An implementation of Transformer with Expire-Span, a circuit for learning which memories to retain☆33Updated 4 years ago
- Code repository for the AISTATS 2021 paper "Towards Understanding the Optimal Behaviors of Deep Active Learning Algorithms"☆15Updated 4 years ago
- ☆13Updated 3 years ago
- Official Implementation of "Transferring Inductive Biases Through Knowledge Distillation"☆14Updated 4 years ago
- ☆24Updated 3 years ago
- Implementation for NATv2.☆23Updated 4 years ago
- Symbolic Brittleness in Sequence Models: on Systematic Generalization in Symbolic Mathematics (AAAI 2022)☆14Updated 3 years ago
- An attempt to merge ESBN with Transformers, to endow Transformers with the ability to emergently bind symbols☆15Updated 3 years ago
- [ACL 2023]: Training Trajectories of Language Models Across Scales https://arxiv.org/pdf/2212.09803.pdf☆23Updated last year
- Usable implementation of Emerging Symbol Binding Network (ESBN), in Pytorch☆24Updated 4 years ago
- Energy Based Models are a quite novel technique for density estimation. In this university project I explore this new research topic and …☆16Updated 3 years ago
- ☆24Updated 11 months ago
- A framework for implementing equivariant DL☆10Updated 3 years ago
- ☆11Updated 7 years ago
- Deep Critical Learning. Implementation of ProSelfLC, IMAE, DM, etc.☆31Updated 2 years ago
- reproduces experiments from "Grounding inductive biases in natural images: invariance stems from variations in data"☆17Updated 6 months ago
- Large-batch Training, Neural Network Optimization☆9Updated 5 years ago
- Directed masked autoencoders☆14Updated 2 years ago
- Virtual Adversarial Training (VAT) techniques in PyTorch☆17Updated 2 years ago
- An adaptive training algorithm for residual network☆15Updated 4 years ago
- JAX implementation of Graph Attention Networks☆13Updated 3 years ago
- This repository contains some of the code used in the paper "Training Language Models with Langauge Feedback at Scale"☆27Updated 2 years ago
- A python library for highly configurable transformers - easing model architecture search and experimentation.☆49Updated 3 years ago
- Implementation of the models and datasets used in "An Information-theoretic Approach to Distribution Shifts"☆25Updated 3 years ago
- Implementation of Kronecker Attention in Pytorch☆18Updated 4 years ago
- Code for paper "Do Language Models Have Beliefs? Methods for Detecting, Updating, and Visualizing Model Beliefs"☆28Updated 2 years ago
- Code Release for "Broken Neural Scaling Laws" (BNSL) paper☆58Updated last year
- NeurIPS 2019 Paper Implementation☆12Updated 2 years ago