☆16Feb 28, 2025Updated last year
Alternatives and similar repositories for Grokking
Users that are interested in Grokking are comparing it to the libraries listed below
Sorting:
- Deep Networks Grok All the Time and Here is Why☆38May 18, 2024Updated last year
- Omnigrok: Grokking Beyond Algorithmic Data☆63Feb 24, 2023Updated 3 years ago
- ☆16May 16, 2023Updated 2 years ago
- Code for verifying deep neural feature ansatz☆22May 3, 2023Updated 2 years ago
- ☆14Oct 18, 2021Updated 4 years ago
- ☆15Sep 19, 2019Updated 6 years ago
- ☆12Aug 6, 2024Updated last year
- Implementation of DeepMind's "Sobolev Training for Neural Networks"☆11Apr 2, 2018Updated 7 years ago
- Code associated with ICML (2024). "Defense against Backdoor Attack on Pre-trained Language Models via Head Pruning and Attention Normaliz…☆10Feb 22, 2026Updated last month
- Implementation of the paper "Improving the Accuracy-Robustness Trade-off of Classifiers via Adaptive Smoothing".☆10Feb 6, 2024Updated 2 years ago
- Official implementation of "Multi-scale Feature Learning Dynamics: Insights for Double Descent".☆17Jun 10, 2022Updated 3 years ago
- Code for reproducing figures and results in the paper ``Early stopping in deep networks: Double descent and how to eliminate it''☆15Jun 27, 2022Updated 3 years ago
- Neural networks for insurance pricing with frequency and severity data☆11Oct 25, 2023Updated 2 years ago
- Deep Learning Examples☆12Oct 31, 2019Updated 6 years ago
- PyTorch Implementation of Arbitrary Style Transfer in Real-time with Adaptive Instance Normalization☆18Jan 18, 2018Updated 8 years ago
- Understanding Rare Spurious Correlations in Neural Network☆12Jun 5, 2022Updated 3 years ago
- SCoRe: Training Language Models to Self-Correct via Reinforcement Learning☆16Jan 24, 2025Updated last year
- ☆16Mar 1, 2021Updated 5 years ago
- ☆19Sep 10, 2022Updated 3 years ago
- Variational Reinforcement Learning☆17Jul 25, 2024Updated last year
- Reimplementation of simple policy gradient algorithms such as REINFORCE and Actor-Critic methods.☆16Aug 26, 2023Updated 2 years ago
- [NeurIPS 2023] "Learning to Augment Distributions for Out-of-distribution Detection"☆11Nov 14, 2023Updated 2 years ago
- ☆14Mar 4, 2024Updated 2 years ago
- Cross-lingual Language Model (XLM) pretraining and Model-Agnostic Meta-Learning (MAML) for fast adaptation of deep networks☆20Mar 26, 2021Updated 4 years ago
- ☆15Dec 19, 2022Updated 3 years ago
- In this Project, I presented an analysis of a Fashion and Beauty startup’s supply chain data and by collecting, analyzing, and interpreti…☆16Nov 18, 2023Updated 2 years ago
- Python package to accelerate research on generalized out-of-distribution (OOD) detection.☆15Jun 19, 2024Updated last year
- Semi-supervised User Profiling with Heterogeneous Graph Attention Networks, IJCAI 19☆25Aug 18, 2020Updated 5 years ago
- This repository contains the implementation of Concept Activation Regions, a new framework to explain deep neural networks with human con…☆16Oct 7, 2022Updated 3 years ago
- Official Code for Efficient and Effective Augmentation Strategy for Adversarial Training (NeurIPS-2022)☆17Mar 29, 2023Updated 2 years ago
- Reinforcement Learning Generation-Evaluator Architecture for Neural Question Generation☆20Aug 23, 2021Updated 4 years ago
- Multimodal Neurons in Artificial Neural Networks☆16Oct 18, 2021Updated 4 years ago
- taking apart BERT to better understand how to build a classifier pipeline☆11Jun 9, 2019Updated 6 years ago
- ICML 2024 Paper "Adversarial Robustness Limits via Scaling-Law and Human-Alignment Studies"☆17Jul 10, 2024Updated last year
- ☆19Jan 28, 2024Updated 2 years ago
- Code accompanying our paper "Feature Learning in Infinite-Width Neural Networks" (https://arxiv.org/abs/2011.14522)☆62May 11, 2021Updated 4 years ago
- [CVPR 2024] Domain Gap Embeddings for Generative Dataset Augmentation☆22Jun 19, 2024Updated last year
- PG_DIPLOMA_IN_DATA_SCIENCE_IIIT-B_&_UPGRAD☆25Mar 18, 2019Updated 7 years ago
- Components loss for neural networks in mask-based speech enhancement☆33Nov 20, 2020Updated 5 years ago