aks2203 / easy-to-hard
Official repository for the paper "Can You Learn an Algorithm? Generalizing from Easy to Hard Problems with Recurrent Networks"
☆61 Mar 1, 2022 Updated 3 years ago
Alternatives and similar repositories for easy-to-hard
Users that are interested in easy-to-hard are comparing it to the libraries listed below
- A centralized place for deep thinking code and experiments ☆90 Aug 9, 2023 Updated 2 years ago
- codebase for the SIMAT dataset and evaluation ☆38 Feb 16, 2022 Updated 4 years ago
- A case study of efficient training of large language models using commodity hardware. ☆68 Aug 4, 2022 Updated 3 years ago
- [ICLR'25] "Understanding Bottlenecks of State Space Models through the Lens of Recency and Over-smoothing" by Peihao Wang, Ruisi Cai, Yue… ☆17 Mar 21, 2025 Updated 10 months ago
- ☆11 Apr 14, 2022 Updated 3 years ago
- Image and video processing toolbox ☆10 Jun 12, 2020 Updated 5 years ago
- Official repository for the paper: "Trees with Attention for Set Prediction Tasks" (ICML21) ☆10 Jan 19, 2022 Updated 4 years ago
- Pytorch Datasets for Easy-To-Hard ☆29 Jan 9, 2025 Updated last year
- ☆11 Aug 26, 2021 Updated 4 years ago
- Pytorch ImageNet1k Loader with Bounding Boxes. ☆13 Jan 23, 2022 Updated 4 years ago
- Pretraining summarization models using a corpus of nonsense ☆13 Sep 28, 2021 Updated 4 years ago
- ☆11 Jun 2, 2021 Updated 4 years ago
- Official code for the paper "Provable Compositional Generalization for Object-Centric Learning" (ICLR 2024, oral) ☆15 Aug 26, 2024 Updated last year
- ☆33 Nov 27, 2023 Updated 2 years ago
- Official Code for "Baseline Defenses for Adversarial Attacks Against Aligned Language Models" ☆31 Oct 26, 2023 Updated 2 years ago
- ☆14 Jul 30, 2022 Updated 3 years ago
- CCQA: A New Web-Scale Question Answering Dataset for Model Pre-Training ☆32 Jul 20, 2022 Updated 3 years ago
- ☆28 Sep 13, 2022 Updated 3 years ago
- ☆38 Mar 9, 2021 Updated 4 years ago
- ☆14 Feb 9, 2022 Updated 4 years ago
- The Intermediate Goal of the project is to train a GPT like architecture to learn to summarise reddit posts from human preferences, as th… ☆12 Jul 14, 2021 Updated 4 years ago
- ☆16 Jul 17, 2022 Updated 3 years ago
- ☆17 Jun 4, 2021 Updated 4 years ago
- Generalised UDRL ☆37 May 12, 2022 Updated 3 years ago
- ☆42 Jun 19, 2024 Updated last year
- Robust Contrastive Learning Using Negative Samples with Diminished Semantics (NeurIPS 2021) ☆39 Dec 6, 2021 Updated 4 years ago
- Contains my experiments with the `big_vision` repo to train ViTs on ImageNet-1k. ☆22 Jan 16, 2023 Updated 3 years ago
- Code for Unbiased Implicit Variational Inference (UIVI) ☆15 Jan 18, 2019 Updated 7 years ago
- The WaveFunctionCollapse algorithm in Julia. ☆22 Jan 2, 2019 Updated 7 years ago
- My own playground for PLP (Programming Language Processing) using DeepLearning techniques ☆19 Apr 12, 2023 Updated 2 years ago
- Parameter-Space Saliency Maps for Explainability ☆23 Mar 21, 2023 Updated 2 years ago
- Tensor-like types – with variadic shapes – that support both static and runtime type checking, and convenient parsing ☆20 Jan 9, 2026 Updated last month
- ☆22 Jan 19, 2023 Updated 3 years ago
- A highly sophisticated sequence-to-sequence model for code generation ☆40 Jul 1, 2021 Updated 4 years ago
- Code for the paper, "First Contact: Unsupervised Human-Machine Co-Adaptation via Mutual Information Maximization" ☆26 Jul 5, 2022 Updated 3 years ago
- Code for 'Emergent Symbols through Binding in External Memory'. ☆21 May 4, 2023 Updated 2 years ago
- ☆23 Dec 15, 2022 Updated 3 years ago
- ☆20 Jul 6, 2023 Updated 2 years ago
- The Full Spectrum of Deepnet Hessians at Scale: Dynamics with SGD Training and Sample Size ☆19 May 19, 2019 Updated 6 years ago