MurtyShikhar / structural-grokkingView external linksLinks
Code for our ACL '23 paper titled "Grokking of Hierarchical Structure in Vanilla Transformers"
☆24Oct 8, 2023Updated 2 years ago
Alternatives and similar repositories for structural-grokking
Users that are interested in structural-grokking are comparing it to the libraries listed below
Sorting:
- Code for Pushdown Layers from our EMNLP 2023 paper☆29Dec 3, 2023Updated 2 years ago
- Phys4DGen: A Physics-Driven Framework for Controllable and Efficient 4D Content Generation from a Single Image☆12May 10, 2025Updated 9 months ago
- Main repo for GIOROM☆18Sep 28, 2025Updated 4 months ago
- Repository for the code and dataset for the paper: "Have LLMs Advanced enough? Towards Harder Problem Solving Benchmarks For Large Langu…☆39Dec 18, 2023Updated 2 years ago
- ☆18Jan 3, 2025Updated last year
- A dataset of 80 millon constraint preserving transformations of CAD sketches☆12Nov 22, 2024Updated last year
- ☆17Oct 22, 2024Updated last year
- Simple and extensible hypergradient for PyTorch☆18Feb 23, 2023Updated 2 years ago
- Official Implementation of UA^{2}-Agent and other baseline algorithms of "Towards Unified Alignment Between Agents, Humans, and Environme…☆19Nov 12, 2024Updated last year
- Official implementation of MARIO: Model Agnostic Recipe for Improving OOD Generalization of Graph Contrastive Learning☆19Jan 27, 2024Updated 2 years ago
- A Concept-Centric Framework for Intelligent Agents☆22Oct 1, 2025Updated 4 months ago
- My personal web page☆11Oct 20, 2025Updated 3 months ago
- The KiloGram Tangrams dataset☆58Apr 25, 2025Updated 9 months ago
- This is the official code repository for the paper "Language Agents Meet Causality -- Bridging LLMs and Causal World Models"☆28May 6, 2025Updated 9 months ago
- Discovering Universal Geometry in Embeddings with ICA (Published in EMNLP 2023)☆20Jun 17, 2025Updated 7 months ago
- Benchmarking Generalization to New Tasks from Natural Language Instructions☆26Jul 2, 2021Updated 4 years ago
- [arXiv, 2024] Show Me What and Where has Changed? Question Answering and Grounding for Remote Sensing Change Detection☆34Jul 2, 2025Updated 7 months ago
- ☆47Oct 2, 2025Updated 4 months ago
- Pytorch implementation of same-family gaussian mixture models with guardrails. Features separable parameter optimization and singularity …☆26May 31, 2025Updated 8 months ago
- (NeurIPS 2024) What Makes CLIP More Robust to Long-Tailed Pre-Training Data? A Controlled Study for Transferable Insights☆28Oct 28, 2024Updated last year
- [EMNLP 2024 Findings] Benchmarking Language Model Agents for Data-Driven Science☆34Oct 25, 2024Updated last year
- ☆30May 19, 2024Updated last year
- StepCoder: Improve Code Generation with Reinforcement Learning from Compiler Feedback☆74Aug 31, 2024Updated last year
- Codebase for fine-tuning Llama2 70B to generate math test questions and answers.☆11Aug 30, 2024Updated last year
- TyDiP Multilingual Politeness dataset and code☆12Oct 15, 2023Updated 2 years ago
- Program and links to the material for the GloBIAS Training School 2025, Kobe, Japan.☆22Oct 27, 2025Updated 3 months ago
- ☆37Mar 17, 2025Updated 10 months ago
- A collection of code(or link) for awesome blender script for 3D content creation.☆30Aug 7, 2024Updated last year
- Library for the training and evaluation of object-centric models (ICML 2022)☆71Apr 30, 2023Updated 2 years ago
- Agent-based LLM modeling of mechanics problems☆39Feb 10, 2024Updated 2 years ago
- Code for NeurIPS'24 paper 'Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization'☆235Jul 19, 2025Updated 6 months ago
- MirMachine, a command line tool to detect microRNA homologs in genome sequences.☆13Dec 3, 2025Updated 2 months ago
- openaivec☆20Updated this week
- Repo for paper "CODIS: Benchmarking Context-Dependent Visual Comprehension for Multimodal Large Language Models".☆12Oct 14, 2024Updated last year
- Concurrency library☆16Oct 13, 2024Updated last year
- This repository is the official implementation of Topology-Informed Graph Transformer (Choi et al., GRaM Workshop at ICML 2024).☆12Dec 28, 2024Updated last year
- ☆11Dec 23, 2024Updated last year
- ☆11Apr 4, 2017Updated 8 years ago
- Official implementation of paper "Efficient Tuning and Inference for Large Language Models on Textual Graphs"☆37Jun 24, 2024Updated last year