MurtyShikhar/structural-grokking

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/MurtyShikhar/structural-grokking)

MurtyShikhar / structural-grokking

Code for our ACL '23 paper titled "Grokking of Hierarchical Structure in Vanilla Transformers"

☆26

Alternatives and similar repositories for structural-grokking

Users that are interested in structural-grokking are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

dair-iitd / symnet
View on GitHub
☆10Jun 28, 2022Updated 4 years ago
Genius1237 / TyDiP
View on GitHub
TyDiP Multilingual Politeness dataset and code
☆12Oct 15, 2023Updated 2 years ago
pratyushasharma / sw-combinatoriality
View on GitHub
Dataset and Codebase for paper on "Contextual and Combinatorial Structure in Sperm Whale Vocalisations"
☆36Mar 14, 2024Updated 2 years ago
thunlp / THUCBERT
View on GitHub
A Chinese Character BERT Trained with Multi-Level Masking
☆13Sep 24, 2023Updated 2 years ago
petezh / OpenD5
View on GitHub
Tasks for describing differences between text distributions.
☆17Aug 9, 2024Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
MurtyShikhar / LanguagePatching
View on GitHub
Code for our EMNLP '22 paper "Fixing Model Bugs with Natural Language Patches"
☆19Dec 7, 2022Updated 3 years ago
hansonhl / antra
View on GitHub
Package for defining computation graphs and performing intervention experiments
☆16Oct 1, 2021Updated 4 years ago
JiajingLin / Phys4DGen
View on GitHub
[ACM MM 2025] Phys4DGen: Physics-Compliant 4D Generation with Multi-Material Composition Perception
☆13Apr 18, 2026Updated 3 months ago
haimengzhao / magic-microlensing
View on GitHub
MAGIC: Microlensing Analysis Guided by Intelligent Computation. A PyTorch framework for automatic analysis of realistic microlensing ligh…
☆13May 30, 2024Updated 2 years ago
moskomule / hypergrad
View on GitHub
Simple and extensible hypergradient for PyTorch
☆18Feb 23, 2023Updated 3 years ago
HLR / TSLM
View on GitHub
The Implementation for the Paper "Time-Stamped Language Model: Teaching Language Models toUnderstand The Flow of Events"
☆11May 6, 2021Updated 5 years ago
OSU-NLP-Group / SeeActChromeExtension
View on GitHub
☆18Jan 3, 2025Updated last year
shentianxiao / FiLM
View on GitHub
☆13Oct 18, 2023Updated 2 years ago
tung-nd / cwbc
View on GitHub
☆11Oct 3, 2022Updated 3 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
WildVision-AI / LMM-Engines
View on GitHub
☆17Oct 22, 2024Updated last year
dair-iitd / jeebench
View on GitHub
JEEBench, EMNLP 2023
☆47Dec 18, 2023Updated 2 years ago
addtt / object-centric-library
View on GitHub
Library for the training and evaluation of object-centric models (ICML 2022)
☆72Apr 30, 2023Updated 3 years ago
INK-USC / FaiRR
View on GitHub
FaiRR: Faithful and Robust Deductive Reasoning over Natural Language (ACL 2022)
☆14May 19, 2022Updated 4 years ago
cvi2snt / CPTSketchGraphs
View on GitHub
A dataset of 80 millon constraint preserving transformations of CAD sketches
☆17Nov 22, 2024Updated last year
leiwu0 / sgd.stability
View on GitHub
Analyze the dynamic stability of SGD
☆13Nov 25, 2018Updated 7 years ago
arpit-saxena / schedule-maker
View on GitHub
Generate ics file given a set of courses and slots
☆12Sep 16, 2024Updated last year
zhaochenyang20 / distirbution_is_all_you_need
View on GitHub
distribution-is-all-you-need is the basic distribution probability tutorial for most common distribution focused on Deep learning using t…
☆15Apr 4, 2022Updated 4 years ago
adapter-hub / hgiyt
View on GitHub
Research code for the paper "How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models"
☆28Oct 3, 2021Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Extreme-classification / MUFIN
View on GitHub
Multimodal extreme classification
☆21May 1, 2024Updated 2 years ago
aNOnWhyMooS / connectivity
View on GitHub
☆18Jan 17, 2024Updated 2 years ago
AgentForceTeamOfficial / UA2-Agent
View on GitHub
Official Implementation of UA^{2}-Agent and other baseline algorithms of "Towards Unified Alignment Between Agents, Humans, and Environme…
☆19Nov 12, 2024Updated last year
furiosa-ai / EfficientRollout
View on GitHub
EfficientRollout: System-Aware Self-Speculative Decoding for RL Rollouts
☆16Jun 24, 2026Updated last month
shavarani / SpEL
View on GitHub
Structured Prediction for Entity Linking
☆39Aug 2, 2024Updated last year
ml-research / XIConceptLearning
View on GitHub
Explainable Interactive Concept Learning
☆15Mar 26, 2023Updated 3 years ago
exlaw / PaperReading
View on GitHub
个人论文阅读笔记，记录了所有读过的论文总结，基本每天更新。
☆17Nov 6, 2021Updated 4 years ago
ZhuYun97 / MARIO
View on GitHub
Official implementation of MARIO: Model Agnostic Recipe for Improving OOD Generalization of Graph Contrastive Learning
☆19Jan 27, 2024Updated 2 years ago
spiglerg / TF_ContinualLearningViaSynapticIntelligence
View on GitHub
Tensorflow implementation of the `intelligent synapse' model from [Zenke et al., (2017)] and application to the Permuted MNIST benchmark.
☆22Aug 2, 2017Updated 8 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
hrtan / MoSo
View on GitHub
[NeurIPS-2023] The PyTorch Implementation of MoSo. The algorithms are based on our paper: "Data Pruning via Moving-one-Sample-out". MoSo …
☆10May 21, 2026Updated 2 months ago
yanivbenny / MRNet
View on GitHub
Code for "Multi-scale Abstract Reasoning" paper
☆12Oct 17, 2022Updated 3 years ago
retarfi / language-pretraining
View on GitHub
Pre-training Language Models for Japanese
☆50Jul 2, 2023Updated 3 years ago
eminorhan / video-models
View on GitHub
Menagerie of video models trained on various video datasets
☆10Oct 13, 2024Updated last year
jason9693 / FROZEN
View on GitHub
☆14May 3, 2022Updated 4 years ago
aryamanarora / causalgym
View on GitHub
CausalGym: Benchmarking causal interpretability methods on linguistic tasks
☆54Nov 30, 2024Updated last year
yuPeiyu98 / Diffusion-Amortized-MCMC
View on GitHub
[NeurIPS 2023] Learning Energy-Based Prior Model with Diffusion-Amortized MCMC
☆14Mar 1, 2026Updated 4 months ago