Codes for the paper The emergence of clusters in self-attention dynamics.
☆17Dec 18, 2023Updated 2 years ago
Alternatives and similar repositories for 2023-transformers
Users that are interested in 2023-transformers are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A toolbox for learning with neural ODEs.☆10Feb 26, 2023Updated 3 years ago
- Codes for the paper "A mathematical perspective on Transformers".☆39Jul 8, 2024Updated last year
- Code for experiments on transformers using Markovian data.☆22Nov 22, 2024Updated last year
- u-MPS implementation and experimentation code used in the paper Tensor Networks for Probabilistic Sequence Modeling (https://arxiv.org/ab…☆19Jul 2, 2020Updated 5 years ago
- ☆25Dec 20, 2023Updated 2 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Multi-Layer Sparse Autoencoders (ICLR 2025)☆29Feb 6, 2026Updated last month
- Efficient implementation of Generative Stochastic Networks☆12Nov 28, 2013Updated 12 years ago
- Code for 'Inference Suboptimality in Variational Autoencoders'☆10May 22, 2020Updated 5 years ago
- Basic implementation of variational autoencoders in Torch☆10Apr 16, 2016Updated 9 years ago
- ☆15Jul 13, 2025Updated 8 months ago
- Official Pytorch implementation of Chromatic Graph Transformers☆10Jun 14, 2023Updated 2 years ago
- Clustered Compositional Embeddings☆11Oct 25, 2023Updated 2 years ago
- Ἀνατομή is a PyTorch library to analyze representation of neural networks☆13Jan 31, 2024Updated 2 years ago
- Estimators for Information Theoretic Functionals using Influence Functions☆11Apr 17, 2016Updated 9 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- A Zen approach to configuring your Python project☆16Feb 27, 2026Updated 3 weeks ago
- ☆17Jan 5, 2018Updated 8 years ago
- Dynamic mode decomposition in Python☆13Jun 9, 2015Updated 10 years ago
- Unofficial Scalable-Softmax Is Superior for Attention☆20May 30, 2025Updated 9 months ago
- Create string diagrams with LaTeX!☆14Jan 3, 2025Updated last year
- ☆12Jan 17, 2024Updated 2 years ago
- Personal solutions to the Triton Puzzles☆20Jul 18, 2024Updated last year
- Code to reproduce key results accompanying "SAEs (usually) Transfer Between Base and Chat Models"☆13Jul 18, 2024Updated last year
- Least Squares Regression for subspace clustering☆10May 27, 2018Updated 7 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Code for Unbiased Implicit Variational Inference (UIVI)☆15Jan 18, 2019Updated 7 years ago
- Toolkit for Bayesian scaling analysis☆14Sep 8, 2022Updated 3 years ago
- A multiphase field model based on machine learning method☆49Feb 10, 2022Updated 4 years ago
- Conditional Linear Dynamical Systems☆16Oct 7, 2025Updated 5 months ago
- The Compositionality article class.☆13Mar 16, 2026Updated last week
- Links to recourses for the Lean Theorem Prover☆12Dec 3, 2019Updated 6 years ago
- A collection of important papers on Generalizable Diffusion-generated Image Detection☆17Mar 20, 2025Updated last year
- Experiment with Neural ODE on Pytorch☆14Aug 9, 2019Updated 6 years ago
- Repository for paper Decrypting Cryptic Crosswords☆10Jan 15, 2022Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Code for the KDD 2021 paper 'Filtration Curves for Graph Representation'☆18Aug 8, 2023Updated 2 years ago
- 1.2% test error on MNIST using only least squares and numpy calls.☆21Sep 13, 2023Updated 2 years ago
- ☆12Apr 17, 2025Updated 11 months ago
- Codebase the paper "The Remarkable Robustness of LLMs: Stages of Inference?"☆19Jun 11, 2025Updated 9 months ago
- Code for verifying deep neural feature ansatz☆22May 3, 2023Updated 2 years ago
- コンピュータビジョン研究コミュニティcvpaper.challengeのサマリ。サーベイ資料や研究成果など。☆12Jan 20, 2021Updated 5 years ago
- Several common methods of matrix multiplication are implemented on CPU and Nvidia GPU using C++11 and CUDA.☆15Feb 8, 2023Updated 3 years ago