Codes for the paper "A mathematical perspective on Transformers".
☆39 · Jul 8, 2024 · Updated last year
Alternatives and similar repositories for 2023-transformers-rotf
Users that are interested in 2023-transformers-rotf are comparing it to the libraries listed below.
- Codes for the paper The emergence of clusters in self-attention dynamics. ☆17 · Dec 18, 2023 · Updated 2 years ago
- A toolbox for learning with neural ODEs. ☆10 · Feb 26, 2023 · Updated 3 years ago
- Tutorials for the book. ☆16 · Feb 16, 2022 · Updated 4 years ago
- Sketching-based matrix computations for numpy arrays ☆17 · Oct 29, 2019 · Updated 6 years ago
- ☆11 · May 27, 2021 · Updated 4 years ago
- Real Time Evolving Substrate Hypercube based Neuro-Evolution of Augmenting Topologies. ☆17 · Feb 29, 2024 · Updated 2 years ago
- Testing Theory of Mind (ToM) in language models with epistemic logic ☆22 · Dec 13, 2023 · Updated 2 years ago
- ☆14 · Oct 8, 2016 · Updated 9 years ago
- A lightweight, multithreaded Python package for sketching, column selection, leverage scores and related computations. ☆20 · Sep 17, 2025 · Updated 6 months ago
- [ICLR 2025] RaSA: Rank-Sharing Low-Rank Adaptation ☆10 · May 19, 2025 · Updated 10 months ago
- Neural network compatible DDEs ☆13 · Apr 8, 2025 · Updated last year
- ☆14 · Apr 7, 2023 · Updated 3 years ago
- ☆18 · Sep 19, 2023 · Updated 2 years ago
- Dynamic mode decomposition in Python ☆13 · Jun 9, 2015 · Updated 10 years ago
- Transformers with doubly stochastic attention ☆54 · Sep 14, 2022 · Updated 3 years ago
- ☆16 · Apr 26, 2023 · Updated 2 years ago
- Cross-compilation of PyTorch armv7l (32bit) for RaspberryPi OS ☆21 · Feb 10, 2022 · Updated 4 years ago
- Code for the paper Progressive Inference-Time Annealing of Diffusion Models for Sampling from Boltzmann Densities. ☆36 · Jul 23, 2025 · Updated 8 months ago
- [ECCV 2024] Official PyTorch implementation of LUT "Learning with Unmasked Tokens Drives Stronger Vision Learners" ☆13 · Dec 1, 2024 · Updated last year
- Official implementation for the paper "Controlled Sparsity via Constrained Optimization" ☆12 · Aug 10, 2022 · Updated 3 years ago
- ☆12 · May 30, 2024 · Updated last year
- Toolkit for Bayesian scaling analysis ☆14 · Sep 8, 2022 · Updated 3 years ago
- Implementation of progressive meshes by Hugues Hoppe ☆13 · Jul 14, 2019 · Updated 6 years ago
- A mobile GUI search engine using a vision-language model ☆14 · May 5, 2025 · Updated 11 months ago
- ☆10 · May 24, 2021 · Updated 4 years ago
- Quickly get custom prompt contexts ☆14 · Mar 19, 2026 · Updated 3 weeks ago
- This repository contains the scripts for reproducing the results presented in Costa AC, Ahamed T, Jordan D, Stephens GJ (2023) "A Markov… ☆11 · Sep 25, 2025 · Updated 6 months ago
- Blazing fast data loading with HuggingFace Dataset and Ray Data ☆16 · Jan 12, 2024 · Updated 2 years ago
- Public repository for sharing talk materials ☆13 · Aug 15, 2024 · Updated last year
- ☆19 · Sep 15, 2022 · Updated 3 years ago
- This repository implements a Diffusion Factor Model for financial data. ☆47 · Nov 6, 2025 · Updated 5 months ago
- Calculates a set of unique abbreviations for a given set of strings ☆17 · Nov 25, 2025 · Updated 4 months ago
- Information and artifacts for "LoRA Learns Less and Forgets Less" (TMLR, 2024) ☆20 · Sep 27, 2024 · Updated last year
- Simple MATLAB code for randomized matrix computation ☆23 · Nov 13, 2015 · Updated 10 years ago
- Training and evaluation scripts for JGLUE, a Japanese language understanding benchmark ☆18 · Apr 6, 2026 · Updated last week
- Lyrics crawling, pre-processing, embedding generation, model training, and lyrics generation - all in one tool ☆14 · Nov 4, 2018 · Updated 7 years ago
- Projection operator method for statistical data analysis ☆10 · Mar 11, 2025 · Updated last year
- ☆13 · Oct 14, 2021 · Updated 4 years ago
- Project page for paper Self-supervised Representation Learning with Relative Predictive Coding ☆19 · Jul 8, 2021 · Updated 4 years ago