Codes for the paper "A mathematical perspective on Transformers".
☆39Jul 8, 2024Updated last year
Alternatives and similar repositories for 2023-transformers-rotf
Users that are interested in 2023-transformers-rotf are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Codes for the paper The emergence of clusters in self-attention dynamics.☆18Dec 18, 2023Updated 2 years ago
- A toolbox for learning with neural ODEs.☆10Feb 26, 2023Updated 3 years ago
- ☆18May 25, 2023Updated 3 years ago
- Tutorials for the book.☆17Feb 16, 2022Updated 4 years ago
- Pragmatic models for generating and following instructions☆13Dec 22, 2019Updated 6 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- The IGM-Vis project to Interactively visualize intergalactic medium (IGM) and circumgalactic medium (CGM) data in a Cosmic Web context. D…☆15Nov 15, 2019Updated 6 years ago
- Single-step image generation at 306 FPS. Drifting vs Diffusion head-to-head on CIFAR-10.☆44Feb 13, 2026Updated 3 months ago
- Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)☆10Oct 3, 2022Updated 3 years ago
- Neural network compatible DDEs☆13Apr 8, 2025Updated last year
- implicit-SINDy code example from paper "Inferring Biological Networks by Sparse Identification fo Nonlinear Dynamics" http://ieeexplore.i…☆32Oct 2, 2017Updated 8 years ago
- Transformers with doubly stochastic attention☆54Sep 14, 2022Updated 3 years ago
- non-rigid registration in NIMBLE: A Non-rigid Hand Model with Bones and Muscles☆11Sep 2, 2022Updated 3 years ago
- Renderer used for レイトレ合宿8☆12Sep 10, 2022Updated 3 years ago
- This is a comprehensive guide on how you can automate your feature engineering process.☆11Jun 25, 2018Updated 7 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆12Sep 1, 2023Updated 2 years ago
- Unsupervised domain adaptation with BERT for Amazon food product reviews sentiment analysis.☆15Oct 6, 2020Updated 5 years ago
- A multiphase field model based on machine learning method☆49Feb 10, 2022Updated 4 years ago
- [ICML 2022] Learning Efficient and Robust Ordinary Differential \\ Equations via Invertible Neural Networks☆10Apr 14, 2023Updated 3 years ago
- Blazing fast data loading with HuggingFace Dataset and Ray Data☆15Jan 12, 2024Updated 2 years ago
- ☆19Nov 11, 2023Updated 2 years ago
- Lightweight arXiv literature digest skill for OpenClaw — Zotero-driven interest profiling, 3-dimensional candidate ranking, abstract-firs…☆44Mar 24, 2026Updated 2 months ago
- Repository for paper Decrypting Cryptic Crosswords☆11Jan 15, 2022Updated 4 years ago
- Official implementation for the paper "Controlled Sparsity via Constrained Optimization"☆12Aug 10, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Visualization Tool for the H2O dataset☆11May 17, 2022Updated 4 years ago
- Information and artifacts for "LoRA Learns Less and Forgets Less" (TMLR, 2024)☆21Sep 27, 2024Updated last year
- Links to recourses for the Lean Theorem Prover☆13Dec 3, 2019Updated 6 years ago
- Training and evaluation scripts for JGLUE, a Japanese language understanding benchmark☆18May 20, 2026Updated last week
- Experiment with Neural ODE on Pytorch☆14Aug 9, 2019Updated 6 years ago
- Projection operator method for statistical data analysis☆10Mar 11, 2025Updated last year
- ☆13Oct 14, 2021Updated 4 years ago
- Project page for paper Self-supervised Representation Learning with Relative Predictive Coding☆19Jul 8, 2021Updated 4 years ago
- Codes for the paper "CausalCite: A Causal Formulation of Paper Citations" (2023)☆16Jan 11, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A PyTorch Implementation of PlaNet: A Deep Planning Network for Reinforcement Learning☆13Aug 31, 2020Updated 5 years ago
- Code of Truly Batch Model-Free Inverse Reinforcement Learning about Multiple Intentions☆13May 22, 2023Updated 3 years ago
- Why Low-Precision Transformer Training Fails: An Analysis on Flash Attention☆68Apr 7, 2026Updated last month
- This is a fork of the awesome Joey-NMT with Reinforcement Learning algorithms like Policy Gradient, MRT and Advantage Actor Critic.☆27Feb 10, 2023Updated 3 years ago
- ☆16Apr 12, 2023Updated 3 years ago
- Machine learning algorithms for discovering dimensionless groups from simulation and experimental data☆16Oct 12, 2022Updated 3 years ago
- Jax-based MaxEnt☆17Nov 24, 2019Updated 6 years ago