1.2% test error on MNIST using only least squares and numpy calls.
☆22Sep 13, 2023Updated 2 years ago
Alternatives and similar repositories for mnist_1_pt_2
Users that are interested in mnist_1_pt_2 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The official implementation of HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalization☆19Mar 7, 2025Updated last year
- Official Pytorch implementation of Chromatic Graph Transformers☆10Jun 14, 2023Updated 2 years ago
- Clustered Compositional Embeddings☆13Oct 25, 2023Updated 2 years ago
- codes and plots for "Active-Dormant Attention Heads: Mechanistically Demystifying Extreme-Token Phenomena in LLMs"☆11Dec 30, 2024Updated last year
- Implementation of Unified Embedding: Battle-Tested Feature Representations for Web-Scale ML Systems☆15Nov 11, 2023Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- A Zen approach to configuring your Python project☆17Feb 27, 2026Updated 3 months ago
- ☆12Sep 16, 2024Updated last year
- Find context neurons in Pythia models.☆13Jun 13, 2023Updated 2 years ago
- Least Squares Regression for subspace clustering☆11May 27, 2018Updated 8 years ago
- ☆12Mar 19, 2021Updated 5 years ago
- An open source forum system written in D Programming Language, based on Hunt Framework.☆12Apr 15, 2022Updated 4 years ago
- Minimalistic display of websites. With tiling, theme, and preset support☆75Dec 12, 2022Updated 3 years ago
- Conditional Linear Dynamical Systems☆17Oct 7, 2025Updated 8 months ago
- Rust library for interfacing with GOG API☆12Mar 26, 2026Updated 2 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Lightweight web service clients in the WasmEdge Runtime using the Rust reqwest framework☆12Feb 6, 2026Updated 4 months ago
- Scripts to create, manage and backup Wordpress on Podman.☆10Nov 28, 2023Updated 2 years ago
- Program memory visualizer for GDB/LLDB (bachelor thesis)☆12Apr 7, 2026Updated 2 months ago
- Codes for the paper The emergence of clusters in self-attention dynamics.☆18Dec 18, 2023Updated 2 years ago
- ☆10Sep 27, 2021Updated 4 years ago
- Code for PAC-Bayes Compression Bounds So Tight That They Can Explain Generalization, NeurIPS 2022☆18Nov 23, 2022Updated 3 years ago
- ☆17Oct 27, 2025Updated 7 months ago
- Widescreen and wider solution for Omikron: The Nomad Soul☆15Jan 9, 2022Updated 4 years ago
- A C++ library implementing fast language models estimation using the 1-Sort algorithm.☆16May 18, 2023Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Tidy autoregressive inference in JAX☆15Sep 1, 2025Updated 9 months ago
- Firefox extension that allows to copy URL with clicked element ID from context menu☆11May 6, 2023Updated 3 years ago
- Computer Systems Lab☆13Oct 16, 2025Updated 7 months ago
- Several common methods of matrix multiplication are implemented on CPU and Nvidia GPU using C++11 and CUDA.☆14Feb 8, 2023Updated 3 years ago
- u-MPS implementation and experimentation code used in the paper Tensor Networks for Probabilistic Sequence Modeling (https://arxiv.org/ab…☆19Jul 2, 2020Updated 5 years ago
- A (very experimental) WebAssembly backend for Cranelift.☆15Aug 5, 2022Updated 3 years ago
- ☆13Jul 12, 2024Updated last year
- AutoAuth is a WIP extension for IndieAuth without the user being present☆13Mar 10, 2019Updated 7 years ago
- Official repository for our paper, Transformers Learn Higher-Order Optimization Methods for In-Context Learning: A Study with Linear Mode…☆19Nov 19, 2024Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆14Apr 18, 2020Updated 6 years ago
- ☆33Oct 4, 2024Updated last year
- KDS software for Kinase Drug Selectivity☆11Jun 8, 2023Updated 3 years ago
- Tool for loading and testing native shaders translated from crosstl☆13Dec 15, 2024Updated last year
- Flatpak bundle with Wine 32+64 bit (wow64) with Flatpak Sdk 21.08 with Compat.i386 runtime☆16Mar 25, 2024Updated 2 years ago
- Flatpak manifest for distrobox☆16Oct 19, 2022Updated 3 years ago
- Bayesian No-Effect-Concentration estimation in R☆15Apr 9, 2026Updated 2 months ago