bremen79 / preciseLinks
Portfolio REgret for Confidence SEquences
☆20Updated 2 weeks ago
Alternatives and similar repositories for precise
Users that are interested in precise are comparing it to the libraries listed below
Sorting:
- Code for minimum-entropy coupling.☆32Updated 2 weeks ago
- ☆75Updated last year
- Because we don't want a jupyter notebook mess...☆61Updated 7 months ago
- Sparse and discrete interpretability tool for neural networks☆64Updated last year
- Understanding how features learned by neural networks evolve throughout training☆41Updated last year
- Official Implementation of the ICML 2023 paper: "Neural Wave Machines: Learning Spatiotemporally Structured Representations with Locally …☆77Updated 2 years ago
- This repository includes code to reproduce the tables in "Loss Landscapes are All You Need: Neural Network Generalization Can Be Explaine…☆40Updated 2 years ago
- ☆60Updated 3 years ago
- ☆44Updated 2 months ago
- Deep Networks Grok All the Time and Here is Why☆38Updated last year
- ☆56Updated last year
- ☆28Updated 2 years ago
- Multi-framework implementation of Deep Kernel Shaping and Tailored Activation Transformations, which are methods that modify neural netwo…☆75Updated 6 months ago
- Parameter-Free Optimizers for Pytorch☆130Updated last year
- ☆62Updated last year
- Implementations of growing and pruning in neural networks☆22Updated 2 years ago
- The Energy Transformer block, in JAX☆63Updated 2 years ago
- Jax like function transformation engine but micro, microjax☆34Updated last year
- Code Release for "Broken Neural Scaling Laws" (BNSL) paper☆59Updated 2 years ago
- Code for the paper "Function-Space Learning Rates"☆23Updated 7 months ago
- Implementation of Gradient Agreement Filtering, from Chaubard et al. of Stanford, but for single machine microbatches, in Pytorch☆25Updated last year
- Notebooks accompanying Anthropic's "Toy Models of Superposition" paper☆132Updated 3 years ago
- Engineering the state of RNN language models (Mamba, RWKV, etc.)☆32Updated last year
- gzip Predicts Data-dependent Scaling Laws☆34Updated last year
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX☆92Updated last year
- Minimum Description Length probing for neural network representations☆20Updated 11 months ago
- ☆35Updated last year
- ☆27Updated last year
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training☆132Updated last year
- Latent Diffusion Language Models☆70Updated 2 years ago