yobibyte / report
Because we don't want a jupyter notebook mess...
☆61Updated 6 months ago
Alternatives and similar repositories for report
Users that are interested in report are comparing it to the libraries listed below
- ☆60Updated 3 years ago
- Because we don't have enough time to read everything☆89Updated last year
- ☆44Updated last month
- Code for minimum-entropy coupling.☆32Updated 2 weeks ago
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources☆148Updated 2 months ago
- Multi-framework implementation of Deep Kernel Shaping and Tailored Activation Transformations, which are methods that modify neural netwo…☆74Updated 5 months ago
- ☆53Updated last year
- Your favourite classical machine learning algos on the GPU/TPU☆20Updated 11 months ago
- gzip Predicts Data-dependent Scaling Laws☆34Updated last year
- ☆72Updated last year
- ☆10Updated last year
- ☆62Updated last year
- Simplified implementation of UMAP like dimensionality reduction algorithm☆53Updated last year
- Pytorch implementation of preconditioned stochastic gradient descent (Kron and affine preconditioner, low-rank approximation precondition…☆188Updated this week
- Portfolio REgret for Confidence SEquences☆20Updated last year
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX☆92Updated last year
- Neural Networks for JAX☆84Updated last year
- nanoGPT using Equinox☆14Updated 2 years ago
- Understanding how features learned by neural networks evolve throughout training☆40Updated last year
- Parametric differentiable curves with PyTorch for continuous embeddings, shape-restricted models, or KANs☆45Updated 2 weeks ago
- Implementation of Gradient Agreement Filtering, from Chaubard et al. of Stanford, but for single machine microbatches, in Pytorch☆25Updated 10 months ago
- ☆82Updated last year
- ☆211Updated last year
- An implementation of PSGD Kron second-order optimizer for PyTorch☆97Updated 4 months ago
- A Python package for generating concise, high-quality summaries of a probability distribution☆55Updated last month
- ☆56Updated last year
- Code implementing "Efficient Parallelization of a Ubiquitous Sequential Computation" (Heinsen, 2023)☆97Updated last year
- ☆231Updated last week
- Official repository for the paper "Can You Learn an Algorithm? Generalizing from Easy to Hard Problems with Recurrent Networks"☆60Updated 3 years ago
- σ-GPT: A New Approach to Autoregressive Models☆70Updated last year