msaroufim / mynotesLinks
☆18Updated 3 weeks ago
Alternatives and similar repositories for mynotes
Users that are interested in mynotes are comparing it to the libraries listed below
Sorting:
- Proof-of-concept of global switching between numpy/jax/pytorch in a library.☆18Updated last year
- ML/DL Math and Method notes☆66Updated 2 years ago
- Serialize JAX, Flax, Haiku, or Objax model params with 🤗`safetensors`☆47Updated last year
- ☆22Updated last year
- JAX implementation of the Mistral 7b v0.2 model☆35Updated last year
- Highly commented implementations of Transformers in PyTorch☆138Updated 2 years ago
- Code for the note "NF4 Isn't Information Theoretically Optimal (and that's Good)☆21Updated 2 years ago
- Rax is a Learning-to-Rank library written in JAX.☆336Updated this week
- Seemless interface of using PyTOrch distributed with Jupyter notebooks☆57Updated 4 months ago
- Functional local implementations of main model parallelism approaches☆95Updated 2 years ago
- some common Huggingface transformers in maximal update parametrization (µP)☆87Updated 3 years ago
- A Jax-based library for building transformers, includes implementations of GPT, Gemma, LlaMa, Mixtral, Whisper, SWin, ViT and more.☆300Updated last year
- Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)*☆86Updated 2 years ago
- TorchFix - a linter for PyTorch-using code with autofix support☆152Updated 5 months ago
- NLP Examples using the 🤗 libraries☆40Updated 4 years ago
- See https://github.com/cuda-mode/triton-index/ instead!☆11Updated last year
- ☆69Updated 10 months ago
- Convert scikit-learn models to PyTorch modules☆168Updated last year
- a Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization in pure C.☆23Updated last year
- Experiment of using Tangent to autodiff triton☆82Updated 2 years ago
- Common Python utilities and GitHub Actions in Lightning Ecosystem☆63Updated last week
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX☆93Updated 2 years ago
- This is a port of Mistral-7B model in JAX☆33Updated last year
- Fast, Modern, and Low Precision PyTorch Optimizers☆124Updated last month
- Gzip and nearest neighbors for text classification☆57Updated 2 years ago
- NLP with Rust for Python 🦀🐍☆71Updated 8 months ago
- ☆192Updated last week
- An implementation of the Llama architecture, to instruct and delight☆21Updated 8 months ago
- ☆20Updated 3 years ago
- Deep learning library implemented from scratch in numpy. Mixtral, Mamba, LLaMA, GPT, ResNet, and other experiments.☆53Updated last year