drbh / nnli
Interactively explore `onnx` networks in your CLI.
★25 · Updated last year
Alternatives and similar repositories for nnli
Users who are interested in nnli are comparing it to the libraries listed below.
- Read and write tensorboard data using Rust ★21 · Updated last year
- A user-friendly tool chain that enables the seamless execution of ONNX models using JAX as the backend. ★115 · Updated 3 weeks ago
- JAX bindings for the flash-attention3 kernels ★11 · Updated 11 months ago
- A small python library to run iterators in a separate process ★10 · Updated last year
- Exploration into the Firefly algorithm in Pytorch ★40 · Updated 5 months ago
- A place to store reusable transformer components of my own creation or found on the interwebs ★56 · Updated this week
- A Rust Library for High-Performance Tensor Exchange with Python ★47 · Updated last week
- A collection of optimisers for use with candle ★36 · Updated last month
- Hacks for PyTorch ★19 · Updated 2 years ago
- ★23 · Updated 7 months ago
- Modular Rust transformer/LLM library using Candle ★36 · Updated last year
- Experimental scripts for researching data adaptive learning rate scheduling. ★23 · Updated last year
- H-Net Dynamic Hierarchical Architecture ★22 · Updated this week
- Code for the examples presented in the talk "Training a Llama in your backyard: fine-tuning very large models on consumer hardware" given… ★14 · Updated last year
- ★12 · Updated last year
- Make triton easier ★47 · Updated last year
- MACTA: A Multi-agent Reinforcement Learning Approach for Cache Timing Attacks and Detection ★46 · Updated 2 years ago
- Minimal C++ implementation of GPT2 ★40 · Updated 2 years ago
- Implementation and explorations into Blackbox Gradient Sensing (BGS), an evolutionary strategies approach proposed in a Google Deepmind p… ★17 · Updated last month
- Opinionated library for managing hyperparameters and mutable state of machine learning training systems. ★19 · Updated last year
- Jax like function transformation engine but micro, microjax ★33 · Updated 8 months ago
- ★30 · Updated last year
- Code and data for paper "(How) do Language Models Track State?" ★14 · Updated 3 months ago
- Transformer with Mu-Parameterization, implemented in Jax/Flax. Supports FSDP on TPU pods. ★31 · Updated last month
- Build compute kernels ★77 · Updated this week
- LLM training in simple, raw C/CUDA, migrated into Rust ★47 · Updated 3 months ago
- FlexAttention w/ FlashAttention3 Support ★26 · Updated 9 months ago
- Blazingly fast implementation of the Datasaurus paper. Same Stats, Different Graphs. ★19 · Updated 2 years ago
- Utilities for Training Very Large Models ★58 · Updated 9 months ago
- "PyTorch in Rust" ★16 · Updated last year