nlpodyssey / goslideLinks
SLIDE (Sub-LInear Deep learning Engine) written in Go
☆45Updated 5 years ago
Alternatives and similar repositories for goslide
Users that are interested in goslide are comparing it to the libraries listed below
Sorting:
- ☆74Updated last year
- gRPC server for hnswlib☆16Updated 2 years ago
- benchmarking some transformer deployments☆26Updated 2 years ago
- ArrayFire's Machine Learning Library.☆105Updated 6 years ago
- A tracing JIT compiler for PyTorch☆13Updated 3 years ago
- Implements EvoNorms B0 and S0 as proposed in Evolving Normalization-Activation Layers.☆11Updated 5 years ago
- A Clustering Based Classification Algorithm☆28Updated 3 years ago
- Package for estimating the entropy of a mixture distribution☆15Updated 7 years ago
- MozoLM: A language model (LM) serving library☆45Updated 3 weeks ago
- Nod.ai 🦈 version of 👻 . You probably want to start at https://github.com/nod-ai/shark for the product and the upstream IREE repository …☆106Updated 7 months ago
- 🌳 A compressed rank/select dictionary exploiting approximate linearity and repetitiveness.☆11Updated 3 years ago
- Fast and vectorizable algorithms for searching in a vector of sorted floating point numbers☆146Updated 8 months ago
- Clover: Quantized 4-bit Linear Algebra Library☆114Updated 7 years ago
- Test data for DALI project☆43Updated 3 months ago
- Utilities for sequential processing of tar files.☆24Updated 3 years ago
- Fast stand-alone C++ decoder for RNN-based NMT models☆27Updated 4 years ago
- A CUDA implementation of the Tsetlin Machine based on bitwise operators☆26Updated 6 years ago
- A collection of optimizers, some arcane others well known, for Flax.☆29Updated 4 years ago
- "Moshpit SGD: Communication-Efficient Decentralized Training on Heterogeneous Unreliable Devices", official implementation☆29Updated 6 months ago
- Fast and versatile tokenizer for language models, compatible with SentencePiece, Tokenizers, Tiktoken and more. Supports BPE, Unigram and…☆31Updated 5 months ago
- Standalone commandline CLI tool for compiling Triton kernels☆18Updated 11 months ago
- Personal collection of references for high performance mixed precision training.☆41Updated 5 years ago
- Deep neural network framework for multiple GPUs☆33Updated 10 years ago
- Test winograd convolution written in TVM for CUDA and AMDGPU☆41Updated 6 years ago
- MXNet - nGraph integration☆34Updated 3 years ago
- A Learnable LSH Framework for Efficient NN Training☆32Updated 4 years ago
- A runtime shape checker and auto-annotator for tensor programs (pronounced "stanley")☆40Updated 5 years ago
- Proof of concept on how to use TensorFlow for prediction tasks in a multiprocess setting.☆18Updated 6 years ago
- Distributed Bayesian Optimization☆23Updated 5 years ago
- nGraph™ Backend for ONNX☆42Updated 2 years ago