sandlogic / SandLogic-Lexicons
SandLogic Lexicons
β18Updated 6 months ago
Alternatives and similar repositories for SandLogic-Lexicons:
Users that are interested in SandLogic-Lexicons are comparing it to the libraries listed below
- Notebooks for fine tuning pali gemmaβ100Updated last week
- Notebook and Scripts that showcase running quantized diffusion models on consumer GPUsβ38Updated 5 months ago
- π€ Trade any tensors over the networkβ30Updated last year
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.β34Updated 4 months ago
- β32Updated 2 years ago
- [EMNLP Findings 2024] MobileQuant: Mobile-friendly Quantization for On-device Language Modelsβ56Updated 7 months ago
- Quantization of LLMs and benchmarking.β10Updated last year
- Machine Learning Serving focused on GenAI with simplicity as the top priority.β58Updated 2 weeks ago
- β13Updated last year
- Cray-LM unified training and inference stack.β22Updated 2 months ago
- vLLM: A high-throughput and memory-efficient inference and serving engine for LLMsβ86Updated this week
- The Triton backend for TensorRT.β73Updated this week
- Memory-Efficient CUDA kernels for training ConvNets with PyTorch.β40Updated last month
- This repository shows various ways of deploying a vision model (TensorFlow) from π€ Transformers.β30Updated 2 years ago
- This code repository contains the code used for my "Optimizing Memory Usage for Training LLMs and Vision Transformers in PyTorch" blog poβ¦β91Updated last year
- Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and teβ¦β42Updated last year
- A general 2-8 bits quantization toolbox with GPTQ/AWQ/HQQ/VPTQ, and export to onnx/onnx-runtime easily.β168Updated 3 weeks ago
- Repo hosting codes and materials related to speeding LLMs' inference using token merging.β36Updated 11 months ago
- Notebooks to demonstrate TimmWrapperβ16Updated 3 months ago
- Presents comprehensive benchmarks of XLA-compatible pre-trained models in Keras.β37Updated last year
- Machine Learning Agility (MLAgility) benchmark and benchmarking toolsβ39Updated last month
- Fast sparse deep learning on CPUsβ53Updated 2 years ago
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectioβ¦β80Updated 10 months ago
- Model compression for ONNXβ91Updated 5 months ago
- Lightweight, open-source, high-performance Yolo implementationβ25Updated 2 weeks ago
- Google TPU optimizations for transformers modelsβ108Updated 3 months ago
- Set of scripts to finetune LLMsβ37Updated last year
- β20Updated 3 years ago
- Inference Vision Transformer (ViT) in plain C/C++ with ggmlβ266Updated last year
- Composition of Multimodal Language Models From Scratchβ14Updated 8 months ago