sdpython / onnxcustom
Tutorial on how to convert machine learned models into ONNX
β15Updated last year
Related projects: β
- benchmarking some transformer deploymentsβ26Updated last year
- π€ Trade any tensors over the networkβ30Updated 11 months ago
- Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and teβ¦β42Updated 8 months ago
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found heβ¦β31Updated last year
- A place to store reusable transformer components of my own creation or found on the interwebsβ43Updated 3 weeks ago
- This repository contains code for cleaning your training data of benchmark data to help combat data snooping.β25Updated last year
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning Pβ¦β33Updated last year
- Prototype routines for GPU quantization written using PyTorch.β19Updated 6 months ago
- Repository for Sparse Finetuning of LLMs via modified version of the MosaicML llmfoundryβ36Updated 8 months ago
- β36Updated last year
- Open sourced backend for Martian's LLM Inference Provider Leaderboardβ15Updated last month
- β20Updated last year
- β18Updated this week
- Truly flash T5 realization!β48Updated 4 months ago
- Plug-and-play Search Interfaces with Pyserini and Hugging Faceβ32Updated last year
- Simple and fast low-bit matmul kernels in CUDAβ48Updated this week
- A library for squeakily cleaning and filtering language datasets.β45Updated last year
- **ARCHIVED** Filesystem interface to π€ Hubβ56Updated last year
- Repository for CPU Kernel Generation for LLM Inferenceβ25Updated last year
- Index of URLs to pdf files all over the internet and scriptsβ20Updated last year
- My explorations into editing the knowledge and memories of an attention networkβ34Updated last year
- Using short models to classify long textsβ20Updated last year
- Techniques used to run BLOOM at inference in parallelβ37Updated last year
- β61Updated 3 weeks ago
- β24Updated last year
- Large Scale Distributed Model Training strategy with Colossal AI and Lightning AIβ58Updated last year
- Make triton easierβ39Updated 3 months ago
- Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/β¦β22Updated 5 months ago
- Ranking of fine-tuned HF models as base models.β35Updated last year
- A safetensors extension to efficiently store sparse quantized tensors on diskβ26Updated this week