iboB / git-lfs-download
Download full or partial git-lfs repos without temporarily using 2x disk space
☆30 Updated last year
Alternatives and similar repositories for git-lfs-download
Users who are interested in git-lfs-download are comparing it to the libraries listed below.
- Rust bindings for CTranslate2 ☆14 Updated 2 years ago
- GGML implementation of BERT model with Python bindings and quantization. ☆55 Updated last year
- Trying to deconstruct RWKV in understandable terms ☆14 Updated 2 years ago
- Here we collect trick questions and failed tasks for open source LLMs to improve them. ☆32 Updated 2 years ago
- Web browser version of StarCoder.cpp ☆45 Updated last year
- Rust crate for some audio utilities ☆24 Updated 3 months ago
- A simple library for working with Hugging Face models. ☆14 Updated 5 months ago
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit ☆63 Updated 2 years ago
- A collection of reproducible inference engine benchmarks ☆31 Updated 2 months ago
- ☆11 Updated last year
- A converter and basic tester for rwkv onnx ☆42 Updated last year
- Run ONNX RWKV-v4 models with GPU acceleration using DirectML [Windows], or just on CPU [Windows AND Linux]; Limited to 430M model at this… ☆21 Updated 2 years ago
- Port of Suno AI's Bark in C/C++ for fast inference ☆52 Updated last year
- ☆39 Updated 2 years ago
- implementation of https://arxiv.org/pdf/2312.09299 ☆20 Updated 11 months ago
- Experiments with BitNet inference on CPU ☆54 Updated last year
- Make triton easier ☆46 Updated last year
- ANE accelerated embedding models! ☆18 Updated 6 months ago
- A library for simplifying fine tuning with multi gpu setups in the Huggingface ecosystem. ☆16 Updated 7 months ago
- Training hybrid models for dummies. ☆23 Updated 5 months ago
- A fast RWKV Tokenizer written in Rust ☆46 Updated 2 months ago
- Nexusflow function call, tool use, and agent benchmarks. ☆20 Updated 6 months ago
- Simple high-throughput inference library ☆119 Updated last month
- ☆26 Updated 2 years ago
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he… ☆31 Updated last year
- Proof of concept for running moshi/hibiki using webrtc ☆19 Updated 3 months ago
- JAX bindings for the flash-attention3 kernels ☆11 Updated 10 months ago
- Fast approximate inference on a single GPU with sparsity aware offloading ☆38 Updated last year
- Zeta implementation of a reusable and plug in and play feedforward from the paper "Exponentially Faster Language Modeling" ☆16 Updated 7 months ago
- ☆40 Updated 2 years ago