iboB / git-lfs-download
Download full or partial git-lfs repos without temporarily using 2x disk space
☆30 · Updated last year
Alternatives and similar repositories for git-lfs-download
Users that are interested in git-lfs-download are comparing it to the libraries listed below
- GGML implementation of BERT model with Python bindings and quantization. ☆55 · Updated last year
- A converter and basic tester for rwkv onnx ☆42 · Updated last year
- Zeta implementation of a reusable, plug-and-play feedforward from the paper "Exponentially Faster Language Modeling" ☆16 · Updated 8 months ago
- Experiments with BitNet inference on CPU ☆54 · Updated last year
- Training hybrid models for dummies. ☆25 · Updated 6 months ago
- A fast RWKV Tokenizer written in Rust ☆46 · Updated this week
- Trying to deconstruct RWKV in understandable terms ☆14 · Updated 2 years ago
- RWKV-7: Surpassing GPT ☆92 · Updated 7 months ago
- Rust bindings for CTranslate2 ☆14 · Updated 2 years ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs. ☆19 · Updated 2 years ago
- Demonstration that finetuning a RoPE model on longer sequences than the pre-trained model adapts the model context limit ☆63 · Updated 2 years ago
- Demo Python script app to interact with llama.cpp server using whisper API, microphone and webcam devices. ☆46 · Updated last year
- Run ONNX RWKV-v4 models with GPU acceleration using DirectML [Windows], or just on CPU [Windows AND Linux]; Limited to 430M model at this… ☆21 · Updated 2 years ago
- PostText is a QA system for querying your text data. When appropriate structured views are in place, PostText is good at answering querie… ☆32 · Updated 2 years ago
- ☆64 · Updated 2 months ago
- ☆26 · Updated 2 years ago
- Implementation of https://arxiv.org/pdf/2312.09299 ☆21 · Updated last year
- ☆37 · Updated 2 months ago
- RWKV model implementation ☆38 · Updated 2 years ago
- Fast approximate inference on a single GPU with sparsity aware offloading ☆38 · Updated last year
- Let us make Psychohistory (as in Asimov) a reality, and accessible to everyone. Useful for LLM grounding and games / fiction / business /… ☆40 · Updated 2 years ago
- A list of language models with permissive licenses such as MIT or Apache 2.0 ☆24 · Updated 4 months ago
- ANE accelerated embedding models! ☆18 · Updated 7 months ago
- Utilities for Training Very Large Models ☆58 · Updated 9 months ago
- Port of Facebook's LLaMA model in C/C++ ☆22 · Updated last year
- Inference of Mamba models in pure C ☆188 · Updated last year
- Web browser version of StarCoder.cpp ☆45 · Updated last year
- Here we collect trick questions and failed tasks for open source LLMs to improve them. ☆32 · Updated 2 years ago
- Make triton easier ☆47 · Updated last year
- Fused Qwen3 MoE layer for faster training, compatible with HF Transformers, LoRA, 4-bit quant, Unsloth ☆122 · Updated this week