A minimal PyTorch re-implementation of Qwen 3.5
☆377Mar 5, 2026Updated 3 weeks ago
Alternatives and similar repositories for tiny-qwen
Users that are interested in tiny-qwen are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Benchmark tests supporting the TiledCUDA library.☆18Nov 19, 2024Updated last year
- GEMV implementation with CUTLASS☆19Aug 21, 2025Updated 7 months ago
- A small RISC-V kernel coding by C, tested on sifive unmatched board.☆16Aug 20, 2022Updated 3 years ago
- ☆16Mar 19, 2026Updated last week
- The Next-gen Language & Compiler Powering Efficient Hardware Design☆36Jan 16, 2025Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆139Aug 18, 2025Updated 7 months ago
- mnn asr demo.☆26Mar 24, 2025Updated last year
- A practical way of learning Swizzle☆37Feb 3, 2025Updated last year
- Bert TensorRT模型加速部署☆10Apr 1, 2022Updated 3 years ago
- An MLIR-based source-to-source automatic differentiation system.☆15Mar 30, 2023Updated 2 years ago
- StyleTTS2 + Vocos as a Decoder☆13Mar 24, 2025Updated last year
- Implement a ELF parser and a utility like readelf on Linux.☆11Oct 7, 2019Updated 6 years ago
- Demo for Qwen2.5-VL-3B-Instruct on Axera device.☆16Sep 3, 2025Updated 6 months ago
- Whisper in TensorRT-LLM☆17Sep 21, 2023Updated 2 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- RePo: Language Models with Context Re-Positioning☆74Dec 24, 2025Updated 3 months ago
- Multiple GEMM operators are constructed with cutlass to support LLM inference.☆19Aug 3, 2025Updated 7 months ago
- Whisper Speaker Identification (WSI), a cutting-edge model for multilingual speaker identification.☆26Mar 17, 2025Updated last year
- Tiny-FSDP, a minimalistic re-implementation of the PyTorch FSDP☆101Aug 20, 2025Updated 7 months ago
- ☆23Aug 14, 2024Updated last year
- 方便扩展的Cuda算子理解和优化框架,仅用在学习使用☆18Jun 13, 2024Updated last year
- 中文金融大模型测评基准,六大类二十五任务、等级化评价,国内模型获得A级☆10May 6, 2024Updated last year
- 使用 cutlass 实现 flash-attention 精简版,具有教学意义☆59Aug 12, 2024Updated last year
- CLIP and SigLIP models optimized with TensorRT with a Transformers-like API☆32Sep 29, 2024Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- User programs for rCore OS☆19Jun 7, 2022Updated 3 years ago
- Exploring Representation-Aligned Latent Space for Better Generation☆18Mar 17, 2026Updated last week
- Collection of Machine Learning examples using MLEK CMSIS-pack.☆10Feb 18, 2026Updated last month
- An Accelerator for Convolution layer designed with Vivado HLS.☆10Dec 4, 2020Updated 5 years ago
- Unofficial implementation of Google's Nested Learning framework in Pytorch☆29Updated this week
- ☆12Jan 23, 2020Updated 6 years ago
- Viterbi decoding in PyTorch☆42Sep 10, 2025Updated 6 months ago
- Code for paper "Spider: Any-to-Many Multimodal LLM"☆15Apr 26, 2025Updated 11 months ago
- PyTorch implementation of the Flash Spectral Transform Unit.☆22Sep 19, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Linux-capable out-of-order superscaler multicore LoongArch32 (LA32 / LA32R) processor.☆34Aug 9, 2024Updated last year
- SpInfer: Leveraging Low-Level Sparsity for Efficient Large Language Model Inference on GPUs☆62Mar 25, 2025Updated last year
- Utility that parses stack sizes section from elf objects and displays the preallocated stack size of each function.☆14Jan 15, 2020Updated 6 years ago
- LLaVA combines with Magvit Image tokenizer, training MLLM without an Vision Encoder. Unifying image understanding and generation.☆39Jun 20, 2024Updated last year
- Nano vLLM☆12,353Nov 3, 2025Updated 4 months ago
- ☆20Dec 10, 2018Updated 7 years ago
- A standalone GEMM kernel for fp16 activation and quantized weight, extracted from FasterTransformer☆96Feb 20, 2026Updated last month