A minimal PyTorch re-implementation of Qwen 3.5 for hobbyist
☆418Mar 5, 2026Updated 2 months ago
Alternatives and similar repositories for tiny-qwen
Users that are interested in tiny-qwen are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Survey on LLM Inference via Search (TMLR 2025)☆14May 6, 2025Updated last year
- GEMV implementation with CUTLASS☆21Aug 21, 2025Updated 9 months ago
- A small RISC-V kernel coding by C, tested on sifive unmatched board.☆16Aug 20, 2022Updated 3 years ago
- A simple MIPS CPU for BUAA CO course (and now NSCSCC).☆10May 15, 2021Updated 5 years ago
- The Next-gen Language & Compiler Powering Efficient Hardware Design☆38Jan 16, 2025Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆143Aug 18, 2025Updated 9 months ago
- mnn asr demo.☆27Mar 24, 2025Updated last year
- Bert TensorRT模型加速部署☆10Apr 1, 2022Updated 4 years ago
- An MLIR-based source-to-source automatic differentiation system.☆15Mar 30, 2023Updated 3 years ago
- StyleTTS2 + Vocos as a Decoder☆13Mar 24, 2025Updated last year
- ☆46Nov 1, 2025Updated 6 months ago
- Evaluate state-of-the-art sparse embedding models on the LIMIT dataset (`limit-small` and `limit`) from google's paper `On the Theoretica…☆16Sep 4, 2025Updated 8 months ago
- Whisper in TensorRT-LLM☆17Sep 21, 2023Updated 2 years ago
- F-16 is a powerful video large language model (LLM) that perceives high-frame-rate videos, which is developed by the Department of Electr…☆36Jul 3, 2025Updated 10 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A benchmarking tool for comparing different LLM API providers' DeepSeek model deployments.☆31Mar 28, 2025Updated last year
- Multiple GEMM operators are constructed with cutlass to support LLM inference.☆20Aug 3, 2025Updated 9 months ago
- RePo: Language Models with Context Re-Positioning☆75Mar 30, 2026Updated last month
- 🍑 relsim: Relational Visual Similarity | pip install relsim 🌍 (CVPR 2026)☆77Apr 8, 2026Updated last month
- ☆23Aug 14, 2024Updated last year
- Tiny-FSDP, a minimalistic re-implementation of the PyTorch FSDP☆108Aug 20, 2025Updated 9 months ago
- 使用 cutlass 实现 flash-attention 精简版,具有教学意义☆59Aug 12, 2024Updated last year
- User programs for rCore OS☆19Jun 7, 2022Updated 3 years ago
- ☆83Oct 13, 2025Updated 7 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Static file serving and directory listing☆12Apr 21, 2026Updated last month
- Exploring Representation-Aligned Latent Space for Better Generation☆19Mar 17, 2026Updated 2 months ago
- ☆11Nov 15, 2022Updated 3 years ago
- Blood Pressure Estimation using PPG Signal Morphological Features☆11Jul 5, 2021Updated 4 years ago
- Puzzles for learning Triton, play it with minimal environment configuration!☆703Mar 17, 2026Updated 2 months ago
- This code is for the Tiger Re-ID in the Wild track CVWC2019 (Detection part)☆20Aug 27, 2019Updated 6 years ago
- Source code for EMNLP'25 paper "CodeRAG: Finding Relevant and Necessary Knowledge for Retrieval-Augmented Repository-Level Code Completio…☆21Apr 15, 2026Updated last month
- SRL: Scaling Distributed Reinforcement Learning to Over Ten Thousand Cores☆15Apr 24, 2024Updated 2 years ago
- Utility that parses stack sizes section from elf objects and displays the preallocated stack size of each function.☆14Jan 15, 2020Updated 6 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A high-throughput and memory-efficient inference and serving engine for LLMs☆17Jun 3, 2024Updated last year
- Image signal process (ISP) parameters auto-tuning program based on Black-Box hyperparameter optimization☆15Nov 29, 2023Updated 2 years ago
- WebAssembly specification, reference interpreter, and test suite.☆13Aug 31, 2023Updated 2 years ago
- Nano vLLM☆13,595Apr 26, 2026Updated last month
- ☆24Aug 11, 2024Updated last year
- Advanced Embodied Intelligence Brain Model☆36Nov 5, 2025Updated 6 months ago
- ☆27Feb 18, 2026Updated 3 months ago