xytpai / kfuncaLinks
KFunca: A minimalist, high-performance GPU-based automatic differentiation framework
☆28Updated 2 weeks ago
Alternatives and similar repositories for kfunca
Users that are interested in kfunca are comparing it to the libraries listed below
Sorting:
- Single-thread, end-to-end C++ implementation of the Bitnet (1.58-bit weight) model☆13Updated 9 months ago
- ☆25Updated 4 months ago
- ☆20Updated last year
- Build CUDA Neural Network From Scratch☆21Updated last year
- official implementation of paper SDP4Bit: Toward 4-bit Communication Quantization in Sharded Data Parallelism for LLM Training☆40Updated 8 months ago
- Implementation of semantic segmentation of FCN structure using KITTI road dataset😝😝😝☆33Updated last year
- Mixed precision inference by Tensorrt-LLM☆81Updated 10 months ago
- University of Toronto / ECE1782 - Programming Massively Parallel Multiprocessors and Heterogeneous Systems / Project: an optimized CUDA I…☆25Updated last year
- computing the non-convex risk parity porfolio problems by the non-convex quadratic approxiamtion (NCQA), interior point method (IPM) and…☆26Updated 2 years ago
- Official Implementation of "Accel-GNN: High-Performance GPU Accelerator Design for Graph Neural Networks"☆51Updated 5 months ago
- [ICML 2025] Official implementation of paper "Token Cleaning: Fine-Grained Data Selection for LLM Supervised Fine-Tuning"☆48Updated 2 months ago
- ☆26Updated 3 years ago
- [IROS 2024] SCANet: Correcting LEGO Assembly Errors with Self-Correct Assembly Network (FINALIST BEST APPLICATION PAPER)☆22Updated 10 months ago
- Ein multimodaler, multi-intelligenter Entwicklungsrahmen☆44Updated 2 months ago
- A naive kernel.☆17Updated 3 years ago
- ☆18Updated 10 months ago
- 自动生成 markdown 标题序号☆27Updated last year
- Typeless Programming Language `sicpy` and Compiler;☆33Updated 2 years ago
- ☆28Updated 3 years ago
- 使用donut多模态模型,身份证识别,对身份证做端对端识别,无需中间处理,识别率达到商用☆18Updated last year
- responsive canvas☆31Updated 8 months ago
- A tool for automatically authenticate the network of Wen Yuan Talent Apartment.☆16Updated last year
- ☆64Updated 10 months ago
- This includes the original implementation of Retrieval-Oriented Knowledge for Click-Through Rate Prediction.☆21Updated 10 months ago
- EDC20: Code repository for the auto_navigation_car based on stm32. Contributed by the team A_star(champion team of the 20th Tsinghua Univ…☆20Updated 6 years ago
- Study of the optimization of chatbot behavior based on LLMs in the face of inappropriate behaviors in French conversations using semantic…☆40Updated 8 months ago
- ROSE: Robust Cross Supervision with Neighborhood Mining for Source-free Graph Domain Adaptation☆19Updated 10 months ago
- ☆10Updated 9 months ago
- A Go library implementation of the Model Controller Protocol (MCP). This library allows developers to easily parse MCP service configurat…☆48Updated 4 months ago
- By converting single-channel grayscale images into multi-channel images through various data enhancement techniques, SimOTM enhances the …☆31Updated 3 months ago