P1ayer-1 / Llama-LibTorch
Llama causal LM fully recreated in LibTorch. Designed to be used in Unreal Engine 5
☆13Updated 7 months ago
Alternatives and similar repositories for Llama-LibTorch:
Users that are interested in Llama-LibTorch are comparing it to the libraries listed below
- ☆32Updated 9 months ago
- mnn asr demo.☆16Updated last month
- Multiple GEMM operators are constructed with cutlass to support LLM inference.☆17Updated 6 months ago
- Performance of the C++ interface of flash attention and flash attention v2 in large language model (LLM) inference scenarios.☆15Updated last year
- ncnn HiFi-GAN☆26Updated 6 months ago
- ☆124Updated last year
- Real-time Timbre Remapping with Differentiable DSP.☆14Updated 5 months ago
- Awesome code, projects, books, etc. related to CUDA☆16Updated last week
- Stable Diffusion in TensorRT 8.5+☆14Updated 2 years ago
- SGEMM optimization with cuda step by step☆18Updated last year
- SOTA Piano Transformer model trained on 4.2GB of Solo Piano MIDI music☆25Updated last year
- Decoding Attention is specially optimized for MHA, MQA, GQA and MLA using CUDA core for the decoding stage of LLM inference.☆36Updated 3 weeks ago
- ☆11Updated last month
- Basic library for spatial audio SOFA files☆12Updated 4 years ago
- Official source codes of airsep☆36Updated last year
- Here we will track the latest Audio AI Agent, including speech, music, sound effects, etc.☆14Updated last year
- 使用 CUDA C++ 实现的 llama 模型推理框架☆50Updated 5 months ago
- ⚡️Write HGEMM from scratch using Tensor Cores with WMMA, MMA and CuTe API, Achieve Peak⚡️ Performance.☆73Updated 3 weeks ago
- llm deploy project based onnx.☆36Updated 6 months ago
- Standalone Flash Attention v2 kernel without libtorch dependency☆108Updated 7 months ago
- HunyuanDiT with TensorRT and libtorch☆17Updated 11 months ago
- ☆48Updated last week
- Whisper in TensorRT-LLM☆15Updated last year
- A practical way of learning Swizzle☆18Updated 2 months ago
- 很好用的tnn classify demo☆11Updated 4 years ago
- Open deep learning compiler stack for cpu, gpu and specialized accelerators☆18Updated last week
- Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch☆18Updated this week
- A build project for ONNX Runtime☆40Updated 2 weeks ago
- ☆55Updated 2 weeks ago
- [DEPRECEATED] Multi-Instrumental Music Transformer trained on 12GB/400k MIDIs☆17Updated 2 years ago