P1ayer-1 / Llama-LibTorch
Llama causal LM fully recreated in LibTorch. Designed to be used in Unreal Engine 5
☆11Updated last month
Related projects ⓘ
Alternatives and complementary repositories for Llama-LibTorch
- ncnn HiFi-GAN☆24Updated last month
- A build project for ONNX Runtime☆32Updated last week
- Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch☆16Updated this week
- (WIP) Parallel inference for black-forest-labs' FLUX model.☆4Updated last week
- ☆28Updated 3 months ago
- ☆123Updated 10 months ago
- Inference RWKV with multiple supported backends.☆26Updated 2 months ago
- ☆11Updated 9 months ago
- TriNet: stabilizing self-supervised learning from complete or slow collapse on ASR.☆26Updated last year
- libvits-ncnn is an ncnn implementation of the VITS library that enables cross-platform GPU-accelerated speech synthesis.🎙️💻☆56Updated last year
- A Toolkit to Help Optimize Onnx Model☆75Updated this week
- Inference rwkv5 or rwkv6 with Qualcomm AI Engine Direct SDK☆36Updated this week
- some ncnn demos of FunASR☆16Updated last month
- AI Edge Quantizer: flexible post training quantization for LiteRT models.☆17Updated this week
- 很好用的tnn classify demo☆11Updated 3 years ago
- ☆46Updated last month
- ☆18Updated last month
- A project that optimizes Whisper for low latency inference using NVIDIA TensorRT☆60Updated 3 weeks ago
- Implementation of 'Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis', in MLX☆13Updated 2 weeks ago
- NVIDIA TensorRT Hackathon 2023复赛选题:通义千问Qwen-7B用TensorRT-LLM模型搭建及优化☆40Updated last year
- Inference TinyLlama models on ncnn☆25Updated last year
- FP64 equivalent GEMM via Int8 Tensor Cores using the Ozaki scheme☆46Updated 2 months ago
- onnxruntime pre-compiled libs☆77Updated last week
- ONNX Script editor & visualiser running completely in the browser thanks to Pyodide and Netron☆19Updated last year
- Here we will track the latest Audio AI Agent, including speech, music, sound effects, etc.☆11Updated 11 months ago
- IntLLaMA: A fast and light quantization solution for LLaMA☆18Updated last year
- the C++ version of Transformer with ncnn☆19Updated 3 years ago
- A lightweight pure C++ Text-to-Speech (TTS) pipeline with OpenVINO, supporting mixed English and Chinese languages.☆18Updated this week
- ☆11Updated last week