NexaAI / nexa-sdk
Nexa SDK is a comprehensive toolkit for supporting ONNX and GGML models. It supports text generation, image generation, vision-language models (VLM), auto-speech-recognition (ASR), and text-to-speech (TTS) capabilities.
☆634Updated this week
Related projects: ⓘ
- Awesome LLMs on Device: A Comprehensive Survey☆613Updated this week
- The official implementation of Self-Play Preference Optimization (SPPO)☆461Updated last month
- Pytorch Library for Relational Table Learning with LLMs.☆270Updated last week
- The Official Repo of ML-Bench: Evaluating Large Language Models and Agents for Machine Learning Tasks on Repository-Level Code (https://a…☆350Updated last week
- Code for paper "GenTranslate: Large Language Models are Generative Multilingual Speech and Machine Translators"☆192Updated last month
- A multimodal agent framework for solving complex tasks☆505Updated last week
- 🎨 Infinite Drawboard in Python☆948Updated 2 months ago
- Accelerate your Stable Diffusion inference with the library's universal C/C++ framework design, powered by ONNXRuntime & across platforms…☆610Updated last month
- Unofficial Implementation of ReplaceAnything: https://aigcdesigngroup.github.io/replace-anything/☆526Updated 3 months ago
- Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models☆295Updated this week
- A deployment, monitoring and autoscaling service towards serverless LLM serving.☆152Updated last week
- An AI agent powered by LLMs that streamlines the entire process of data analysis. 🚀☆319Updated last month
- This is the official reproduction of FancyVideo.☆492Updated last week
- LLM-based text extraction from unstructured data like PDFs, Words and HTMLs. Transform and cluster the text into your desired format. Les…☆168Updated 3 months ago
- An official implementation of ShareGPT4Video: Improving Video Understanding and Generation with Better Captions☆1,220Updated last month
- The codes about "Uni-MoE: Scaling Unified Multimodal Models with Mixture of Experts"☆754Updated 2 weeks ago
- Multilingual Corpus of Web Fiction☆211Updated 2 months ago
- ☆374Updated this week
- Dive into Nature Simulation v1, a dynamic ecosystem game. Experience life's balance with interactive controls and stunning visuals of flo…☆334Updated 7 months ago
- Next-Generation Interactive Intelligent Programming Assistant☆733Updated 3 weeks ago
- PyTorch Implementation of AudioLCM (ACM-MM'24): a efficient and high-quality text-to-audio generation with latent consistency model.☆1,114Updated 2 months ago
- Fullstack engineer's checklist for your cybersecurity.☆532Updated 2 months ago
- Comprehensive Deep Learning Tutorial : From Zero To Hero☆806Updated last month
- ☆318Updated 2 months ago
- ☆366Updated 3 weeks ago
- An MBTI Exploration of Large Language Models☆448Updated 7 months ago
- The Official Implementation of PyramidKV: Dynamic KV Cache Compression based on Pyramidal Information Funneling☆480Updated last month
- Large-Scale Selfie Video Dataset (L-SVD): A Benchmark for Emotion Recognition☆407Updated last month
- ☆353Updated last month
- Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation☆1,087Updated 10 months ago