SWHL / LLaMADemoLinks
🎉LLaMA Demo 7B🎉
☆17Updated 2 years ago
Alternatives and similar repositories for LLaMADemo
Users that are interested in LLaMADemo are comparing it to the libraries listed below
Sorting:
- rwkv finetuning☆37Updated last year
- 🎮Manipulates mobile phones just like how you would. Official code for "MobA: Multifaceted Memory-Enhanced Adaptive Planning for Efficien…☆26Updated 3 months ago
- share data, prompt data , pretraining data☆36Updated 2 years ago
- Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.☆13Updated last year
- ✅4g GPU可用 | 简易实现ChatGLM单机调用多个计算设备(GPU、CPU)进行推理☆34Updated 2 years ago
- Kanchil(鼷鹿)是世界上最小的偶蹄目动物,这个开源项目意在探索小模型(6B以下)是否也能具备和人类偏好对齐的能力。☆113Updated 2 years ago
- LoRA fine-tuned Stable Diffusion Deployment☆31Updated 2 years ago
- Instruct-tune LLaMA on consumer hardware☆72Updated 2 years ago
- Torchserve + TensorRT + Detection☆19Updated 3 years ago
- ☆17Updated 2 years ago
- ☆80Updated last year
- Cross-platform, customizable ML solutions for live and streaming media.☆24Updated 4 years ago
- 实现Blip2RWKV+QFormer的多模态图文对话大模型,使用Two-Step Cognitive Psychology Prompt方法,仅3B参数的模型便能够出现类人因果思维链。对标MiniGPT-4,ImageBind等图文对话大语言模型,力求以更小的算力和资源实…☆40Updated 2 years ago
- My personal implementation of the model from "Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities", they haven't rel…☆12Updated last year
- minichatgpt - To Train ChatGPT In 5 Minutes☆169Updated 2 years ago
- nllb-200 distilled 350M for English to Korean translation☆28Updated last year
- Inference TinyLlama models on ncnn☆24Updated 2 years ago
- Our data munging code.☆34Updated 2 months ago
- Whisper in TensorRT-LLM☆17Updated 2 years ago
- An open source NLP as a service project focused on providing state of the art systems with ease. Training and inference by simple docker …☆20Updated last year
- ☆34Updated last year
- qwen2 and llama3 cpp implementation☆49Updated last year
- NVIDIA® TensorRT™, an SDK for high-performance deep learning inference, includes a deep learning inference optimizer and runtime that del…☆26Updated 2 years ago
- The paddle implementation of meta's LLaMA.☆45Updated 2 years ago
- 使用onnxruntime部署实时视频帧插值,包含C++和Python两个版本的程序☆27Updated last year
- Using open-source LLM Llama2 by Meta on local CPU inference for document question-and-answer☆15Updated 2 years ago
- A light proxy solution for HuggingFace hub.☆49Updated 2 years ago
- NASRec Weight Sharing Neural Architecture Search for Recommender Systems☆31Updated 2 years ago
- Frontend for the MOSS chatbot.☆47Updated last year
- Inference script for Meta's LLaMA models using Hugging Face wrapper☆110Updated 2 years ago