aws-samples / fine-tune-qwen2-vl-with-llama-factoryLinks
☆29Updated 3 months ago
Alternatives and similar repositories for fine-tune-qwen2-vl-with-llama-factory
Users that are interested in fine-tune-qwen2-vl-with-llama-factory are comparing it to the libraries listed below
Sorting:
- ☆55Updated last week
- vLLM Router☆45Updated last year
- Inference deployment of the llama3☆11Updated last year
- ☆22Updated 9 months ago
- ✨✨VITA: Towards Open-Source Interactive Omni Multimodal LLM☆11Updated 3 months ago
- ☢️ TensorRT 2023复赛——基于TensorRT-LLM的Llama模型推断加速优化☆50Updated last year
- Self-host LLMs with vLLM and BentoML☆150Updated last week
- A multimodal chat interface with many tools.☆123Updated 7 months ago
- NVIDIA TensorRT Hackathon 2023复赛选题:通义千问Qwen-7B用TensorRT-LLM模型搭建及优化☆42Updated last year
- GLM Series Edge Models☆149Updated 3 months ago
- Question Answering Generative AI application with Large Language Models (LLMs) and Amazon OpenSearch Service☆27Updated 10 months ago
- Maximizing the Performance of a Simple RAG using RL☆81Updated 6 months ago
- ☆56Updated 10 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆16Updated last year
- ☆12Updated 3 months ago
- ☆82Updated 10 months ago
- 研究GOT-OCR-项目落地加速,不限语言☆62Updated 11 months ago
- Zero-copy multimodal vector DB with CUDA and CLIP/SigLIP☆61Updated 5 months ago
- Evaluation of bm42 sparse indexing algorithm☆68Updated last year
- 🔎 A deep-dive into HyDE for Advanced LLM RAG + 💡 Introducing AutoHyDE, a semi-supervised framework to improve the effectiveness, covera…☆32Updated last year
- 💡💡💡awesome compute vision app in gradio☆55Updated last year
- A general 2-8 bits quantization toolbox with GPTQ/AWQ/HQQ/VPTQ, and export to onnx/onnx-runtime easily.☆180Updated 6 months ago
- YOLOExplorer : Iterate on your YOLO / CV datasets using SQL, Vector semantic search, and more within seconds☆135Updated last month
- qwen2 and llama3 cpp implementation☆47Updated last year
- ☆53Updated last year
- Large Language Model Hosting Container☆90Updated 2 weeks ago
- llm deploy project based onnx.☆44Updated last year
- A simple service that integrates vLLM with Ray Serve for fast and scalable LLM serving.☆72Updated last year
- Python scripts performing Open Vocabulary Object Detection using the YOLO-World model in ONNX.☆60Updated last year
- Using LangChain's SQL Database Chain and Agent with various LLMs to perform Natural Language Queries (NLQ) of an Amazon RDS for PostgreSQ…☆48Updated 2 years ago