aws-samples / fine-tune-qwen2-vl-with-llama-factoryLinks
☆31Updated 6 months ago
Alternatives and similar repositories for fine-tune-qwen2-vl-with-llama-factory
Users that are interested in fine-tune-qwen2-vl-with-llama-factory are comparing it to the libraries listed below
Sorting:
- ☆62Updated this week
- Maximizing the Performance of a Simple RAG using RL☆90Updated 10 months ago
- Fine-Tuning LLM and embedding models☆28Updated 2 years ago
- vLLM Router☆54Updated last year
- Question Answering Generative AI application with Large Language Models (LLMs) and Amazon OpenSearch Service☆28Updated last year
- ☆23Updated last month
- MyAssistant Playground --powered by Bedrock Claude & AutoGen☆12Updated last year
- ✨✨VITA: Towards Open-Source Interactive Omni Multimodal LLM☆11Updated 7 months ago
- ☆21Updated last year
- ☢️ TensorRT 2023复赛——基于TensorRT-LLM的Llama模型推断加速优化☆51Updated 2 years ago
- Inference deployment of the llama3☆11Updated last year
- 研究GOT-OCR-项目落地加速,不限语言☆62Updated last year
- ☆54Updated last year
- A multimodal chat interface with many tools.☆131Updated 10 months ago
- ☆52Updated 8 months ago
- 用于学习GOT/Qwen/OnnxLLm☆53Updated last year
- Official repository for RAGViz: Diagnose and Visualize Retrieval-Augmented Generation [EMNLP 2024]☆88Updated last year
- qwen2 and llama3 cpp implementation☆49Updated last year
- Large Language Model Hosting Container☆91Updated 3 months ago
- Deployment a light and full OpenAI API for production with vLLM to support /v1/embeddings with all embeddings models.☆44Updated last year
- A general 2-8 bits quantization toolbox with GPTQ/AWQ/HQQ/VPTQ, and export to onnx/onnx-runtime easily.☆184Updated 9 months ago
- Summarize and perform RAG on PPTx/PPT file formats☆20Updated last year
- ☆11Updated 2 years ago
- Model compression for ONNX☆99Updated last year
- Zero-copy multimodal vector DB with CUDA and CLIP/SigLIP☆64Updated 8 months ago
- Gemma2(9B), Llama3-8B-Finetune-and-RAG, code base for sample, implemented in Kaggle platform☆22Updated 11 months ago
- Encountering 14 different Naive RAG fails and using KG to solve it☆17Updated last month
- Lighter, cheaper and faster RAG toolkit (Graph RAG) supported by TargetPilot☆46Updated 7 months ago
- Implemented a script that automatically adjusts Qwen3's inference and non-inference capabilities, based on an OpenAI-like API. The infere…☆22Updated 8 months ago
- Tencent Hunyuan 7B (short as Hunyuan-7B) is one of the large language dense models of Tencent Hunyuan☆71Updated 5 months ago