NVIDIA / workbench-llamafactoryLinks

This is an NVIDIA AI Workbench example project that demonstrates an end-to-end model development workflow using Llamafactory.

☆62

Alternatives and similar repositories for workbench-llamafactory

Users that are interested in workbench-llamafactory are comparing it to the libraries listed below

Sorting:

18907305772 / FuseAI
FuseAI Project
☆87Updated 6 months ago
cxcscmu / RAGViz
Official repository for RAGViz: Diagnose and Visualize Retrieval-Augmented Generation [EMNLP 2024]
☆86Updated 6 months ago
LLM360 / crystalcoder-data-prep
Data preparation code for CrystalCoder 7B LLM
☆45Updated last year
Cerebras / DocChat
GPT-4 Level Conversational QA Trained In a Few Hours
☆63Updated 11 months ago
read-agent / read-agent.github.io
☆64Updated last year
milvus-io / milvus-model
A library integrating embedding and reranker models from OpenAI, SentenceTransformers etc for semantic search in vector database.
☆47Updated 4 months ago
FreedomIntelligence / ApolloMoE
[ICLR'25] ApolloMoE: Efficiently Democratizing Medical LLMs for 50 Languages via a Mixture of Language Family Experts
☆45Updated 8 months ago
nateraw / replicate-examples
☆74Updated last year
deep-diver / gradio-chat
HuggingChat like UI in Gradio
☆71Updated 2 years ago
IEIT-Yuan / Yuan2.0-M32
Mixture-of-Experts (MoE) Language Model
☆189Updated 10 months ago
uukuguy / speechless
LLM based agents with proactive interactions, long-term memory, external tool integration, and local deployment capabilities.
☆104Updated this week
agokrani / distillKitPlus
Easy to use, High Performant Knowledge Distillation for LLMs
☆88Updated 2 months ago
Bui1dMySea / MemLong
☆94Updated 7 months ago
HITsz-TMG / KaLM-Embedding
Code for KaLM-Embedding models
☆86Updated last month
kyegomez / Algorithm-Of-Thoughts
My implementation of "Algorithm of Thoughts: Enhancing Exploration of Ideas in Large Language Models"
☆97Updated last year
TIGER-AI-Lab / LongRAG
Official repo for "LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs".
☆236Updated 11 months ago
chu-tianxiang / vllm-gptq
A high-throughput and memory-efficient inference and serving engine for LLMs
☆131Updated last year
wade1010 / graphrag-ui
The latest graphrag interface is used, using the local ollama to provide the LLM interface.Support for using the pip installation
☆154Updated 9 months ago
nyunAI / PruneGPT
☆51Updated last year
CogNLP / CogAGENT
☆35Updated 2 years ago
argilla-io / argilla-cookbook
Simple examples using Argilla tools to build AI
☆53Updated 8 months ago
myeon9h / PlanRAG
Repository for “PlanRAG: A Plan-then-Retrieval Augmented Generation for Generative Large Language Models as Decision Makers”, NAACL24
☆142Updated last year
LLM360 / amber-data-prep
Data preparation code for Amber 7B LLM
☆91Updated last year
jina-ai / submodular-optimization
Submodular optimization for context engineering: query fan-out, text selection, passage reranking
☆62Updated 2 weeks ago
AlexBodner / How_Much_VRAM
☆101Updated 11 months ago
SqueezeAILab / LLM2LLM
[ACL 2024] LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement
☆186Updated last year
EasyShopAI / rag-lab
Lighter, cheaper and faster RAG toolkit (Graph RAG) supported by TargetPilot
☆47Updated last month
M1n9X / GraphRAG_Lite
☆16Updated last year
etalab-ia / albert-models
Deployment a light and full OpenAI API for production with vLLM to support /v1/embeddings with all embeddings models.
☆42Updated last year
asprenger / ray_vllm_inference
A simple service that integrates vLLM with Ray Serve for fast and scalable LLM serving.
☆69Updated last year