Hemanthkumar2112 / Gemma2-9B-Llama3-8B-Finetune-and-RAG
Gemma2(9B), Llama3-8B-Finetune-and-RAG, code base for sample, implemented in Kaggle platform
☆20Updated 7 months ago
Alternatives and similar repositories for Gemma2-9B-Llama3-8B-Finetune-and-RAG:
Users that are interested in Gemma2-9B-Llama3-8B-Finetune-and-RAG are comparing it to the libraries listed below
- Minimal zero-shot intent classifier for arbitrary intent slot filling, via LLM prompting w LangChain.☆33Updated last year
- Official repository for RAGViz: Diagnose and Visualize Retrieval-Augmented Generation [EMNLP 2024]☆76Updated last week
- Lighter, cheaper and faster RAG toolkit (Graph RAG) supported by TargetPilot☆41Updated 3 months ago
- ☆44Updated 4 months ago
- RAG with Knowledge Graph☆36Updated 11 months ago
- 7 query strategies for navigating knowledge graphs with LlamaIndex☆41Updated last year
- A new novel multi-modality (Vision) RAG architecture☆23Updated 3 months ago
- Zephyr 7B beta RAG Demo inside a Gradio app powered by BGE Embeddings, ChromaDB, and Zephyr 7B Beta LLM.☆34Updated last year
- Notebooks and code with some RAG techniques using llamaindex☆25Updated 9 months ago
- Universal text classifier for generative models☆22Updated 6 months ago
- Chat with Qwen2-VL. Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.☆10Updated 4 months ago
- ☆13Updated last year
- minimal LLM scripts for 24GB VRAM GPUs. training, inference, whatever☆36Updated last week
- ☆49Updated 4 months ago
- ☆26Updated 10 months ago
- RuleRAG: Rule-guided Retrieval-Augmented Generation with Language Models for Question Answering☆18Updated 2 months ago
- HybridRAG is a hybrid model of Vector and Graph☆22Updated 5 months ago
- ☆60Updated 3 months ago
- Fine-Tuning LLM and embedding models☆27Updated last year
- GGUF Quantization of any LLM.☆35Updated 10 months ago
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper☆29Updated 8 months ago
- Uses a Gradio interface to stream coding related responses from local and cloud based large language models. Pulls context from GitHub Re…☆19Updated 4 months ago
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];☆36Updated last year
- Code for "Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free"☆40Updated 3 months ago
- LLM reads a paper and produce a working prototype☆48Updated last month
- ☆25Updated 3 months ago
- Representing Rule-based Chatbots with Transformers☆19Updated 6 months ago
- Measuring RAG solutions throughput and latency☆15Updated 6 months ago
- The project demonstrates an example of how to use a supervised learning task using GPT-3.5 with JSON export, evaluating reviews in differ…☆16Updated last year
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆34Updated last month