wenqiglantz / text-embedding-inference-server-edd
Experimenting text-embeddings-inference server on both CPU and GPU
☆18Updated last year
Alternatives and similar repositories for text-embedding-inference-server-edd:
Users that are interested in text-embedding-inference-server-edd are comparing it to the libraries listed below
- ☆1Updated 9 months ago
- This repository contains the source code for running llamaindex tutorials from https://howaibuildthis.substack.com/☆40Updated last year
- ☆29Updated last year
- ☆31Updated last year
- Building a Chain of Thought RAG Model with DSPy, Qdrant and Ollama☆31Updated last year
- ☆45Updated 11 months ago
- ☆88Updated last year
- Widest collection of generative ai usecases in enterprise & startups☆18Updated last year
- LlamaWorksDB is a Retrieval Augmented Generation (RAG) product designed to interact with the documentation of various products such as Ll…☆16Updated 11 months ago
- Tutorial for DSPy☆23Updated 11 months ago
- Medical Mixture of Experts LLM using Mergekit.☆20Updated last year
- Python Server for C3 AI app. A project that brings the power of Large Language Models (LLM) and Retrieval-Augmented Generation (RAG) with…☆23Updated last year
- I have explained how to create superior RAG pipeline for complex pdfs using LlamaParse. We can extract text and tables from pdf and QA on…☆44Updated last year
- Dynamic Metadata based RAG Framework☆72Updated 8 months ago
- ☆11Updated 10 months ago
- This repository is a combination of llama workflows and agents together which is a powerful concept.☆17Updated 8 months ago
- Streamlit app for recommending eval functions using prompt diffs☆27Updated last year
- Writing Blog Posts with Generative Feedback Loops!☆47Updated last year
- Rag Chatbot React And Tyepscript base boilerplate☆33Updated 11 months ago
- ☆19Updated 5 months ago
- ☆10Updated 10 months ago
- OpenMindedChatbot is a Proof Of Concept that leverages the power of Open source Large Language Models (LLM) with Function Calling capabil…☆29Updated last year
- Applying Evaluation Driven Development (EDD) to aid in the design decision of RAG pipelines☆31Updated last year
- On-device real-time RAG App built using Jina Reader, Mediapipe, Gemma 2b IT LLM.☆13Updated 11 months ago
- Metadata Enrichment using KeyBERT for advanced and improved RAG.☆10Updated last year
- ☆20Updated last year
- Example implementation of Iteration of Tought - Gives a star if you like the project☆40Updated 3 months ago
- Testing speed and accuracy of RAG with, and without Cross Encoder Reranker.☆48Updated last year
- ☆12Updated 11 months ago
- purpose of this repo is to Implement LLMOPs as shared in Deeplearning AI course☆12Updated this week