chroma-core / onnx-embedding
A repository for creating, and sample code for consuming, an ONNX embedding model
☆34 Updated 2 years ago
Alternatives and similar repositories for onnx-embedding
Users who are interested in onnx-embedding are comparing it to the libraries listed below.
- Unofficial Python bindings for the Rust llm library. 🐍❤️🦀 ☆76 Updated 2 years ago
- ☆46 Updated 2 years ago
- DSPy program/pipeline inspector widget for Jupyter/VSCode Notebooks. ☆43 Updated last year
- I have explained how to create a superior RAG pipeline for complex PDFs using LlamaParse. We can extract text and tables from PDF and QA on… ☆48 Updated last year
- Tutorial and template for a semantic search app powered by the Atlas Embedding Database, Langchain, OpenAI and FastAPI ☆112 Updated 2 years ago
- Vector search demo with the arXiv paper dataset, RedisVL, HuggingFace, OpenAI, Cohere, FastAPI, React, and Redis. ☆150 Updated 8 months ago
- Using LlamaIndex with Ray for productionizing LLM applications ☆71 Updated 2 years ago
- Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines. ☆29 Updated 2 years ago
- A list of Haystack Integrations, maintained by the community or deepset. ☆97 Updated this week
- Using LlamaIndex, Redis, and OpenAI to chat with PDF documents. Supplementary material for blog post on Microsoft Developer Blog ☆114 Updated 2 years ago
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte… ☆78 Updated last year
- Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first app… ☆170 Updated last year
- Split and analyze text files using langchain and streamlit ☆50 Updated last year
- ☆93 Updated 2 years ago
- Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform ☆38 Updated last year
- Build your own RAG and run it locally on your laptop: ColBERT + DSPy + Streamlit ☆58 Updated last year
- Develop, evaluate and monitor LLM applications at scale ☆98 Updated last year
- Embedding models from Jina AI ☆65 Updated last year
- Demo example of consumer goods categorization ☆30 Updated 2 years ago
- GPT-4 Level Conversational QA Trained In a Few Hours ☆66 Updated last year
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models ☆115 Updated 8 months ago
- Universal text classifier for generative models ☆25 Updated last year
- Simple, Fast, Parallel Huggingface GGML model downloader written in Python ☆24 Updated 2 years ago
- GLiNER model in a FastAPI microservice. ☆47 Updated last year
- Experimenting with the text-embeddings-inference server on both CPU and GPU ☆18 Updated 2 years ago
- LLM finetuning ☆42 Updated 2 years ago
- Adding NeMo Guardrails to a LlamaIndex RAG pipeline ☆41 Updated last year
- ☆65 Updated last year
- Ready-to-go containerized RAG service. Implemented with text-embedding-inference + Qdrant/LanceDB. ☆73 Updated 11 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ☆68 Updated last month