nixiesearch / onnx-convert
An ONNX converter script focused on embedding models
☆28Updated 2 weeks ago
Alternatives and similar repositories for onnx-convert:
Users that are interested in onnx-convert are comparing it to the libraries listed below
- Ready-to-go containerized RAG service. Implemented with text-embedding-inference + Qdrant/LanceDB.☆55Updated last month
- C++ inference wrappers for running blazing fast embedding services on your favourite serverless like AWS Lambda. By Prithivi Da, PRs welc…☆20Updated 10 months ago
- This repo is the central repo for all the RAG Evaluation reference material and partner workshop☆60Updated 3 months ago
- Lite weight wrapper for the independent implementation of SPLADE++ models for search & retrieval pipelines. Models and Library created by…☆29Updated 5 months ago
- 🕹️ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models.☆137Updated 6 months ago
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models☆100Updated last month
- A stable, fast and easy-to-use inference library with a focus on a sync-to-async API☆44Updated 4 months ago
- Routing on Random Forest (RoRF)☆100Updated 4 months ago
- Evaluation of bm42 sparse indexing algorithm☆65Updated 6 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆63Updated 2 months ago
- An OpenAI Completions API compatible server for NLP transformers models☆63Updated last year
- Complete example of how to build an Agentic RAG architecture with Redis, AWS Bedrock, and LlamaIndex.☆89Updated last month
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆66Updated 3 months ago
- experiments with inference on llama☆104Updated 7 months ago
- A framework for evaluating function calls made by LLMs☆36Updated 6 months ago
- Generalist and Lightweight Model for Text Classification☆59Updated last week
- Using open source LLMs to build synthetic datasets for direct preference optimization☆52Updated 11 months ago
- Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on task…☆145Updated 4 months ago
- Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs☆185Updated last month
- ☆76Updated 7 months ago
- ☆138Updated 6 months ago
- Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform☆82Updated 3 weeks ago
- ☆13Updated last year
- ☆207Updated 6 months ago
- Efficient few-shot learning with cross-encoders.☆44Updated 11 months ago
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆34Updated last month
- Solving data for LLMs - Create quality synthetic datasets!☆144Updated last week
- an implementation of Self-Extend, to expand the context window via grouped attention☆118Updated last year
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte…☆56Updated 3 months ago
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).☆79Updated 10 months ago