nixiesearch / onnx-convert
An ONNX converter script focused on embedding models
☆25Updated 9 months ago
Alternatives and similar repositories for onnx-convert:
Users that are interested in onnx-convert are comparing it to the libraries listed below
- C++ inference wrappers for running blazing fast embedding services on your favourite serverless like AWS Lambda. By Prithivi Da, PRs welc…☆20Updated 8 months ago
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte…☆53Updated 3 weeks ago
- Python API for https://vespa.ai, the open big data serving engine☆105Updated this week
- ☆112Updated this week
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆62Updated 3 weeks ago
- Vector Database with support for late interaction and token level embeddings.☆54Updated last month
- A stable, fast and easy-to-use inference library with a focus on a sync-to-async API☆45Updated 2 months ago
- RAGElo is a set of tools that helps you selecting the best RAG-based LLM agents by using an Elo ranker☆106Updated this week
- Unofficial python bindings for the rust llm library. 🐍❤️🦀☆73Updated last year
- 🕹️ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models.☆134Updated 4 months ago
- Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform☆81Updated last week
- Simple examples using Argilla tools to build AI☆43Updated last week
- minimal pytorch implementation of bm25 (with sparse tensors)☆90Updated 8 months ago
- A framework for evaluating function calls made by LLMs☆35Updated 4 months ago
- Baguetter is a flexible, efficient, and hackable search engine library implemented in Python. It's designed for quickly benchmarking, imp…☆163Updated 2 months ago
- Late Interaction Models Training & Retrieval☆169Updated this week
- Efficient few-shot learning with cross-encoders.☆40Updated 9 months ago
- Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on task…☆134Updated 2 months ago
- Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Da☆63Updated this week
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).☆77Updated 8 months ago
- Testing speed and accuracy of RAG with, and without Cross Encoder Reranker.☆47Updated 10 months ago
- ☆46Updated 9 months ago
- Using open source LLMs to build synthetic datasets for direct preference optimization☆41Updated 8 months ago
- ☆15Updated 6 months ago
- GPT-4 Level Conversational QA Trained In a Few Hours☆55Updated 3 months ago
- Experimental Code for StructuredRAG: Structured Outputs in Retrieval-Augmented Generation☆94Updated last week
- This repo is for handling Question Answering, especially for Multi-hop Question Answering☆64Updated 11 months ago
- Reimplementation of the task generation part from the Alpaca paper☆119Updated last year
- A large-scale multilingual dataset for Information Retrieval. Thorough human-annotations across 18 diverse languages.☆168Updated 3 months ago
- ☆94Updated 2 months ago