wizenheimer / cyyrusLinks
Transform Unstructured Data into Synthetic Datasets
☆26Updated last year
Alternatives and similar repositories for cyyrus
Users that are interested in cyyrus are comparing it to the libraries listed below
Sorting:
- Repo to experiment with Graph RAG strategies using Kùzu☆63Updated 3 months ago
- Simple UI for debugging correlations of text embeddings☆306Updated 7 months ago
- Lightweight Nearest Neighbors with Flexible Backends☆324Updated last week
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆69Updated last month
- ☆68Updated last year
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte…☆81Updated last year
- ☆128Updated 4 months ago
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo)☆89Updated 3 weeks ago
- A stable, fast and easy-to-use inference library with a focus on a sync-to-async API☆47Updated last year
- Code for our paper PAPILLON: PrivAcy Preservation from Internet-based and Local Language MOdel ENsembles☆61Updated 8 months ago
- Python library to use Pleias-RAG models☆67Updated 8 months ago
- Dataset Viber is your chill repo for data collection, annotation and vibe checks.☆45Updated last year
- ☆160Updated last year
- lossily compress representation vectors using product quantization☆59Updated 2 months ago
- Query language for blending SQL and LLMs across structured + unstructured data, with type constraints.☆125Updated this week
- SUQL: Conversational Search over Structured and Unstructured Data with LLMs☆295Updated 2 months ago
- ☆125Updated last year
- LitePali is a minimal, efficient implementation of ColPali for image retrieval and indexing, optimized for cloud deployment.☆64Updated last year
- Generalist and Lightweight Model for Text Classification☆166Updated last month
- A fast minimalistic implementation of guided generation on Apple Silicon using Outlines and MLX☆59Updated last year
- Solving data for LLMs - Create quality synthetic datasets!☆151Updated 11 months ago
- Python API for https://vespa.ai, the open big data serving engine☆154Updated this week
- Pre-train Static Word Embeddings☆94Updated 4 months ago
- Datamodels for hugging face tokenizers☆86Updated last week
- Tools to make language models a bit easier to use☆63Updated last week
- Baguetter is a flexible, efficient, and hackable search engine library implemented in Python. It's designed for quickly benchmarking, imp…☆201Updated last year
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models☆115Updated 9 months ago
- A strongly typed Python DSL for developing message passing multi agent systems☆53Updated last year
- 🐮📢 The first AI voice assistant that interrupts *you*☆148Updated last year
- PyLate efficient inference engine☆69Updated 4 months ago