weaviate / arXiv-demo-dataset
This repository will contain a demo using Weaviate with data and metadata from the arXiv dataset.
☆12 Updated 2 years ago
Alternatives and similar repositories for arXiv-demo-dataset:
Users who are interested in arXiv-demo-dataset are comparing it to the repositories listed below.
- Writing Blog Posts with Generative Feedback Loops! ☆47 Updated 11 months ago
- Search through Facebook Research's PyTorch BigGraph Wikidata-dataset with the Weaviate vector search engine ☆31 Updated 3 years ago
- ☆18 Updated 4 months ago
- 🤗 HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial) ☆17 Updated 10 months ago
- PyTorch implementation for MRL ☆18 Updated 11 months ago
- Explore the use of DSPy for extracting features from PDFs 🔎 ☆38 Updated 11 months ago
- Resources for exploring Generative Feedback Loops with Weaviate! ☆36 Updated last month
- A sample pattern for running CI tests on Modal ☆14 Updated 5 months ago
- Versatile framework designed to streamline the integration of your models, as well as those sourced from Hugging Face, into complex progr… ☆28 Updated 2 months ago
- Creating Generative AI Apps which work ☆16 Updated 7 months ago
- SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields extraction, tokenization, prompting, batchi… ☆32 Updated 8 months ago
- [WIP] Behold, semantic-search, built over sentence-transformers to make it easy for search engineers to evaluate, optimise and deploy mod… ☆15 Updated last year
- Using short models to classify long texts ☆21 Updated last year
- Check for data drift between two OpenAI multi-turn chat jsonl files. ☆37 Updated 10 months ago
- Streamlit app for recommending eval functions using prompt diffs ☆27 Updated last year
- Ingest PDFs into Weaviate ☆33 Updated 8 months ago
- Code and Dataset for Learning to Solve Complex Tasks by Talking to Agents ☆23 Updated 2 years ago
- Example code using the DSPy framework. ☆18 Updated 8 months ago
- A specification for OpenInference, a semantic mapping of ML inferences ☆46 Updated 10 months ago
- Build Agentic workflows with function calling using open LLMs ☆26 Updated 2 weeks ago
- Code for our paper Resources and Evaluations for Multi-Distribution Dense Information Retrieval ☆14 Updated last year
- Tools for various benchmarking scenarios ☆27 Updated this week
- ☆8 Updated 7 months ago
- ☆76 Updated 8 months ago
- a pipeline for using api calls to agnostically convert unstructured data into structured training data ☆29 Updated 4 months ago
- Dataset Viber is your chill repo for data collection, annotation and vibe checks. ☆44 Updated 5 months ago
- ☆22 Updated 9 months ago
- ☆31 Updated 11 months ago
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"