perone / feste
Feste is a free and open-source framework allowing scalable composition of NLP tasks using a graph execution model that is optimized and executed by specialized schedulers.
☆40 Updated last year
Related projects:
- Check for data drift between two OpenAI multi-turn chat jsonl files. ☆33 Updated 5 months ago
- Vector Database with support for late interaction and token level embeddings. ☆51 Updated last week
- Creating Generative AI Apps which work ☆16 Updated 2 months ago
- Production-grade embedding generation, for any length of text, for transformer models. ☆21 Updated last week
- A library for squeakily cleaning and filtering language datasets. ☆45 Updated last year
- NLP with Rust for Python 🦀🐍 ☆57 Updated 3 months ago
- Writing Blog Posts with Generative Feedback Loops! ☆41 Updated 6 months ago
- ☆31 Updated last year
- ☆24 Updated last year
- 🤗 HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial) ☆17 Updated 5 months ago
- A file utility for accessing both local and remote files through a unified interface. ☆36 Updated last month
- A pipeline for using API calls to agnostically convert unstructured data into structured training data ☆26 Updated last year
- This is the repo for the container that holds the models for the text2vec-transformers module ☆38 Updated 3 weeks ago
- Our open source implementation of MiniLMv2 (https://aclanthology.org/2021.findings-acl.188) ☆59 Updated last year
- Python API for https://vespa.ai, the open big data serving engine ☆89 Updated this week
- Hassle-free ML Pipelines on Kubernetes ☆38 Updated last year
- Lightweight wrapper for the independent implementation of SPLADE++ models for search & retrieval pipelines. Models and Library created by… ☆27 Updated 3 weeks ago
- Cortex-compatible model server for Python and TensorFlow ☆16 Updated last year
- Vespa application making an index of the CORD-19 dataset. ☆39 Updated 2 weeks ago
- Tokenization across languages. Useful as preprocessing for subword tokenization. ☆21 Updated last year
- ☆28 Updated this week
- Adversarial Training and SFT for Bot Safety Models ☆38 Updated last year
- This repository contains code for cleaning your training data of benchmark data to help combat data snooping. ☆25 Updated last year
- Tools to make language models a bit easier to use ☆22 Updated last week
- Training and Inference Notebooks for the RedPajama (OpenLlama) models ☆18 Updated last year
- Code for our paper Resources and Evaluations for Multi-Distribution Dense Information Retrieval ☆14 Updated 8 months ago
- Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and te… ☆42 Updated 8 months ago
- Voyage AI Official Python Library ☆37 Updated 3 months ago
- 🤝 Trade any tensors over the network ☆30 Updated 11 months ago
- Source code and data for Like a Good Nearest Neighbor ☆28 Updated 7 months ago