perone / feste
Feste is a free and open-source framework allowing scalable composition of NLP tasks using a graph execution model that is optimized and executed by specialized schedulers.
☆41Updated last year
Alternatives and similar repositories for feste:
Users that are interested in feste are comparing it to the libraries listed below
- a pipeline for using api calls to agnostically convert unstructured data into structured training data☆29Updated 5 months ago
- Check for data drift between two OpenAI multi-turn chat jsonl files.☆37Updated 10 months ago
- Writing Blog Posts with Generative Feedback Loops!☆47Updated 11 months ago
- Streamlit app for recommending eval functions using prompt diffs☆27Updated last year
- Explore the use of DSPy for extracting features from PDFs 🔎☆38Updated last year
- Search through Facebook Research's PyTorch BigGraph Wikidata-dataset with the Weaviate vector search engine☆31Updated 3 years ago
- Vector Database with support for late interaction and token level embeddings.☆53Updated 5 months ago
- ☆20Updated last year
- Production-grade embedding generation, for any length of text, for transformer models.☆23Updated 4 months ago
- Lite weight wrapper for the independent implementation of SPLADE++ models for search & retrieval pipelines. Models and Library created by…☆29Updated 6 months ago
- Training and Inference Notebooks for the RedPajama (OpenLlama) models☆18Updated last year
- Tokenization across languages. Useful as preprocessing for subword tokenization.☆22Updated 2 years ago
- Latent Large Language Models☆17Updated 6 months ago
- Chat Markup Language conversation library☆55Updated last year
- This repository serves as a collection of scrapers procuring and structuring various legal datasets☆17Updated last year
- NLP with Rust for Python 🦀🐍☆61Updated 9 months ago
- Supervised instruction finetuning for LLM with HF trainer and Deepspeed☆34Updated last year
- Experimenting with LLMs to Research, Reflect, and Plan (LLM assistants, retrieval, and Discord integration)☆29Updated 7 months ago
- Framework for building and maintaining self-updating prompts for LLMs☆61Updated 8 months ago
- This repository contains code for cleaning your training data of benchmark data to help combat data snooping.☆25Updated last year
- Create visualizations that are reproducible, easy to organize, and automatically detect if anything changes☆17Updated 3 months ago
- Tools for formatting large language model prompts.☆12Updated last year
- A library for squeakily cleaning and filtering language datasets.☆46Updated last year
- ☆8Updated 7 months ago
- [Added T5 support to TRLX] A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)☆47Updated 2 years ago
- ☆13Updated last year
- PyTorch implementation for MRL☆18Updated last year