Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
☆29Apr 7, 2023Updated 3 years ago
Alternatives and similar repositories for community
Users that are interested in community are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆19May 23, 2023Updated 3 years ago
- 💙 Unstructured Data Connectors for Haystack 2.0☆18Sep 21, 2023Updated 2 years ago
- Code created for blog series on unsupervised feature/topic extraction from corporate email content. An implementation for cleaning raw e…☆10Oct 21, 2021Updated 4 years ago
- This project is a Python script that scrapes a Linkedin PDF, generates a customized portfolio site using OpenAI's GPT-4 model with the he…☆29May 16, 2023Updated 3 years ago
- ☆936Jun 19, 2026Updated 2 weeks ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- deprecated, use https://github.com/octohelm/piper instead.☆14Sep 3, 2024Updated last year
- ☆13Oct 18, 2022Updated 3 years ago
- A Firefox and Google Chrome extension to clip websites and download them into a readable markdown file.☆43Jan 13, 2019Updated 7 years ago
- ☆15Nov 13, 2018Updated 7 years ago
- ☆22Mar 18, 2024Updated 2 years ago
- An assortment of Obsidian Web Clipper Templates☆31Mar 14, 2025Updated last year
- Strong, Simple, and Precise, (and now async!) security for Sanic APIs☆14May 29, 2026Updated last month
- ☆11Oct 25, 2023Updated 2 years ago
- Pythoness: use natural language to define Python functions.☆21Apr 22, 2025Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- A tool to convert Gherkin into Sphinx documentation☆12Sep 4, 2023Updated 2 years ago
- Streamlit Demo Use Cases☆28Oct 15, 2023Updated 2 years ago
- A microservices-based streaming platform for Ingenic T-series based embedded IP cameras☆29May 31, 2026Updated last month
- A prompt-engineering technique for creating personalized custom instructions on ChatGPT☆17Oct 26, 2023Updated 2 years ago
- Model Context Protocol Server for Accessing twitter☆25Jun 3, 2025Updated last year
- Bindings for H3 to SQLite3☆19Feb 12, 2026Updated 4 months ago
- Strategy on how to create a k8s cluster in aws EKS☆17Aug 13, 2021Updated 4 years ago
- Chrome and Firefox extensions for Slurp☆29Apr 9, 2024Updated 2 years ago
- This guide is made to help you deploy your own document RAG pipline with Open-WebUI and Local LLM.☆39Mar 20, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Demonstrate using MCP with Pydantic AI framework☆14Mar 14, 2025Updated last year
- Examples in the MLX framework☆11Sep 23, 2024Updated last year
- This is an Ai based Yoga Pose Detection System☆10Jul 5, 2022Updated 3 years ago
- BDD testing library for Elixir☆22Sep 10, 2020Updated 5 years ago
- Viewer for text datasets in formats like HuggingFace, JSONL, etc.☆15Feb 25, 2025Updated last year
- ChatGPT Google SLides AppScript☆26Mar 5, 2023Updated 3 years ago
- PyTorch Implementation of Attention Prompt Tuning: Parameter-Efficient Adaptation of Pre-Trained Models for Action Recognition☆16Mar 12, 2024Updated 2 years ago
- ClickYaml reads a .yaml file and creates click Commands out of it.☆16Mar 5, 2023Updated 3 years ago
- Muse 2/S EEG Headset Vanilla Javascript Library☆12Jun 24, 2022Updated 4 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆15Jan 10, 2025Updated last year
- Automatically comments Python code, adding docstrings and type annotations, with optional translation to other languages.☆28Feb 29, 2024Updated 2 years ago
- Materiais do Curso de Introdução à Pesquisa Jurimétrica☆11Oct 25, 2023Updated 2 years ago
- Copy the web as markdown☆41Aug 17, 2025Updated 10 months ago
- This repo guides you through building a chatbot on your own data with self hosted LLM☆90Feb 14, 2023Updated 3 years ago
- A project designed to extract relevant metadata from databases and transform it into context for Retrieval-Augmented Generation (RAG) in …☆14Aug 6, 2025Updated 10 months ago
- This is a template retrieval repo to create a Flask api server using LangChain with Cohere embeddings and Qdrant Vector Database☆78Apr 30, 2023Updated 3 years ago