Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
☆29Apr 7, 2023Updated 3 years ago
Alternatives and similar repositories for community
Users that are interested in community are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code created for blog series on unsupervised feature/topic extraction from corporate email content. An implementation for cleaning raw e…☆10Oct 21, 2021Updated 4 years ago
- Filter RSS Feed with GPT-4☆16May 22, 2023Updated 3 years ago
- An assortment of Obsidian Web Clipper Templates☆31Mar 14, 2025Updated last year
- Email Client as MCP Server. Feature: multiple configuration, more than just gmail☆18Apr 22, 2025Updated last year
- A microservices-based streaming platform for Ingenic T-series based embedded IP cameras☆29May 31, 2026Updated 2 weeks ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆29May 27, 2025Updated last year
- Chrome and Firefox extensions for Slurp☆29Apr 9, 2024Updated 2 years ago
- This guide is made to help you deploy your own document RAG pipline with Open-WebUI and Local LLM.☆39Mar 20, 2025Updated last year
- Prompt templating tools designed for interacting with language interfaces like OpenAI's ChatGPT in Obsidian.☆25Apr 3, 2024Updated 2 years ago
- Demonstrate using MCP with Pydantic AI framework☆14Mar 14, 2025Updated last year
- Tool to apply Legal Matter Specification Standard (LMSS) to documents☆12Aug 15, 2024Updated last year
- Viewer for text datasets in formats like HuggingFace, JSONL, etc.☆15Feb 25, 2025Updated last year
- Utilities methods for interacting with Htmx in Dream☆17Aug 7, 2022Updated 3 years ago
- Tool4AI: A model agnostic, LLM friendly router for tool/function call☆20Aug 19, 2024Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆15Jan 10, 2025Updated last year
- The official Python library for Formulaic☆18Apr 25, 2024Updated 2 years ago
- TypeScript SDK for gorse recommender system☆13Mar 31, 2026Updated 2 months ago
- A template for Python projects that need to use a relational database, including tooling for managing schema migrations and testing again…☆13Dec 13, 2024Updated last year
- This repository hosts an advanced ROS2 package designed to seamlessly integrate WebRTC into robotic applications. Its primary purpose is …☆10Nov 9, 2023Updated 2 years ago
- Copy the web as markdown☆41Aug 17, 2025Updated 9 months ago
- This repo guides you through building a chatbot on your own data with self hosted LLM☆90Feb 14, 2023Updated 3 years ago
- A project designed to extract relevant metadata from databases and transform it into context for Retrieval-Augmented Generation (RAG) in …☆14Aug 6, 2025Updated 10 months ago
- This is a template retrieval repo to create a Flask api server using LangChain with Cohere embeddings and Qdrant Vector Database☆78Apr 30, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Build Contact Form 7 forms from PDF forms. Get PDFs auto-filled and attached to email messages and/or website responses on form submissio…☆11Apr 2, 2026Updated 2 months ago
- ☆35Jun 22, 2024Updated last year
- Prompting Techniques for Attorneys☆18May 30, 2026Updated 2 weeks ago
- API client for fetching and comparing passages from legislation☆14Updated this week
- I will be adding different kind of opensource data extraction tools code using python☆10Nov 15, 2024Updated last year
- Supercharge your workflow automation with this curated collection of n8n templates! Instantly connect your favorite apps-like Gmail, Tele…☆12May 22, 2025Updated last year
- ☆15Jun 9, 2023Updated 3 years ago
- Trained BERT and Word2Vec legal clause classifiers for SPACY using the Atticus Project's Open Source Contract Label Corpus☆13Jan 2, 2021Updated 5 years ago
- A lightweight React hook that automatically manages fade overlays for scrollable containers. Provides smooth gradient transitions at the …☆12Aug 11, 2025Updated 10 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Integrate Microsoft's Markitdown tool to convert various file formats to Markdown for your vault.☆68Mar 26, 2026Updated 2 months ago
- ☆14Jul 25, 2024Updated last year
- A simple RAG implementation using langchain and lmstudio☆38Dec 1, 2023Updated 2 years ago
- a cross-platform local-first open source alternative to AI recall apps (windows Recall, rewind.ai)☆15Jul 11, 2024Updated last year
- ☆25Apr 6, 2026Updated 2 months ago
- A simple swift app for MacOS/iOS to test large language models (LLM)☆31May 9, 2023Updated 3 years ago
- Open Source AI Database for Voice Agent Transcripts | Call Analysis & Insights | Extraction | Labelling & Classification☆29Nov 3, 2025Updated 7 months ago