datametica / DataIngestionFrameworkLinks
☆15Updated last year
Alternatives and similar repositories for DataIngestionFramework
Users that are interested in DataIngestionFramework are comparing it to the libraries listed below
Sorting:
- Common crawl extractor☆84Updated last year
- Python library for Entities, relationships and schemas extraction from documents☆46Updated last year
- ☆20Updated last year
- ☆20Updated last month
- A repository for maintaining a list of the top domains based on multiple lists☆23Updated 3 years ago
- A UserScript to detect GPT generated comments on Hackernews.☆13Updated 3 years ago
- The script uses an Google maps API to download photos of places in the area specified by coordinates and search radius☆18Updated 2 years ago
- Neuron is a composable agent framework inspired by cognitive neuroscience. It enables you to build, orchestrate, and monitor intelligent …☆24Updated last week
- DomainsProject.org HTTP worker☆25Updated 3 years ago
- ☆12Updated last year
- Powerful LLM Query Framework with YAML Prompt Templates. Made for Automation☆34Updated 4 months ago
- TLS & API keys for your LLM APIs☆19Updated last month
- TextractAI: Extract and process text from PDFs using Python, OpenAI API, and OCR techniques.☆14Updated last year
- ☆18Updated 2 years ago
- Progzee is a Python library for simplifying IP proxy usage in HTTP requests.☆16Updated 11 months ago
- Squey is an open-source cross-platform visualization software designed to interactively explore and understand large amounts of tabular d…☆34Updated last month
- Browser interface to Telegram's API with additional modules for generating datasets and network graphs☆13Updated 2 years ago
- AI Document Intelligence is an AI service that applies advanced machine learning to extract text, key-value pairs, tables, and structures…☆15Updated last year
- A toolset to test data classification engines that generates mock data in various file formats, sizes and data profiles.☆43Updated 2 years ago
- Extract web archive data using Wayback Machine and Common Crawl☆171Updated last year
- One-click install for WizardLM-13B-Uncensored with oobabooga webui☆21Updated 2 years ago
- Taranis NG is an OSINT gathering and analysis tool for CSIRT teams and organisations. It allows team-to-team collaboration, and contains …☆10Updated 2 years ago
- Build super simple end-to-end data & ETL pipelines for your vector databases and Generative AI applications☆106Updated last year
- 🛡️ Managed isolated environments for Python☆109Updated this week
- Capture a URL with Playwright☆30Updated this week
- Security and compliance proxy for LLM APIs☆50Updated 2 years ago
- Blueprint by Mozilla.ai for answering questions about structured documents☆37Updated 10 months ago
- Spider ported to Python☆103Updated 3 weeks ago
- Neuroengine is a service to share LLMs in the form of a webchat and API.☆45Updated last year
- 🐝 Create powerful, collaborative AI applications.☆65Updated last year