peopledatalabs / peopledatalabs-pythonLinks
A Python client for the People Data Labs API
☆35Updated last week
Alternatives and similar repositories for peopledatalabs-python
Users that are interested in peopledatalabs-python are comparing it to the libraries listed below
Sorting:
- A python script to loop through urls in a csv and look for specific keywords on the scraped homepage.☆16Updated 3 years ago
- API for OpenSanctions with support for entity search and bulk matching of data collections. Supports Reconciliation API spec.☆108Updated this week
- scraping and querying documents for LLMs☆24Updated 3 weeks ago
- Time-Series Manipulation☆71Updated 2 years ago
- A simple and streamlined Python script to extract and filter links from a remote HTML resource.☆24Updated 9 months ago
- API client for fetching and comparing passages from legislation☆14Updated 9 months ago
- A FastAPI extension for integrating common AI agent frameworks.☆45Updated 9 months ago
- an easy way to create JSONL files for fine-tuning openai models.☆18Updated 11 months ago
- Python SDK for Inngest: Durable functions and workflows in Python, hosted anywhere☆142Updated this week
- Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution.☆63Updated last week
- A python client for Crunchbase's REST API☆41Updated 2 years ago
- Parse government documents into well formed JSON☆73Updated 2 months ago
- Crawl any Web page and generate XML sitemap compatible with Google's indexing robots.☆45Updated last year
- Scrape various open data directories to create an index of what's available out there☆37Updated 8 months ago
- Scrapfly Python SDK for headless browsers and proxy rotation☆48Updated last month
- Lightweight AI agent library. Turn Python functions/classes into AI tools instantly. No verbose configs or complex abstractions.☆29Updated this week
- pai: A Python REPL with a built in AI agent☆42Updated 2 years ago
- Singer.io Tap for extracting data from the Google Analytics Reporting API☆12Updated 3 weeks ago
- Various Jupyter notebooks about Common Crawl data☆59Updated 6 months ago
- Common interface for data container classes☆68Updated last month
- Track changes to GraphQL APIs by git scraping their schemas☆30Updated 6 months ago
- Clean, filter and sample URLs to optimize data collection – Python & command-line – Deduplication, spam, content and language filters☆149Updated 9 months ago
- Spider ported to Python☆94Updated 8 months ago
- A low-code microservices platform designed for legal engineers. Given a document, Gremlin will apply a series of Python scripts to it and…☆30Updated 3 years ago
- Download client for legal opinions☆14Updated 9 months ago
- Entity resolution, also known as Data Matching or Record linkage is the task of finding a data set that refer to the same or similar real…☆29Updated 6 months ago
- 🖍️ Highlight text in documents☆109Updated 6 months ago
- Library for scraping websites or apis at any scale☆53Updated last year
- Web scraping Page Objects core library☆101Updated this week
- Legal Matter Standard Specification (LMSS) library for Python☆15Updated last year