reanalytics-databoutique / advanced-scrapy-proxies
Scrapy rotation proxy package with advanced functions
☆94Updated 2 years ago
Alternatives and similar repositories for advanced-scrapy-proxies:
Users that are interested in advanced-scrapy-proxies are comparing it to the libraries listed below
- a high-performance, lightweight and human friendly serving engine for scrapy☆29Updated 3 years ago
- WarcDB: Web crawl data as SQLite databases.☆398Updated 6 months ago
- Image AES256 crypt-decrypt☆38Updated 3 years ago
- Piazza-Updater automates updates to a Weaviate database with real-time vectorial data. By continuously searching the internet and integra…☆28Updated last month
- Optimizing Python code by implementing a C++ extension☆47Updated last year
- Medical wordlists in EN/FR/UA☆86Updated last year
- I worked through 100 Pandas Puzzles (actually only 45) and did some data visualizations with the 2010 Denver Census data.☆12Updated 4 years ago
- Parse natural language time and date expressions in python☆195Updated 10 months ago
- Create a SQLite database containing metadata from Google Drive☆153Updated 2 years ago
- Python module to parse ingredient names. Splitting them into the ingredient, unit and quantity. It is trained on a publicly available dat…☆151Updated last year
- Python DBAPI simplified☆45Updated 3 years ago
- Command line parser for common log format.☆142Updated 6 months ago
- A generator for OpenAPI 3☆97Updated 4 years ago
- A Python-based static site generator using Jinja templates.☆96Updated 9 months ago
- An end to end semantic and meta-data search engine for personal data.☆153Updated 2 weeks ago
- Simple Python Calculation Engine☆134Updated 2 years ago
- Reverse Geocode for OpenStreetmap☆122Updated 4 months ago
- Flask code to deploy an API that pulls structured data from online news articles☆230Updated 2 years ago
- Shell scripting for serverless☆141Updated 2 years ago
- Better Bookmarks Search w/ Transformers☆190Updated 11 months ago
- This is a numpy implementation of the Skip-gram algorithm described in Mikolov et al's Word2Vec paper. It is intended for didactic purpos…☆36Updated last year
- A Node.js REPL with built-in GPT3 completion☆13Updated last year
- A recursive dependency scanner for Python projects☆70Updated last year
- Library that helps use puppeteer in scrapy.☆52Updated 3 weeks ago
- Add website scraping abilities to Datasette☆62Updated last year
- Minimalist log collector☆114Updated this week
- Build a RAG dataset for your domain in just a few lines of codes, using your XML sitemap☆41Updated 4 months ago
- Command-line tool to remotely execute code in the cloud☆134Updated 2 years ago