mmas/docker-scrapy-tor

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/mmas/docker-scrapy-tor)

mmas / docker-scrapy-tor

Scrapy environment with Tor for anonymous ip routing and Privoxy for http proxy

☆20

Alternatives and similar repositories for docker-scrapy-tor

Users that are interested in docker-scrapy-tor are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

EdAbati / outlines-haystack
View on GitHub
Use `outlines` generators with Haystack.
☆14Updated this week
54rt1n / shardmerge
View on GitHub
Using fourier interpolation to merge large language models
☆11Jul 11, 2026Updated 2 weeks ago
ShuHuang / chemdatawriter
View on GitHub
ChemDataWriter is a transformer-based library for automatically generating research books in the chemistry area.
☆13Oct 7, 2023Updated 2 years ago
dejan94it / cc_Rtools
View on GitHub
This plugin allows the Cheshire Cat to use tools written in R language
☆10Dec 23, 2024Updated last year
guilouro / devent
View on GitHub
☆10Jul 29, 2022Updated 4 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
TheZalRevolt / italian-tech-podcast
View on GitHub
Repository github che contiene info e link riguardanti i podcast italiani in ambito tech
☆12Dec 18, 2023Updated 2 years ago
mateuspadua / django-admin-report
View on GitHub
Crie relátorios utilzando todo o potencial do admin django
☆15Dec 19, 2019Updated 6 years ago
anakin87 / llama2-haystack
View on GitHub
Using Llama2 with Haystack, the NLP/LLM framework.
☆16Jul 21, 2023Updated 3 years ago
scrapinghub / page_clustering
View on GitHub
A simple algorithm for clustering web pages, suitable for crawlers
☆33Mar 6, 2017Updated 9 years ago
alexbrandsen / jsonl2bio
View on GitHub
Script that converts JSONL output from Doccano to the BIO format
☆10Jul 5, 2019Updated 7 years ago
avacaondata / nlpboost
View on GitHub
Python library for automatic training, optimization and comparison of Transformer models on most NLP tasks.
☆20May 6, 2023Updated 3 years ago
RLDiary / Wordle-GRPO
View on GitHub
A $100 Agent - Reinforcement tuning a language model to play the game of Wordle
☆18Jul 14, 2025Updated last year
johnthebrave / nlidb-datasets
View on GitHub
☆21Jun 14, 2018Updated 8 years ago
Principled-Intelligence / orbitals
View on GitHub
☆17Jul 21, 2026Updated last week
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
sky-ecosystem / spells-kovan
View on GitHub
☆12Sep 1, 2021Updated 4 years ago
roycehaynes / scrapy-rabbitmq
View on GitHub
A RabbitMQ Scheduler for Scrapy
☆87Aug 9, 2022Updated 3 years ago
mattes / rotating-proxy
View on GitHub
Rotating TOR proxy with Docker
☆1,218Apr 25, 2024Updated 2 years ago
kaievns / git-rainbow
View on GitHub
A rainbow commits generator for git
☆23Oct 24, 2015Updated 10 years ago
owen9825 / captcha-middleware
View on GitHub
A middleware layer for Scrapy that detects CAPTCHA tests and solves them
☆44Jul 6, 2023Updated 3 years ago
stephenc / envsub
View on GitHub
An alternative envsubst that allows for ${foo:-default} expansion too
☆35Nov 25, 2020Updated 5 years ago
rdcprojects / scrapy-mq-redis
View on GitHub
A RabbitMQ/Redis tool for Scrapy
☆13Oct 7, 2016Updated 9 years ago
ComputationalFinanceTools / fst-cuda-option-pricing
View on GitHub
Pricing European and American options with jump models using CUDA on the GPU
☆12Apr 12, 2016Updated 10 years ago
docling-project / docling-haystack
View on GitHub
Docling Haystack integration
☆29Apr 9, 2026Updated 3 months ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
BruceDone / dagobah
View on GitHub
Simple DAG-based job scheduler in Python
☆13May 10, 2017Updated 9 years ago
jiayingwang / smart-match
View on GitHub
The smart-match module contains functions for calculating strings/sets similarity.
☆14Feb 22, 2024Updated 2 years ago
ionicthemes / building-an-ionic3-multi-language-app
View on GitHub
Learn why you need to Internationalize and Localize your Ionic Framework App and how to do it. Also find out how to adapt your Ionic App …
☆15May 7, 2018Updated 8 years ago
VAGOsolutions / SauerkrautLM-Doom-MultiVec
View on GitHub
A tiny 1.3M parameter model that plays DOOM, outperforming LLMs up to 92,000x its size.
☆26May 11, 2026Updated 2 months ago
junivillegas / ionic3-woocommerce
View on GitHub
Ionic 3 Woocommerce Template
☆15Nov 8, 2017Updated 8 years ago
shaunvxc / jsonlike
View on GitHub
little hack for when json.loads() complains
☆12Jul 29, 2017Updated 9 years ago
chainx-org / chainx.js
View on GitHub
chainx sdk
☆12Sep 4, 2020Updated 5 years ago
ExtraMojo / ExtraMojo
View on GitHub
A library of nice to have things not found in the current mojo stdlib
☆15Jul 22, 2026Updated last week
TeamHG-Memex / scrapy-dockerhub
View on GitHub
[UNMAINTAINED] Deploy, run and monitor your Scrapy spiders.
☆12Apr 8, 2026Updated 3 months ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
istresearch / scrapy-cluster
View on GitHub
This Scrapy project uses Redis and Kafka to create a distributed on demand scraping cluster.
☆1,226Nov 7, 2023Updated 2 years ago
TeamHG-Memex / url-summary
View on GitHub
Show summary of a large number of URLs in a Jupyter Notebook
☆19Apr 8, 2026Updated 3 months ago
modelpredict / language-identification-survey
View on GitHub
Live survey of off-the-shelf language identification tools for python
☆27Apr 13, 2022Updated 4 years ago
marcopoli / LLaMAntino-3-ANITA
View on GitHub
The 🌟ANITA project🌟 *(Advanced Natural-based interaction for the ITAlian language)* wants to provide Italian NLP researchers with an im…
☆24Sep 11, 2024Updated last year
jjpaulo2 / fastrpa
View on GitHub
🌐 A fast and simple to use abstraction over Selenium
☆19Aug 13, 2024Updated last year
upstarter / acts_as_reviewable
View on GitHub
Reviews for any AR model with multi-dimensional ratings, review commentary, and information graphics.
☆16Sep 12, 2014Updated 11 years ago
cnap / ynacc
View on GitHub
The Yahoo News Annotated Comments Corpus (YNACC)
☆21Oct 19, 2018Updated 7 years ago