skalmadka/web-crawler

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/skalmadka/web-crawler)

skalmadka / web-crawler

Distributed Web Crawler, Parser and Search Engine.

☆10

Alternatives and similar repositories for web-crawler

Users that are interested in web-crawler are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

jharner / rspark
View on GitHub
This repo is for building Docker containers for RStudio, PostgreSQL, Hadoop, Spark, etc.
☆22May 12, 2021Updated 5 years ago
makhtardiouf / d2d
View on GitHub
LTE-A Proximity-based Services, Device-to-device Communication module for NS-3
☆27Oct 19, 2016Updated 9 years ago
jeschkies / nyan
View on GitHub
NYAN is a news filtering engine written in Python and some Ruby.
☆15Aug 23, 2023Updated 2 years ago
aritter / LDA-SP
View on GitHub
Includes Code for Inference and Evaluation of Topic Models for Selectional Preferences
☆16Mar 10, 2023Updated 3 years ago
CallMeDJ / BigBananaBlockChain
View on GitHub
大蕉的区块链实现
☆16Feb 5, 2018Updated 8 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
brett-chen / AMC
View on GitHub
Code for KDD 2014 paper "Mining Topics in Documents: Standing on the Shoulders of Big Data"
☆21Oct 6, 2015Updated 10 years ago
v5tech / programminghive
View on GitHub
Programming Hive读书笔记
☆12May 29, 2014Updated 12 years ago
TwistedW / TwistedW.github.io
View on GitHub
个人小主页https://twistedw.github.io
☆11Aug 23, 2021Updated 4 years ago
aishack / sift
View on GitHub
☆12Sep 30, 2020Updated 5 years ago
SpiNNakerManchester / SpiNNFrontEndCommon
View on GitHub
Common support code for user-facing front end systems.
☆12Updated this week
ziky90 / tf-idf-Hadoop-MapReduce
View on GitHub
Project from the CTU Big Data course which purpose was to compute tf-idf values for the czech wikipedia
☆10Jul 8, 2014Updated 12 years ago
zhengqisong / swarmui
View on GitHub
跨集群的docker swarm管理UI，包括集群、节点、标签、用户、权限、服务、存储、网络、配置等集中管理，实施简单一个jar包搞定。
☆14Jul 24, 2021Updated 5 years ago
coderaiser / wisdom
View on GitHub
Tool for publishing releases to github and npm
☆19Jun 25, 2026Updated last month
USDA-ARS-ACSL / PhotoSynthesisModule
View on GitHub
Stand alone C++ module to simulate Farquhar Ball-Berry model of photosynthesis and transpiration
☆12Sep 28, 2018Updated 7 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
backstopmedia / sparkbook
View on GitHub
☆18May 1, 2016Updated 10 years ago
HumanBrainProject / ebrains-neuromorphic-job-queue-api
View on GitHub
Code for the remote-access API of the EBRAINS/HBP Neuromorphic Computing Platform
☆15May 25, 2026Updated 2 months ago
smarzola / anypubsub
View on GitHub
A generic interface wrapping multiple backends to provide a consistent pubsub API
☆13Oct 31, 2018Updated 7 years ago
ufasoft / lisp
View on GitHub
CLISP based Common Lisp interpreter/compiler. Very compact C++ implementation.
☆10Apr 24, 2016Updated 10 years ago
HackPlan / pomo-mailer
View on GitHub
Mail Renderer, Mail Queue, Task Manager
☆14Oct 19, 2016Updated 9 years ago
gtback / dotfiles
View on GitHub
My dotfiles
☆12Updated this week
lateral / hyperplane-hasher
View on GitHub
Implementation of an algorithm computing the nearest "N" neighbours to a vector, using a collection of hyperplane hashers.
☆30Jul 17, 2015Updated 11 years ago
TheGreatRambler / AnimASCII.js
View on GitHub
ASCII art animation library in Javascript using Canvas
☆12Sep 3, 2024Updated last year
icp4a / automation-decision-services-samples
View on GitHub
☆12Jun 26, 2026Updated 3 weeks ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
VoltDB / app-fastdata
View on GitHub
VoltDB Click Stream Processing Example.
☆16Jan 2, 2018Updated 8 years ago
kubeflow / chainer-operator
View on GitHub
Repository for chainer operator
☆17Nov 14, 2021Updated 4 years ago
a3794110 / ns3-SUMO-Interface
View on GitHub
Network Simulation for Urban Mobility: Interface between ns-3 and SUMO. The project allow user to control ns-3 LTE module and the UE mobi…
☆44Sep 8, 2019Updated 6 years ago
strin / DeepBayes
View on GitHub
Code for Max-Margin Deep Generative Models
☆12Jan 1, 2015Updated 11 years ago
hyperhq / www.hyper.sh
View on GitHub
Hyper.sh Website
☆12Mar 5, 2019Updated 7 years ago
Hassankashi / Machine-Learning-ex1-Linear-Regression-University-of-Stanford-Coursera
View on GitHub
☆13Jul 11, 2017Updated 9 years ago
BlueBrain / bluebrain.github.com
View on GitHub
API documentation for BlueBrain projects:
☆12Dec 1, 2021Updated 4 years ago
trec-kba / streamcorpus
View on GitHub
common data interchange format for document processing pipelines that apply natural language processing tools to large streams of text
☆35Sep 30, 2016Updated 9 years ago
ty4cheung / bootdo
View on GitHub
SpringBoot+Thymeleaf+shiro+activity+vue.js 敏捷ERP+OA+CRM开发
☆18Jun 25, 2018Updated 8 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
igool / docx-html-editor
View on GitHub
WORD 在线编辑
☆10Jul 31, 2017Updated 8 years ago
Parsa33033 / Search-Engine-Core
View on GitHub
a simple search engine for university of tehran URLs
☆16Dec 15, 2018Updated 7 years ago
petergdoyle / SparkCourse
View on GitHub
Taming Big Data with Apache Spark and Python - Hands On - Udemy
☆19Jul 11, 2016Updated 10 years ago
chenlein / springboot-oauth
View on GitHub
Spring boot 2 + Spring security 5 实现 SSO + OAuth认证
☆13Jul 13, 2018Updated 8 years ago
Interbotix / HROS5-Framework
View on GitHub
HR-OS5 Framework, based on the Darwin-OP project. Intended for use on HR-OS5 Research Humanoid Robot platforms.
☆11Mar 22, 2017Updated 9 years ago
diffbot / wikistatsextractor
View on GitHub
Extract statistics from Wikipedia Dump files.
☆26Aug 2, 2021Updated 4 years ago
mislam77-git / examples
View on GitHub
Examples for Apache Oozie book
☆18May 30, 2016Updated 10 years ago