darenr / python-kafka-elasticsearch
Simple learning project pushing CSV data into Kafka then indexing the data in ElasticSearch
☆19Updated 9 years ago
Alternatives and similar repositories for python-kafka-elasticsearch:
Users that are interested in python-kafka-elasticsearch are comparing it to the libraries listed below
- Scrapy extension which writes crawled items to Kafka☆30Updated 6 years ago
- Kafka-based components for Scrapy☆79Updated 7 years ago
- This is a simple streaming application that utilises Kafka and Python☆45Updated 6 years ago
- High Level Kafka Scanner☆19Updated 7 years ago
- Simple demo Python client scripts for Apache Kafka☆16Updated 9 years ago
- feng - feature engineering for machine-learning champions☆27Updated 8 years ago
- Public code files for the DDL blog☆56Updated 6 years ago
- Just a boilerplate for PySpark and Flask☆35Updated 6 years ago
- Simple Web UI for Scrapy spider management via Scrapyd☆51Updated 6 years ago
- ☆16Updated 8 years ago
- experimenting with elasticsearch features for vector fields☆20Updated 2 years ago
- python library for interacting with SolrCloud☆36Updated 4 years ago
- Useful library for dealing with tasks DAG in Celery☆15Updated 7 years ago
- 使用Pykafka的正确姿势☆32Updated 9 years ago
- A Scalable Data Cleaning Library for PySpark.☆27Updated 6 years ago
- Docker compose files for various kafka stacks☆32Updated 7 years ago
- A python implementation of DEPTA☆83Updated 8 years ago
- Twitter sentiment analysis using Spark and Stanford CoreNLP and visualization using elasticsearch and kibana☆20Updated 7 years ago
- Simple demonstration of how to build a complex real time machine learning visualization tool.☆16Updated 9 years ago
- Exporters is an extensible export pipeline library that supports filter, transform and several sources and destinations☆40Updated 11 months ago
- Minimum Entropy is a DDL hosted question/answer site for beginners who need answers to Data Science questions.☆17Updated 8 years ago
- Fast Python Bloom Filter using Mmap☆13Updated 12 years ago
- Celery tasks to run a sample spider☆21Updated 9 years ago
- docker scrapyd scrapy boot2docker crawler - a spider Python application that can be "Dockerized".☆42Updated 10 years ago
- Find which links on a web page are pagination links☆29Updated 8 years ago
- Process large amount of Twitter data using Spark SQL (and its JSON support). Answers questions like "What are the most popular languages?…☆9Updated 10 years ago
- Building an API with the FastAPI framework to serve a scikit-learn model.☆18Updated 6 years ago
- Cubes OLAP Examples☆74Updated 6 years ago
- Scrapy Eagle is a tool that allow us to run any Scrapy based project in a distributed fashion and monitor how it is going on and how many…☆24Updated 4 years ago
- Tools and services for evaluating topic models☆15Updated 9 years ago