readikus / ramekin
An open source, real time trend detection library
☆9Updated 5 years ago
Alternatives and similar repositories for ramekin:
Users that are interested in ramekin are comparing it to the libraries listed below
- Record Linkage ToolKit (Find and link entities)☆110Updated last year
- Scaping of Beauty Products from different UK based beauty sites.☆19Updated 9 years ago
- GraphiPy: Universal Social Data Extractor☆82Updated 2 years ago
- Download DIG to run on your laptop or server.☆101Updated 6 years ago
- General Architecture for Text Engineering☆49Updated 9 years ago
- Deep learning based Smart Web Crawler☆31Updated 6 years ago
- JedAI-WebApp is a GUI that facilitates the execution of JedAI. JedAI is an open source, high scalability toolkit that offers out-of-the-b…☆23Updated 2 years ago
- A search engine for Open Data☆53Updated 2 years ago
- A curated list of promising Web Data Extractors resources☆28Updated 5 years ago
- This repository contains the Domain Discovery Tool (DDT) project. DDT is an interactive system that helps users explore and better unders…☆45Updated 3 years ago
- A Python client for the People Data Labs API☆32Updated this week
- Architecture of Twint scrapper which allow download tweets on many instances without api restrictions☆10Updated 4 years ago
- A free, client-side web scraper that turns websites into structured data without having to use code.☆52Updated 8 years ago
- An easy-to-use and highly customizable crawler that enables you to create your own little Web archives (WARC/CDX)☆24Updated 7 years ago
- This Python package can be used to systematically extract multiple data elements (e.g., title, keywords, text) from news sources around t…☆33Updated 2 years ago
- Scalable String Similarity Joins in Python☆39Updated 9 months ago
- Python/Django based webapps and web user interfaces for search, structure (meta data management like thesaurus, ontologies, annotations a…☆97Updated 2 years ago
- Make it easier to compare and cross-reference the names of companies and people by applying strong normalisation.☆150Updated 2 months ago
- CoCrawler is a versatile web crawler built using modern tools and concurrency.☆190Updated 2 years ago
- Adaptive crawler which uses Reinforcement Learning methods☆169Updated 6 years ago
- Trend detection algorithms for Twitter time series data☆192Updated 8 years ago
- Elwha is a Java application for monitoring topics, sentiment and events on Twitter streams with the ability to generate notification mess…☆16Updated 9 years ago
- Text summarization using spacy☆22Updated 2 years ago
- Python based Open Source ETL tools for file crawling, document processing (text extraction, OCR), content analysis (Entity Extraction & N…☆268Updated 2 years ago
- Extracting Entities with Limited Evidence☆16Updated 2 years ago
- 🚀GUI for training spaCy models☆54Updated 3 years ago
- Trying to generate name synonyms from wikidata☆32Updated 4 years ago
- Extract social media links and account names from websites.☆38Updated 4 years ago
- Automatic tagging and analysis of documents in an Apache Solr index for faceted search by RDF(S) Ontologies & SKOS thesauri☆47Updated 3 years ago
- Parsing resumes in a PDF format from linkedIn☆68Updated 8 years ago