OpenOil-UG / aleph
Toys for sifting through large sets of documents.
☆13 Updated 7 years ago
Alternatives and similar repositories for aleph:
Users who are interested in aleph are comparing it to the libraries listed below.
- Rewrite text in linear time. ☆81 Updated last year
- OpenTeams is an open-source team visualization tool. ☆84 Updated 4 years ago
- A queue-controlled browser automation tool for improving web crawl quality ☆60 Updated 4 years ago
- Command line tool to convert spreadsheets to databases, made for the UK's Office for National Statistics. ☆78 Updated last year
- Twitter, quick. Fetch and store tweets on short notice. ☆80 Updated 8 years ago
- ☆24 Updated 9 years ago
- Federal Crime Data Standardization and Analysis — The Trace and BuzzFeed News ☆35 Updated 5 years ago
- A review of the deprecated Freebase knowledge base and Metaweb Query Language (MQL). A brief comparison of MQL and GraphQL. ☆42 Updated 7 years ago
- Scripts and microservice to feed an ElasticSearch with Wikidata and Inventaire entities, and keep those up-to-date ☆41 Updated 4 years ago
- A pipeline for detecting novel information about entities from a stream of text, updating a knowledge base about the entities, and genera… ☆32 Updated 5 years ago
- A simple interface for extracting text from (almost) any URL ☆52 Updated 5 years ago
- Navigating around a grid of cells like XPath for spreadsheets; supports Python 3.5+ ☆47 Updated last year
- Deployment of pywb as a CommonCrawl Index Server ☆21 Updated 7 years ago
- Data Pipes for CSV ☆117 Updated 2 years ago
- Extract tabular data and semantically discover it with ease! (OS) ☆21 Updated 8 years ago
- A library to extract a publication date from a web page, along with a measure of the accuracy. ☆42 Updated 5 years ago
- A trend viewer written in Python/JavaScript ☆21 Updated 2 months ago
- A space for code and projects around analysing news content ☆23 Updated 6 years ago
- Analyze the structure and dynamics of an open source project's developer community, using graph algorithms, etc. ☆58 Updated 3 years ago
- An easy-to-use and highly customizable crawler that enables you to create your own little Web archives (WARC/CDX) ☆25 Updated 7 years ago
- Open source large document set visualization platform ☆268 Updated 2 years ago
- Meta-repository for the open-source version of the SUMMA Platform ☆16 Updated 10 months ago
- Persine is an automated tool to study and reverse-engineer algorithmic recommendation systems. ☆91 Updated 4 years ago
- Named-Entity Recognition extension for Google Refine / OpenRefine ☆72 Updated 7 years ago
- Code + Jupyter Notebooks for Visualizing Clusters of Clickbait Headlines Using Spark, Word2vec, and Plotly ☆47 Updated 4 years ago
- Supervised learning for novelty detection in text ☆79 Updated 8 years ago
- Visualization of interaction between entities ☆16 Updated 8 years ago
- The Python-language successor to the TABARI event-data coding software. ☆45 Updated 7 years ago
- A toolbox and web application for working with and presenting textual material from Shakespeare to Schopenhauer, and letters to literatur… ☆149 Updated 9 years ago
- General Architecture for Text Engineering ☆46 Updated 8 years ago