aryansbtloe / ExperimentWithTesseract
☆24Updated 12 years ago
Alternatives and similar repositories for ExperimentWithTesseract
Users that are interested in ExperimentWithTesseract are comparing it to the libraries listed below
- Brand disambiguator for tweets to differentiate e.g. Orange vs orange (brand vs foodstuff), using NLTK and scikit-learn☆58Updated 12 years ago
- Term List Matching Plugin for ElasticSearch☆26Updated 12 years ago
- ONLYOFFICE-OnlineEditors☆14Updated 11 years ago
- Focused Crawler for VT's CTRNet☆10Updated 12 years ago
- Analysis plugin for ElasticSearch providing capability for processing inline annotations in documents.☆35Updated 12 years ago
- Parse wikipedia dumps and index (some) page data to elasticsearch☆49Updated 10 years ago
- Facilitates the indexing of content from a CSV into ElasticSearch☆27Updated 12 years ago
- Apache Nutch extensions☆34Updated 3 years ago
- DEPRECATED, since we cannot maintain this Luke repo any longer. Please fork / Luke fork for Lucene 4.3 (mavenized)☆16Updated 4 years ago
- A custom SimilarityProvider example for Elasticsearch☆36Updated 10 years ago
- Parser for KAF NAF files written in Python☆16Updated 4 years ago
- FP-tree algorithm for mining frequent patterns☆20Updated 12 years ago
- Python API for Various DB-Backed Simhash Clusters☆64Updated 8 years ago
- OCRonet is an optical character recognition (OCR) and document analysis system based on Convolutional Neural Networks (LeNet-5) and OCRopus.☆22Updated 6 years ago
- Full text extraction using the Open Source Tesseract OCR software https://code.google.com/p/tesseract-ocr/ and imagemagick☆13Updated 10 years ago
- Elasticsearch Combo Analyzer☆86Updated 8 years ago
- .NET PDF viewer based on Chrome pdf.dll and xPDF☆35Updated 11 years ago
- Morpha lex stemmer converted using jflex.☆24Updated 5 years ago
- Generator of rule-based lemmatizers (based on examples) for several European languages.☆29Updated 4 years ago
- my take at a PDF text extraction utility☆25Updated 10 years ago
- Sample CBIR (Content Based Image Retrieval) application created in .NET, C#☆12Updated 10 years ago
- Language checker and hyphenator extension for LibreOffice☆12Updated 6 years ago
- Text Detection and Recognition in Video☆11Updated 12 years ago
- An Elasticsearch plugin that enables you to keep only the N latest indices.☆18Updated 11 years ago
- iCQA - Intelligent Community Question Answering Framework☆31Updated 9 years ago
- Fast Word Segmentation with Triangular Matrix☆86Updated 4 years ago
- Skeleton for Meetup - Building your own recommendation engine in an hour☆29Updated 4 years ago
- Speech ANDroid Apps☆20Updated 12 years ago
- Exports static methods in a managed DLL as library functions that can be called from an unmanaged Windows application.☆21Updated 10 years ago
- Identity Provider for Elasticsearch☆22Updated 9 years ago