aryansbtloe / ExperimentWithTesseract
☆24Updated 12 years ago
Alternatives and similar repositories for ExperimentWithTesseract
Users who are interested in ExperimentWithTesseract are comparing it to the libraries listed below.
- DEPRECATED, since we cannot maintain this Luke repo any longer. Please fork / Luke fork for Lucene 4.3 (mavenized)☆15Updated 4 years ago
- Brand disambiguator for tweets to differentiate e.g. Orange vs orange (brand vs foodstuff), using NLTK and scikit-learn☆58Updated 12 years ago
- ONLYOFFICE-OnlineEditors☆14Updated 10 years ago
- my take at a PDF text extraction utility☆25Updated 10 years ago
- Wrapper for pdftohtml that tries to extract paragraph structure☆52Updated 6 years ago
- Focused Crawler for VT's CTRNet☆10Updated 12 years ago
- A small framework taking over the manual training process described in the Tesseract3 Wiki: https://code.google.com/p/tesseract-ocr/wiki/…☆131Updated 2 years ago
- Matlab based document image analysis and classification system, that makes heavy use of contextual and language cues to decode image glyp…☆12Updated 14 years ago
- OCRonet is optical character recognition (OCR) and document analysis system based on Convolutional Neural Networks (LeNet-5) and OCRopus.☆21Updated 6 years ago
- Term List Matching Plugin for ElasticSearch☆26Updated 11 years ago
- iCQA - Intelligent Community Question Answering Framework☆31Updated 9 years ago
- .NET PDF viewer based on Chrome pdf.dll and xPDF☆35Updated 11 years ago
- A custom SimilarityProvider example for Elasticsearch☆36Updated 10 years ago
- ABBYY Cloud OCR SDK☆525Updated 2 years ago
- Elasticsearch Combo Analyzer☆86Updated 8 years ago
- Recommendations Serving Engine using python☆28Updated 10 years ago
- Facilitates the indexing of content from a CSV into ElasticSearch☆26Updated 12 years ago
- Python API for Various DB-Backed Simhash Clusters☆64Updated 8 years ago
- CRFSharp is Conditional Random Fields implemented by .NET(C#), a machine learning algorithm for learning from labeled sequences of exampl…☆122Updated 5 years ago
- Web/FileSystem Crawler Library☆28Updated last week
- The Lightning EAM Project, to create the World's fastest Enterprise Asset Management system.☆31Updated 9 years ago
- Text Detection and Recognition in Video☆11Updated 11 years ago
- A simple office file reader can extract content and summary information from .doc,.docx,.ppt,.pptx files without Microsoft Office or inte…☆74Updated 7 years ago
- From Natural Language Text to Graph Database☆31Updated 9 years ago
- memcached transport plugin for elasticsearch (STOPPED)☆34Updated 2 years ago
- Project template to create VSTO or COM based Visio Addin, in C# or VB.NET☆25Updated 2 years ago
- Full text extraction using the Open Source Tesseract OCR software https://code.google.com/p/tesseract-ocr/ and imagemagick☆13Updated 10 years ago
- Database Benchmark is one of the most powerful open source tools designed to stress test databases with large data flows.☆82Updated 9 years ago
- A small HTTP API for SyntaxNet☆19Updated 6 years ago
- Base components for Question Answering pipelines☆28Updated 3 years ago