cskau / jpn.traineddata
A framework, data and configs for generating and building Tesseract OCR lang.traineddata model files, specifically for Japanese
☆10Updated 12 years ago
Alternatives and similar repositories for jpn.traineddata
Users that are interested in jpn.traineddata are comparing it to the libraries listed below
- Soft Confidence-Weighted Learning in Python☆15Updated 8 years ago
- Text Detection and Recognition in Video☆11Updated 12 years ago
- Grecka is a python script to convert Greek to Greeklish based on ELOT 743☆12Updated 7 years ago
- Yamabiko (Fluentd based MySQL/MariaDB Replicator for Elasticsearch/Solr)☆30Updated 12 years ago
- Matlab based document image analysis and classification system, that makes heavy use of contextual and language cues to decode image glyp…☆12Updated 14 years ago
- Image classifier built with Chainer, an implementation of OverFeat.☆13Updated 10 years ago
- Focused Crawler for VT's CTRNet☆10Updated 12 years ago
- Data collection for Airbnb business☆13Updated 11 years ago
- a SQL-like command line client for elasticsearch☆45Updated 7 years ago
- Simple Hungarian Sentence Analysis with NLTK☆16Updated 4 years ago
- A Python script to speak some text with Google Translate.☆23Updated 12 years ago
- ☆35Updated 8 years ago
- Term List Matching Plugin for ElasticSearch☆26Updated 12 years ago
- A pretty bare set up for running Flask in nginx through uwsgi in Vagrant deployed by Puppet. Got it?☆24Updated 9 years ago
- Translate between English and Japanese using Google☆31Updated 11 years ago
- Full text extraction using the Open Source Tesseract OCR software https://code.google.com/p/tesseract-ocr/ and imagemagick☆13Updated 10 years ago
- Facilitates the indexing of content from a CSV into ElasticSearch☆27Updated 12 years ago
- Semantic dependency relationship extractor for Indonesian... including slang and alay ;) (inspired by OpenCog RelEx)☆10Updated 10 years ago
- Example code for consuming Twitter's Streaming API using Python and pycurl☆39Updated 12 years ago
- A simple web framework based on asyncio.☆25Updated 9 years ago
- COrpus based Morphological Analyzer with INtegrated User dictionary☆21Updated 10 months ago
- Distributed Proofreading of Automatic Segmentations☆15Updated 3 years ago
- help source for unite.vim☆22Updated 8 years ago
- Brand disambiguator for tweets to differentiate e.g. Orange vs orange (brand vs foodstuff), using NLTK and scikit-learn☆58Updated 12 years ago
- Latent Dirichlet Allocation with Gibbs sampling☆16Updated 12 years ago
- This is an experimental project to enhance Optical Character Recognition technique to recognize text from natural images☆14Updated 11 years ago
- Speech ANDroid Apps☆20Updated 12 years ago
- Homebrew implementation of IBM Watson DeepQA (NLTK, Semantic Web, AI strategies)☆16Updated 14 years ago
- An application of stacked denoising autoencoders to multi-modal (images and audio) abstract feature discovery☆12Updated 12 years ago
- ☆27Updated 8 years ago