cyb3rk0tik / pyfranc
Text language detection based on trigrams.
☆16Updated 2 years ago
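To make "based on trigrams" concrete, here is a minimal sketch of the general technique: build a ranked character-trigram profile per language, then pick the language whose profile is closest to the sample's. This is not pyfranc's actual code or API. The function names (`trigrams`, `rank_profile`, `distance`, `detect`), the toy training sentences, and the penalty value are all assumptions made for illustration.

```python
from collections import Counter


def trigrams(text):
    """Count character trigrams in lowercased, whitespace-normalized, space-padded text."""
    cleaned = " " + " ".join(text.lower().split()) + " "
    return Counter(cleaned[i:i + 3] for i in range(len(cleaned) - 2))


def rank_profile(text, top_n=300):
    """Map the most frequent trigrams to their rank (0 = most frequent)."""
    ranked = [gram for gram, _ in trigrams(text).most_common(top_n)]
    return {gram: rank for rank, gram in enumerate(ranked)}


def distance(sample_profile, language_profile, max_penalty=300):
    """Sum rank differences; trigrams missing from the language profile cost a fixed penalty."""
    total = 0
    for gram, rank in sample_profile.items():
        if gram in language_profile:
            total += abs(rank - language_profile[gram])
        else:
            total += max_penalty  # assumed penalty, chosen for illustration
    return total


def detect(text, profiles):
    """Return the language code whose profile has the smallest distance to the text."""
    sample = rank_profile(text)
    return min(profiles, key=lambda lang: distance(sample, profiles[lang]))


# Toy training data (hypothetical); real profiles are built from large corpora.
TRAINING = {
    "eng": "the quick brown fox jumps over the lazy dog and then the dog sleeps in the sun",
    "deu": "der schnelle braune fuchs springt über den faulen hund und dann schläft der hund in der sonne",
}
PROFILES = {lang: rank_profile(text) for lang, text in TRAINING.items()}

print(detect("the dog and the fox", PROFILES))
print(detect("der hund und der fuchs", PROFILES))
```

With profiles this small the result is only reliable for text that closely resembles the training sentences. Practical detectors of this kind depend on much larger per-language profiles.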
Alternatives and similar repositories for pyfranc
Users who are interested in pyfranc are comparing it to the libraries listed below.
- Extracts a latent knowledge graph from text and index/query it in elasticsearch or solr☆21Updated 3 years ago
- Tool that does layout analysis and/or text recognition using tesseract and outputs the result in Page XML format☆46Updated 7 months ago
- User contributed (non Google) OCR models for Tesseract☆29Updated 7 months ago
- Document level Attitude and Relation Extraction toolkit (AREkit) for sampling and processing large text collections with ML and for ML☆65Updated 10 months ago
- Multi-Language Identification☆28Updated last year
- Remove duplicate documents/videos/images via popular algorithms such as SimHash, SpotSig, Shingling, etc.☆18Updated 2 years ago
- Targeted language identifier, based on FastText and Hunspell.☆37Updated 2 months ago
- This is an object-oriented implementation of a Trie in Python. The class contains setter and getter methods, and implements several usefu…☆15Updated 7 years ago
- 🚀GUI for training spaCy models☆55Updated 4 years ago
- Indri search implementation on top of Lucene search engine☆35Updated last year
- Automatic Text Summarization and Title Generation.☆25Updated 4 years ago
- Extract dates from text☆65Updated 4 years ago
- FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (inclu…☆66Updated last year
- This Python package can be used to systematically extract multiple data elements (e.g., title, keywords, text) from news sources around t…☆33Updated 2 years ago
- Finds linguistic patterns effortlessly☆38Updated 2 years ago
- In the wild extraction of entities that are found using Flair and displayed using a very elegant front-end.☆71Updated 2 years ago
- Metadata Extractor & Loader (MEL) ■ The NLP-NER Toolkit (TNNT)☆25Updated 2 years ago
- A fast Python implementation of the SimHash algorithm.