Analyze XML extracted from PDFs (e.g. from TET or PDFMiner)
☆20Jan 11, 2018Updated 8 years ago
Alternatives and similar repositories for freki
Users that are interested in freki are comparing it to the libraries listed below
Sorting:
- A tool for correcting misspellings in textual input using the Noisy Channel Model.☆11Sep 26, 2020Updated 5 years ago
- Spell checker using Brill and Moore's noisy channel error model☆12Jan 9, 2019Updated 7 years ago
- ☆15Feb 5, 2019Updated 7 years ago
- ☆20Jul 22, 2021Updated 4 years ago
- A curated list of amazingly libraries, services and resources to work with PDF files☆16Jan 28, 2026Updated last month
- Phraseg - 一言:新詞發現工具包☆26Nov 30, 2021Updated 4 years ago
- Trained Detectron2 object detection models for document layout analysis based on PubLayNet dataset☆29Apr 16, 2023Updated 2 years ago
- ☆26Nov 22, 2022Updated 3 years ago
- Speech Recognition implementation using Artificial Neural Networks☆10Sep 7, 2015Updated 10 years ago
- Follow this "Urban Roast" demo tutorial series to power your app with Visualize.js!☆11Oct 10, 2023Updated 2 years ago
- InternetChessKiller - Program created for automatic use computer chess engine program help for playing on chess servers. Short descriptio…☆12Sep 11, 2015Updated 10 years ago
- Comparable documents miner: Arabic-English morphological analysis, text processing, n-gram features extraction, POS tagging, dictionary t…☆35Apr 24, 2017Updated 8 years ago
- Russian words synonyms and antonyms☆11Dec 7, 2021Updated 4 years ago
- Python tool for normilizing text and text canonicalization (DISCONTINUED)☆41Sep 3, 2013Updated 12 years ago
- 📜 Dehyphenation of broken text (mainly German), i.e., extracted from a PDF☆39Mar 8, 2022Updated 3 years ago
- ☆11Aug 23, 2022Updated 3 years ago
- MACE is A C++ Engine☆10Dec 9, 2019Updated 6 years ago
- Codee: An efficient AI programming assistant☆17Jan 16, 2026Updated last month
- NuART-Py: Python Library of Adaptive Resonance Theory Neural Network☆10Jan 26, 2020Updated 6 years ago
- Knock your images before you get stressed.☆11Jan 9, 2022Updated 4 years ago
- Scrape Youtube for videos and extract screenshots from the videos☆12Feb 12, 2021Updated 5 years ago
- ☆15Mar 15, 2022Updated 3 years ago
- Repository for Manning Twitch session about building and deploying APIs with Python☆12Jul 19, 2021Updated 4 years ago
- Business and performance KPIs drawn from game analytics using a large dataset☆11Mar 2, 2019Updated 7 years ago
- Code for our paper "Active Perception using Light Curtains for Autonomous Driving", ECCV 2020☆10Dec 7, 2021Updated 4 years ago
- Using acceleration and heart rate data to classify awake, deep, and light sleep☆10Dec 21, 2017Updated 8 years ago
- A signal processing library, currently sufficient for basic speech recognition stuff like mel frequency cepstrum☆19Mar 15, 2012Updated 13 years ago
- An Offline and Secure Retrieval-Augmented Generation (RAG) system designed for efficient processing of diverse content types with minimal …☆20Dec 29, 2024Updated last year
- Data profiling tools for Big Data☆11Nov 17, 2025Updated 3 months ago
- This repo contains all the cheatsheets that I found Important.☆10Oct 27, 2020Updated 5 years ago
- This repository contains the code of the Rasa workshop at PyData NYC 2018☆12Oct 19, 2018Updated 7 years ago
- Simple implementation of TDOA localization algorithm.☆13Oct 12, 2016Updated 9 years ago
- This is my 2024 course for TAP Institute on Vector Databases and Semantic Searching.☆12Jul 26, 2024Updated last year
- Research_speech_speaker_verification_nist_sre2010☆12Mar 1, 2016Updated 10 years ago
- Tool for slot extraction from text☆15Oct 23, 2022Updated 3 years ago
- Hadoop-based tool for extraction of large scale synchronous grammars for paraphrasing and machine translation☆15Dec 2, 2016Updated 9 years ago
- Faster access to Tesseract-OCR from Python☆13Jun 8, 2021Updated 4 years ago
- This is now the official location of the Kaldi project.☆10Aug 22, 2019Updated 6 years ago
- Character Based Named Entity Recognition.☆40Apr 3, 2018Updated 7 years ago