High-level build project for all LAPDF-Text submodules
☆103Jul 2, 2015Updated 10 years ago
Alternatives and similar repositories for lapdftextProject
Users that are interested in lapdftextProject are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- my take at a PDF text extraction utility☆25Jun 15, 2015Updated 10 years ago
- MOVED TO https://gitlab.com/crossref/pdfextract☆510Jul 26, 2017Updated 8 years ago
- Whole-cell modeling tutorials☆17Jun 26, 2020Updated 5 years ago
- Serverless AI document extraction using Form Recognizer, Azure Functions, and Azure Blob Storage.☆11May 23, 2024Updated last year
- Matlab implementation of polar codes for a BEC☆11Dec 1, 2017Updated 8 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Data and code for the paper "CiteWorth: Cite-Worthiness Detection for Improved Scientific Document Understanding"☆14Sep 8, 2022Updated 3 years ago
- Navigating around a grid of cells like XPath for spreadsheets; supports Python 3.5+☆49Feb 1, 2023Updated 3 years ago
- Download DIG to run on your laptop or server.☆105Jan 9, 2019Updated 7 years ago
- Generate class diagram representing empirical schema of data in a SPARQL endpoint☆10Jan 4, 2018Updated 8 years ago
- Full text extraction using the Open Source Tesseract OCR software https://code.google.com/p/tesseract-ocr/ and imagemagick☆13May 16, 2015Updated 10 years ago
- r4c☆14Mar 2, 2021Updated 5 years ago
- An ongoing series of notebooks aimed at helping fellow NLP enthusiasts think about applying new tools and techniques to practical tasks.☆18Dec 29, 2020Updated 5 years ago
- RUN LENGTH SMOOTHING ALGORITHM(RLSA) is a method mainly used for block segmentation and text discrimination. It helps to extract the nece…☆24Jun 21, 2022Updated 3 years ago
- Sample Python code for driving an HTML-based hypermedia API☆36Jun 27, 2013Updated 12 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Embedr.eu - Image Embedding Service (IES) with support for IIIF, OEmbed, zoomable viewer in an iFrame☆15Dec 5, 2015Updated 10 years ago
- kaggle allen ai competition☆17Feb 23, 2016Updated 10 years ago
- Weakly-supervised Text Classification Based on Keyword Graph☆23Jan 8, 2023Updated 3 years ago
- Reactive Database engine for Java with RocksDB and Lucene Core☆15Feb 24, 2026Updated 2 months ago
- Lint doc structure against templates☆12Jan 16, 2025Updated last year
- Speech waveform synthesis filters☆13Jul 21, 2017Updated 8 years ago
- Implementation Saved Searches a la ElasticSearch Percolator☆12May 20, 2022Updated 3 years ago
- SpaCyEx allows the creation of spaCy Matcher patterns with RegEx like syntax.☆59May 3, 2024Updated last year
- Disambiguation of Semantic Resources - Full framework☆30Oct 31, 2016Updated 9 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A set of tools to allow PDF to XML conversion, utilising Apache Beam and other tools. The aim of this project is to bring multiple tools…☆297Apr 21, 2026Updated last week
- Advanced support for working with RDF in Prolog.☆19Aug 31, 2024Updated last year
- DEPRECATED: Use https://github.com/18F/gapps-download instead☆10Oct 27, 2015Updated 10 years ago
- Interactive notebooks for trying analyses and exploring datasets☆32Aug 10, 2015Updated 10 years ago
- Ultimate ReSt Api☆11Nov 22, 2017Updated 8 years ago
- JSON-LD serialization and deserialization for Java REST services.☆15Oct 31, 2025Updated 6 months ago
- Server application for publishing Geographic Linked Open Datasets via Web Feature Services.☆13Jun 3, 2024Updated last year
- Totally awesome Textmate bundle for Turtle – the terse RDF Triple Language.☆28Feb 12, 2018Updated 8 years ago
- A Question Answering System for Domain Knowledge Graphs☆11Feb 24, 2022Updated 4 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Progressively enhance your HTML with dynamic data☆13May 1, 2018Updated 7 years ago
- A Trello webhook server☆10May 18, 2016Updated 9 years ago
- Haversine distance between two points☆13Jun 20, 2023Updated 2 years ago
- A new solr multilingual index and search architecture, it can support index and search across multiple languages at the same time in the …☆13Oct 18, 2019Updated 6 years ago
- Dokku buildpack for GitLab☆22Apr 5, 2015Updated 11 years ago
- DITA RDF ontology and tools to publish DITA metadata to the Semantic Web☆22Jan 6, 2016Updated 10 years ago
- SeqScore: Scoring for named entity recognition and other sequence labeling tasks☆23Mar 30, 2026Updated last month