A basic tool that extracts the structure from the PDF files of scientific articles.
☆76Jan 4, 2022Updated 4 years ago
Alternatives and similar repositories for pdfact
Users that are interested in pdfact are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The repository of Icecite, a research paper management system.☆15Mar 29, 2018Updated 8 years ago
- A project about benchmarking and evaluating existing PDF extraction tools on their semantic abilities to extract the body texts from PDF …☆70Nov 7, 2020Updated 5 years ago
- Named Entity Disambiguation and Linking☆16May 24, 2024Updated last year
- table understanding dataset for comparative evaluation of different table understanding algorithms☆13Jun 15, 2018Updated 7 years ago
- A Test Collection for Evaluating Retrieval of Studies for Inclusion in Systematic Reviews☆12Sep 22, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Scripts for file processing and analysis in phylogenetics and phylogeography☆13Jan 6, 2021Updated 5 years ago
- Structured Data from PDF image-based files☆91Mar 1, 2013Updated 13 years ago
- Simulated user for TREC 2016-2017 Dynamic Domain track☆10Dec 27, 2017Updated 8 years ago
- The qlever command-line tool. With this you can control (almost) everything QLever can do☆67Apr 9, 2026Updated last week
- A distributed stream querying engine that provides sub-millisecond stateful query at millions of queries per-second over fast-evolving li…☆10Jul 18, 2018Updated 7 years ago
- Tokenize and clean strings in Python☆11Jan 11, 2018Updated 8 years ago
- Spell checker using Brill and Moore's noisy channel error model☆12Jan 9, 2019Updated 7 years ago
- An R package to write Datalog queries and interact with a Datomic database☆11Aug 12, 2021Updated 4 years ago
- Multi-Entity Extraction Framework for Academic Documents (with default extraction tools)☆31Oct 3, 2023Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Batch scripts curating BioRxiv and PubMed articles by using Altmetric score.☆11May 9, 2020Updated 5 years ago
- Non-official Fuseki Docker image with GeoSPARQL support☆12Mar 26, 2026Updated 3 weeks ago
- Flint SPARQL editor☆51Oct 16, 2012Updated 13 years ago
- A tool for correcting misspellings in textual input using the Noisy Channel Model.☆11Sep 26, 2020Updated 5 years ago
- ☆16Apr 9, 2026Updated last week
- A command-line DSL budget manager☆13Oct 25, 2022Updated 3 years ago
- Named Entity Recognition with the Nametag Maximum Entropy Markov model☆12Feb 9, 2026Updated 2 months ago
- Science Parse parses scientific papers (in PDF form) and returns them in structured form.☆698May 26, 2024Updated last year
- ☆20Jul 22, 2021Updated 4 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Extract text from your DOCX documents.☆11Feb 10, 2024Updated 2 years ago
- Tree reconstruction of ancestry using incomplete lineage sorting☆11Jan 9, 2026Updated 3 months ago
- Add custom struct tags to protobuf generated structs☆11Mar 3, 2019Updated 7 years ago
- Code and Data for paper: Estimating Attention Flow in Online Video Networks (CSCW '19)☆12Nov 19, 2019Updated 6 years ago
- Jurisdiction ID and abbreviation data files for using with Jurism and other projects.☆44Nov 8, 2023Updated 2 years ago
- demonstrating shunting yard algorithm and evaluation of arithmetic expressions☆15Jan 11, 2025Updated last year
- Analyze XML extracted from PDFs (e.g. from TET or PDFMiner)☆20Jan 11, 2018Updated 8 years ago
- liberate all kinds of data from PDF and other unstructural format and make the information machine-readable and visualizeable for popul…☆31Jun 1, 2018Updated 7 years ago
- User interface components for Prototype.js☆94Jul 8, 2009Updated 16 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Basic RDF Datatypes☆15Feb 23, 2026Updated last month
- TeXoo – A Zoo of Text Extractors☆18Jun 2, 2020Updated 5 years ago
- Incorporating VIsual LAyout Structures for Scientific Text Classification☆180Mar 18, 2023Updated 3 years ago
- Text pattern search using marisa-trie☆19Jan 26, 2025Updated last year
- Gather meta information from chrome web store.☆11Apr 13, 2021Updated 5 years ago
- Readme2: An R Package for Improved Automated Nonparametric Content Analysis for Social Science☆49Dec 30, 2025Updated 3 months ago
- How to structure a Docker Swarm infrastructure repository in 2023. Uses actually good encryption with swarmsible, nothelm.py and docker-s…☆12Dec 29, 2024Updated last year