Python script to split PDF files into separate files based on bookmarks
☆15Jan 21, 2022Updated 4 years ago
Alternatives and similar repositories for PDFSplitter
Users that are interested in PDFSplitter are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Generates the most important key-phrase/key-words from a document based on a corpus☆10Jun 17, 2024Updated 2 years ago
- ☆13Apr 13, 2021Updated 5 years ago
- Detect the text orientation on a page with Tesseract OCR☆14Dec 18, 2020Updated 5 years ago
- Geoscience document layout for figures and figure classification inot geoscience categories☆11Apr 5, 2022Updated 4 years ago
- ☆16Jul 31, 2020Updated 5 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Code for the paper: Combining Graph Degeneracy and Submodularity for Unsupervised Extractive Summarization☆17Apr 24, 2020Updated 6 years ago
- ☆14Sep 22, 2016Updated 9 years ago
- It is a Chrome extension, an alternative to ChatGPT. It is free and no data leaves your computer. Powered by WebLLM.☆16Mar 4, 2024Updated 2 years ago
- UBOS administration tools☆16May 30, 2024Updated 2 years ago
- A helpful package that helps you access shell & shell-based applications via web application☆16Jul 25, 2023Updated 2 years ago
- An unsupervised text summarization and information retrieval library under the hood using natural language processing models☆15Dec 11, 2020Updated 5 years ago
- ☆15Dec 8, 2022Updated 3 years ago
- A simple machine learning package to cluster keywords in higher-level groups.☆18Jul 6, 2022Updated 3 years ago
- test☆22Nov 11, 2020Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Word analysis, by domain, on the Common Crawl data set for the purpose of finding industry trends☆57Jan 28, 2024Updated 2 years ago
- Dataset and Source code of paper 'Enhancing Keyphrase Extraction from Academic Articles with their Reference Information'.☆18Dec 26, 2022Updated 3 years ago
- Keyword extraction with spaCy☆30Nov 8, 2021Updated 4 years ago
- Colab notebooks for d2l-book☆11Dec 5, 2019Updated 6 years ago
- A Python library that enables smooth keyword extraction from any text using the RAKE(Rapid Automatic Keyword Extraction) algorithm.☆29May 3, 2024Updated 2 years ago
- A set of visualization engines.☆14Updated this week
- Python utility to export a user's starred repositories list into a CSV file☆17May 3, 2018Updated 8 years ago
- Material parsers and other tools, scripts Initially developed for Grobid Superconductor☆14Feb 21, 2025Updated last year
- Antiword: a free MS Word document reader☆73Jun 3, 2026Updated last month
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Visualizes search engine ranking algorithms for a given domain☆30Dec 13, 2010Updated 15 years ago
- Batch scripts curating BioRxiv and PubMed articles by using Altmetric score.☆11May 9, 2020Updated 6 years ago
- Simple wrapper around Puppeteer to take screenshot from command line.☆16Feb 12, 2022Updated 4 years ago
- Scholarly Big Data Subject Category Classifier☆10Jul 15, 2019Updated 6 years ago
- D3-based interactive bubble chart for topic model visualization☆13May 10, 2022Updated 4 years ago
- Create shellcode from executable or assembly code☆12Jul 31, 2017Updated 8 years ago
- 模拟浏览器脚本操作,使用nodejs来批量读取和操作网盘文件信息。 这个代码库是`百度网盘批量清理重复文件计划`的一部分。☆11Mar 16, 2023Updated 3 years ago
- Ask questions about government data.☆38Jan 17, 2019Updated 7 years ago
- Mirror of pdftk. For more information please see http://flowpaper.com☆11Sep 6, 2016Updated 9 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Command line application allowing you to download all issues in the CSV format from the public or private repository☆10Jul 16, 2020Updated 5 years ago
- Kex is a python library for unsupervised keyword extraction from a document, providing an easy interface and benchmarks on 15 public data…☆55Feb 17, 2022Updated 4 years ago
- seq2seq based keyphrase generation model sets, including copyrnn copycnn and copytransfomer☆50Feb 7, 2022Updated 4 years ago
- Algorithms from the book "Elements of Statistical Learning", implemented in Python☆13Mar 29, 2015Updated 11 years ago
- Visual Editor for Natural Language Processing pipelines☆14Apr 11, 2023Updated 3 years ago
- GHRecommender - personalized recommendations for GitHub projects based on information about repositories starred by the user☆14Feb 3, 2026Updated 5 months ago
- This repository contains some examples of using borb in google colab. These examples enable you to try out the features of borb without i…☆13Sep 4, 2022Updated 3 years ago