python based crawler to mine pdfs from websites and extracting useful features for data extraction
☆21Oct 4, 2020Updated 5 years ago
Alternatives and similar repositories for pdf-miner
Users that are interested in pdf-miner are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An offline IDE for C++, although similar to ideone.com, but ensures that your code doesn't fall into wrong hands :p☆16Feb 18, 2016Updated 10 years ago
- ☆22May 5, 2020Updated 6 years ago
- Open-source home automation platform for Android and Web☆28Jul 16, 2016Updated 9 years ago
- ☆13Dec 4, 2015Updated 10 years ago
- CryptoGuy is a tool usefull to find out various decryptions of a string☆25Mar 22, 2015Updated 11 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Flappy Bird Automation using RL and Servo☆30Sep 2, 2016Updated 9 years ago
- A collection of my implementations of commonly used Data Structures and Algorithms in programming competitions☆12Sep 5, 2017Updated 8 years ago
- Writeups for different CTF challenges☆71Mar 25, 2025Updated last year
- Boilerplate to port your bookmarklet to a Firefox Add-on☆12Dec 6, 2015Updated 10 years ago
- iOS tweak to disable Telegram Pornography/Copyright checks for Channels and Groups☆11Jun 20, 2020Updated 5 years ago
- Official web interface for pyLoad☆16Aug 23, 2020Updated 5 years ago
- Example for convert bookmarklet to chrome extension package☆21Jul 1, 2010Updated 15 years ago
- you idiots☆11Oct 30, 2024Updated last year
- Online version of the famous game "Chain Reaction"!☆38Oct 4, 2020Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Parse or build bookmark's files☆10Mar 16, 2018Updated 8 years ago
- ☆10Nov 22, 2015Updated 10 years ago
- SPE3D - Fast and Easy Download Manager for Servers☆17Feb 25, 2019Updated 7 years ago
- Python module for Named Entity Recognition (NER) using natural language processing.☆13May 30, 2021Updated 4 years ago
- YouTube Watch Later playlist Workflow integration☆17Jan 20, 2023Updated 3 years ago
- Procedural farm generator in Unity 3D☆14Jul 17, 2017Updated 8 years ago
- ☆10Mar 7, 2017Updated 9 years ago
- Companion repo for my blog post about Wolfram's elementary cellular automata☆10Feb 11, 2017Updated 9 years ago
- Crowdsourced event coverage system developed by Pittsburgh IndyMedia during the G20. Incorporates twitter, flickr, google maps, podcast…☆25Feb 15, 2010Updated 16 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Portable version of the Pentaho Data Integration (Kettle) application, for Windows☆13Jun 7, 2024Updated last year
- A python interface to WolframAlpha.☆15Apr 4, 2010Updated 16 years ago
- Code and Presentation for PyCon2016☆12Sep 14, 2016Updated 9 years ago
- A meta-scan tool used to kick off a number of command-line security tools during VA/PT work.☆23May 1, 2022Updated 4 years ago
- Execute python code directly from inside Apple Notes!☆21Aug 18, 2024Updated last year
- Product-Info-Crawler is a python web crawler developed using scrapy framework to crawl e-commerce websites for products matching search k…☆17Jul 4, 2021Updated 4 years ago
- Junction Tree Variational AutoEncoder Implementation Attempt☆11Jun 21, 2018Updated 7 years ago
- End to end system on recognition of Handwritten Math Symbols☆12Aug 27, 2016Updated 9 years ago
- The shared host ffmpeg is a software fully written in shell script for configuring a video sharing environment in shared hosting accounts…☆16Sep 5, 2012Updated 13 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Mapping Framework powering TNC Coastal Resilience programs☆13Mar 20, 2021Updated 5 years ago
- ☆16Sep 17, 2021Updated 4 years ago
- In this repo you can get many python scripts,games and projects.☆10Feb 15, 2023Updated 3 years ago
- A complete daily plan for studying to become a Google software engineer.☆23Apr 12, 2020Updated 6 years ago
- Uncover IPv6 address harvesting through firewall log analysis☆13Jan 29, 2016Updated 10 years ago
- Code associated with our paper "Robust Domain Randomization for Reinforcement Learning"☆12Nov 22, 2022Updated 3 years ago
- Explanation-centered inference for question answering☆16Feb 7, 2018Updated 8 years ago