Python-based crawler to mine PDFs from websites and extract useful features for data extraction
☆21Oct 4, 2020Updated 5 years ago
Alternatives and similar repositories for pdf-miner
Users who are interested in pdf-miner are comparing it to the libraries listed below.
- An offline IDE for C++, similar to ideone.com, that ensures your code doesn't fall into the wrong hands :p☆16Feb 18, 2016Updated 10 years ago
- ☆22May 5, 2020Updated 6 years ago
- Open-source home automation platform for Android and Web☆28Jul 16, 2016Updated 9 years ago
- ☆20Oct 26, 2020Updated 5 years ago
- CryptoGuy is a tool useful for finding various decryptions of a string☆25Mar 22, 2015Updated 11 years ago
- Code Fragments in Python & Perl☆20Feb 17, 2021Updated 5 years ago
- A simple, system independent infrastructure for performing web scraping. Utilizes Vagrant virtualbox interface and puppet provisioning to…☆24Jul 30, 2014Updated 11 years ago
- A collection of my implementations of commonly used Data Structures and Algorithms in programming competitions☆12Sep 5, 2017Updated 8 years ago
- ☆10Mar 31, 2017Updated 9 years ago
- Writeups for different CTF challenges☆71Mar 25, 2025Updated last year
- A list of papers, books and sites for various topics related to machine learning and deep learning along with various field in w…☆69Feb 9, 2018Updated 8 years ago
- Boilerplate to port your bookmarklet to a Firefox Add-on☆12Dec 6, 2015Updated 10 years ago
- iOS tweak to disable Telegram Pornography/Copyright checks for Channels and Groups☆11Jun 20, 2020Updated 5 years ago
- A python script to download course contents (videos, ppt, pdf, etc) from coursera.org☆57Jan 23, 2014Updated 12 years ago
- Official web interface for pyLoad☆15Aug 23, 2020Updated 5 years ago
- Python HTTP Live Streaming Downloader (Retrieving complete stream)☆10Apr 13, 2020Updated 6 years ago
- Example of converting a bookmarklet to a Chrome extension package☆21Jul 1, 2010Updated 15 years ago
- Tools of the trade☆85Sep 6, 2022Updated 3 years ago
- An AWS S3 file manager. It supports keyword search, upload, preview video and archive files into a zip then download it.☆11Mar 20, 2023Updated 3 years ago
- Online version of the famous game "Chain Reaction"!☆38Oct 4, 2020Updated 5 years ago
- Python scripts to download course videos off CDEEP☆12Oct 20, 2015Updated 10 years ago
- Lightweight installer/updater for portable desktop applications☆15Oct 5, 2020Updated 5 years ago
- Download music from Mixcloud☆11Aug 12, 2018Updated 7 years ago
- This is the official code used for WAT 2017 Description Paper titled A Bag of Useful Tricks for Practical Neural Machine Translation: Emb…☆12Oct 24, 2017Updated 8 years ago
- ☆11Nov 13, 2020Updated 5 years ago
- Python module for Named Entity Recognition (NER) using natural language processing.☆13May 30, 2021Updated 4 years ago
- YouTube Watch Later playlist Workflow integration☆17Jan 20, 2023Updated 3 years ago
- ☆21Feb 16, 2015Updated 11 years ago
- Exploiting cameras with a very distinctive HTTP Server header of "JAWS/1.0".☆10Jan 11, 2023Updated 3 years ago
- Procedural farm generator in Unity 3D☆14Jul 17, 2017Updated 8 years ago
- Android app to view HTML pages offline.☆22Dec 10, 2014Updated 11 years ago
- Mirror of https://gitlab.com/xdevs23/linux-nitrous☆11Dec 6, 2025Updated 5 months ago
- Bookmarklet compiler encloses, encodes, minifies your Javascript file and opens an HTML page with your new bookmarklet for immediate use…☆178Dec 18, 2017Updated 8 years ago
- ☆10Mar 7, 2017Updated 9 years ago
- Dockerfile for machine learning environment (scikit-learn, chainer, gensim, tensorflow, jupyter)☆10Aug 16, 2018Updated 7 years ago
- node package to convert a javascript source string into a bookmarklet☆24Jan 30, 2015Updated 11 years ago
- Generates a conversation word cloud from exported Facebook chat logs☆18Mar 23, 2018Updated 8 years ago
- Companion repo for my blog post about Wolfram's elementary cellular automata☆10Feb 11, 2017Updated 9 years ago
- Crowdsourced event coverage system developed by Pittsburgh IndyMedia during the G20. Incorporates Twitter, Flickr, Google Maps, podcast…☆25Feb 15, 2010Updated 16 years ago