talumbau / PyPDF2
A utility to read and write pdfs with Python
☆15Updated 11 years ago
Alternatives and similar repositories for PyPDF2:
Users that are interested in PyPDF2 are comparing it to the libraries listed below
- Datamallet is a python library which contains several helper functions and module for the common tasks in a typical data science workflow…☆11Updated 2 years ago
- ☆17Updated 2 years ago
- ☆14Updated last year
- This is the repo for the container that holds the models for the text2vec-transformers module☆48Updated 2 weeks ago
- Spark NLP for Streamlit☆15Updated 3 years ago
- Retrieval Augmented Generation applications☆26Updated last year
- ☆21Updated 9 months ago
- Common API for all "second gen" AutoML APIs: Auger.AI, Google Cloud AutoML and Azure AutoML☆41Updated last month
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy☆63Updated 11 months ago
- Hybrid architecture media server, media service and Streamlit client app using FastAPI and Python☆13Updated 2 years ago
- PromptCraft is a prompt perturbation toolkit from the character, word, and sentence levels for prompt robustness analysis. PyPI Package: …☆14Updated last year
- Python bindings for Matroid API☆16Updated last month
- Language detection using Spacy and Fasttext☆55Updated last year
- Model explanation provides the ability to interpret the effect of the predictors on the composition of an individual score.☆13Updated 4 years ago
- A personal knowledge base that I can dump information to and help me learn☆24Updated 8 months ago
- ☆41Updated 2 months ago
- ☄️ Parallel and distributed training with spaCy and Ray☆53Updated last year
- spaCy entry points for Curated Transformers☆26Updated 4 months ago
- Presentations & notebooks from our talks /workshops/meetups/etc☆24Updated 6 years ago
- ☆15Updated last year
- Text Processing & Segmentation Framework☆20Updated 3 weeks ago
- Prototyping a question and answer bot over PDFs☆38Updated last year
- SpaCyEx allows the creation of spaCy Matcher patterns with RegEx like syntax.☆59Updated 9 months ago
- Use sync mode Playwright interactively, inside a Jupyter notebook☆14Updated 2 months ago
- Convert one or more XML files into Apache Parquet format. Only requires a XSD and XML file to get started.☆32Updated 2 years ago
- ☆13Updated 5 years ago
- Supervised instruction finetuning for LLM with HF trainer and Deepspeed☆34Updated last year
- A framework for simulating e-commerce data and interactions that can be used to build recommendation systems☆10Updated last year
- Efficient BM25 with DuckDB 🦆☆39Updated last month
- A textual TUI for Prodigy☆14Updated last year