asepmaulanaismail / pdf-to-txt-python
Simple pdf to text with python using PDFtk and PyPDF2
☆20Updated last year
Alternatives and similar repositories for pdf-to-txt-python:
Users that are interested in pdf-to-txt-python are comparing it to the libraries listed below
- NLG Best Practices for Data-Efficient Modeling How to Train Production-Ready Models with Little Data☆10Updated 3 years ago
- MFIN7036 NLP Course Project☆9Updated 6 months ago
- Generate True or False questions from any content with OpenAI GPT2 text generation, Sentence-BERT semantic search and Berkley constituenc…☆33Updated 4 years ago
- ☆12Updated last year
- Uses Beautiful Soup to read Wiki pages, Gensim to summarize, NLTK to process, and extracts keywords based on entropy: everything in one b…☆9Updated 4 years ago
- Paraphrasing for academic texts☆14Updated 2 years ago
- Portfolio with data science and machine learning projects I developed during my training in data science.☆11Updated 4 years ago
- Training a model without a dataset for natural language inference (NLI)☆25Updated 4 years ago
- GisPy: A Tool for Measuring Gist Inference Score in Text https://aclanthology.org/2022.wnu-1.5/☆11Updated 6 months ago
- Language Models as Few-Shot Learner for Task-Oriented Dialogue Systems☆22Updated 3 years ago
- Abstract Semantic Textual Similarity (STS) measures the meaning similarity of sentences. Applications of this task include machine transl…☆14Updated 4 years ago
- "Unsupervised Paraphrase Generation using Pre-trained Language Model."☆22Updated 4 years ago
- An ongoing series of notebooks aimed at helping fellow NLP enthusiasts think about applying new tools and techniques to practical tasks.☆18Updated 4 years ago
- A set of methods for finding an appropriate number of topics in a text collection☆15Updated 5 months ago
- This repo is about the classification of rhetorical roles in Legal Documents such as: Citation, Findings of Fact, Evidence, Legal Rule, R…☆14Updated 2 years ago
- Large-scale query-focused multi-document Summarization dataset☆10Updated 3 years ago
- Code for Stage-wise Fine-tuning for Graph-to-Text Generation☆26Updated last year
- Code and data for Teddy https://arxiv.org/abs/2001.05171.☆15Updated 2 years ago
- This repo contains code for the paper "Psychologically-informed chain-of-thought prompts for metaphor understanding in large language mod…☆14Updated last year
- ☆20Updated 2 years ago
- Tool for the Automatic Assessment of Lexical Diversity☆11Updated 4 years ago
- DocAI helps developers quickly build document, image and text processing pipelines using open source and cloud-based machine learning mod…☆19Updated 2 years ago
- A Python package to get useful information from documents using TopicRank Algorithm.☆16Updated last year
- Code for Paper "Target-oriented Fine-tuning for Zero-Resource Named Entity Recognition"☆21Updated 2 years ago
- ☆23Updated 3 years ago
- Code for "Incorporating Relevance Feedback for Information-Seeking Retrieval using Few-Shot Document Re-Ranking" (https://arxiv.org/abs/2…☆12Updated last year
- How Will Your Tweet Be Received? Predicting theSentiment Polarity of Tweet Replies☆11Updated 3 years ago
- Stuff related to scraping the Code Review StackExchange☆11Updated 2 years ago
- ☆17Updated last year
- A collection of textual datasets in Hausa language and the corresponding translation in English language.☆14Updated 3 years ago