A python library for extracting text from PDFs without losing the formatting of the PDF content.
☆79Jan 11, 2022Updated 4 years ago
Alternatives and similar repositories for multilingual-pdf2text
Users that are interested in multilingual-pdf2text are comparing it to the libraries listed below
Sorting:
- A Security System using the face recognition, which can be monitored from anywhere using a HTTP server, coded using Python and Jinja☆11Mar 4, 2021Updated 5 years ago
- Code for "The Whole Truth and Nothing But the Truth: Faithful and Controllable Dialogue Response Generation with Dataflow Transduction an…☆11Apr 30, 2024Updated last year
- Streamlit-based Web App for Ai Text Generation based on GPT-2 Models from HuggingFace Model Hub using Python library aitextgen☆27Nov 26, 2020Updated 5 years ago
- 🚀 This repo is a showcase of how you can use models deployed on AWS SageMaker in your Haystack Retrieval Augmented Generative AI pipelin…☆13Jul 27, 2023Updated 2 years ago
- Neural Search System on Arxiv AI/ML Papers☆54Aug 4, 2021Updated 4 years ago
- 暑期研究:神经网络解偏微分方程(Neural networks for solving differential equations)☆14Apr 27, 2019Updated 6 years ago
- Mediapipe Face Mesh☆14Jun 24, 2022Updated 3 years ago
- A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidata☆169Nov 7, 2022Updated 3 years ago
- OpnEco is a Python3 project developed to aid content writers throughout the content writing process. By content writers, for content writ…☆21Feb 15, 2023Updated 3 years ago
- GUI useful to manually annotate text for Named Entity Recognition purposes☆14Jun 22, 2023Updated 2 years ago
- ☆17Oct 27, 2020Updated 5 years ago
- Making BERT stretchy. Semantic Elasticsearch with Sentence Transformers☆161Sep 25, 2020Updated 5 years ago
- PassivePy: A Tool to Automatically Identify Passive Voice in Big Text Data☆23Mar 6, 2024Updated 2 years ago
- Repository contains various Malayalam ASR based resources curated from multiple sources☆18Oct 1, 2021Updated 4 years ago
- A framework for detecting, highlighting and correcting grammatical errors on natural language text. Created by Prithiviraj Damodaran. Ope…☆1,573Feb 15, 2023Updated 3 years ago
- 🐍🍑 Python 3 library for managing, annotating, and converting natural language corpuses using popular formats (CoNLL, ELAN, Praat, CSV, …☆21Jun 26, 2024Updated last year
- I will be putting notebooks created for #100daysofnlp here☆20May 29, 2020Updated 5 years ago
- Shoonya - Platform to Annotate and label data at scale.☆64Oct 31, 2025Updated 4 months ago
- Pistol, Rifle, and Fire detection using yolov4-tiny in videos as well as images. Training code, dataset, and trained weight file availabl…☆21Oct 15, 2020Updated 5 years ago
- SpikeX - SpaCy Pipes for Knowledge Extraction☆403Jul 30, 2021Updated 4 years ago
- A library to synthesize text datasets using Large Language Models (LLM)☆152Jan 17, 2023Updated 3 years ago
- An e-learning platform built in python (django)☆23Oct 24, 2024Updated last year
- Corresponding code repo for the paper at COLING 2020 - ARGMIN 2020: "DebateSum: A large-scale argument mining and summarization dataset"☆55Dec 2, 2021Updated 4 years ago
- Fuzzy string matching, grouping, and evaluation.☆791Jul 10, 2025Updated 7 months ago
- source code of bison☆26Jul 20, 2020Updated 5 years ago
- Framework for zero-shot learning with knowledge graphs.☆113Mar 28, 2023Updated 2 years ago
- This project provides an unsupervised framework for mining and tagging quality phrases on text corpora with pretrained language models (K…☆173Feb 3, 2023Updated 3 years ago
- Search through Facebook Research's PyTorch BigGraph Wikidata-dataset with the Weaviate vector search engine☆31Dec 12, 2021Updated 4 years ago
- Vector Hub - Library for easy discovery, and consumption of State-of-the-art models to turn data into vectors. (text2vec, image2vec, vide…☆561Aug 20, 2024Updated last year
- Domain-specific BERT representation for Named Entity Recognition of lab protocol☆29Dec 25, 2020Updated 5 years ago
- A python module for English lemmatization and inflection.☆273Sep 14, 2023Updated 2 years ago
- A lightweight implementation of shapes drawn across a geo-temporal plane.☆12Jan 27, 2026Updated last month
- edaSQL is a python library to bridge the SQL with Exploratory Data Analysis where you can connect to the Database and insert the queries.…☆10Nov 14, 2021Updated 4 years ago
- Contains relevant notebooks for the hands-on NLP workshop for the Analytics India Magazine Plugin Conference -2020 Edition☆71May 29, 2020Updated 5 years ago
- Large Scale BERT Distillation☆33Mar 24, 2023Updated 2 years ago
- A Neural Language Style Transfer framework to transfer natural language text smoothly between fine-grained language styles like formal/ca…☆493Dec 12, 2023Updated 2 years ago
- mT5 model for question answering and question generation☆27Apr 2, 2021Updated 4 years ago
- Self-hosted automated receipt recognition system☆32Mar 4, 2018Updated 8 years ago
- A neural network that jointly part-of-speech tags and lemmatizes sentences, boosting accuracy for morphologically-rich languages (Czech, …☆34Apr 5, 2019Updated 6 years ago