shahrukhx01 / multilingual-pdf2text
A Python library for extracting text from PDFs without losing the formatting of the PDF content.
☆79 · Jan 11, 2022 · Updated 4 years ago
Alternatives and similar repositories for multilingual-pdf2text
Users who are interested in multilingual-pdf2text are comparing it to the libraries listed below.
- semantically distinct key phrase extraction using hilbert hashes. ☆51 · Feb 28, 2022 · Updated 3 years ago
- Code for "The Whole Truth and Nothing But the Truth: Faithful and Controllable Dialogue Response Generation with Dataflow Transduction an… ☆11 · Apr 30, 2024 · Updated last year
- Streamlit-based Web App for Ai Text Generation based on GPT-2 Models from HuggingFace Model Hub using Python library aitextgen ☆27 · Nov 26, 2020 · Updated 5 years ago
- 🚀 This repo is a showcase of how you can use models deployed on AWS SageMaker in your Haystack Retrieval Augmented Generative AI pipelin… ☆13 · Jul 27, 2023 · Updated 2 years ago
- A repository containing comprehensive Neural Networks based PyTorch implementations for the semantic text similarity task, including arch… ☆53 · Mar 10, 2022 · Updated 3 years ago
- ☆20 · Jul 22, 2021 · Updated 4 years ago
- OpnEco is a Python3 project developed to aid content writers throughout the content writing process. By content writers, for content writ… ☆21 · Feb 15, 2023 · Updated 3 years ago
- ☆16 · Oct 27, 2020 · Updated 5 years ago
- Making BERT stretchy. Semantic Elasticsearch with Sentence Transformers ☆161 · Sep 25, 2020 · Updated 5 years ago
- A Python implementation of the uncertainty classifier, based on the work of Veronika Vincze. ☆17 · Aug 20, 2024 · Updated last year
- Repository contains various Malayalam ASR based resources curated from multiple sources ☆18 · Oct 1, 2021 · Updated 4 years ago
- PassivePy: A Tool to Automatically Identify Passive Voice in Big Text Data ☆23 · Mar 6, 2024 · Updated last year
- A framework for detecting, highlighting and correcting grammatical errors on natural language text. Created by Prithiviraj Damodaran. Ope… ☆1,569 · Feb 15, 2023 · Updated 3 years ago
- NS-CQA: the model of the JWS paper 'Less is More: Data-Efficient Complex Question Answering over Knowledge Bases.' This work has been acc… ☆22 · Jan 6, 2021 · Updated 5 years ago
- Source Code for paper "NERO: A Neural Rule Grounding Framework for Label-Efficient Relation Extraction", WWW 2020 ☆47 · May 6, 2020 · Updated 5 years ago
- SpikeX - SpaCy Pipes for Knowledge Extraction ☆402 · Jul 30, 2021 · Updated 4 years ago
- A library to synthesize text datasets using Large Language Models (LLM) ☆152 · Jan 17, 2023 · Updated 3 years ago
- Corresponding code repo for the paper at COLING 2020 - ARGMIN 2020: "DebateSum: A large-scale argument mining and summarization dataset" ☆55 · Dec 2, 2021 · Updated 4 years ago
- Detect the Language of Text ☆52 · Jan 15, 2016 · Updated 10 years ago
- Fuzzy string matching, grouping, and evaluation. ☆788 · Jul 10, 2025 · Updated 7 months ago
- Knowledge pills on Neural Search ☆27 · May 8, 2023 · Updated 2 years ago
- Get vaccine availability in India ☆25 · May 16, 2021 · Updated 4 years ago
- Framework for zero-shot learning with knowledge graphs. ☆113 · Mar 28, 2023 · Updated 2 years ago
- This project provides an unsupervised framework for mining and tagging quality phrases on text corpora with pretrained language models (K… ☆174 · Feb 3, 2023 · Updated 3 years ago
- Search through Facebook Research's PyTorch BigGraph Wikidata-dataset with the Weaviate vector search engine ☆31 · Dec 12, 2021 · Updated 4 years ago
- Asent is a python library for performing efficient and transparent sentiment analysis using spaCy. ☆120 · Oct 20, 2025 · Updated 3 months ago
- Vector Hub - Library for easy discovery, and consumption of State-of-the-art models to turn data into vectors. (text2vec, image2vec, vide… ☆561 · Aug 20, 2024 · Updated last year
- Domain-specific BERT representation for Named Entity Recognition of lab protocol ☆29 · Dec 25, 2020 · Updated 5 years ago
- A python module for English lemmatization and inflection. ☆273 · Sep 14, 2023 · Updated 2 years ago
- 🍳 NLPrep - dataset tool for many natural language processing task ☆28 · Jul 30, 2021 · Updated 4 years ago
- A lightweight implementation of shapes drawn across a geo-temporal plane. ☆12 · Jan 27, 2026 · Updated 2 weeks ago
- Contains relevant notebooks for the hands-on NLP workshop for the Analytics India Magazine Plugin Conference -2020 Edition ☆71 · May 29, 2020 · Updated 5 years ago
- Large Scale BERT Distillation ☆33 · Mar 24, 2023 · Updated 2 years ago
- TensorFlow implementation of "Detecting Incongruity Between News Headline and Body Text via a Deep Hierarchical Encoder," AAAI-19 ☆38 · Jun 17, 2024 · Updated last year
- A Neural Language Style Transfer framework to transfer natural language text smoothly between fine-grained language styles like formal/ca… ☆493 · Dec 12, 2023 · Updated 2 years ago
- NBoost is a scalable, search-api-boosting platform for deploying transformer models to improve the relevance of search results on differe… ☆674 · Sep 30, 2020 · Updated 5 years ago
- mT5 model for question answering and question generation ☆27 · Apr 2, 2021 · Updated 4 years ago
- Self-hosted automated receipt recognition system ☆32 · Mar 4, 2018 · Updated 7 years ago
- This is a repository for georeferencing of pushbroom hyperspectral imagery and includes ray-intersection, orthorectification and a coregi… ☆11 · Oct 23, 2024 · Updated last year