shahrukhx01/multilingual-pdf2text

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/shahrukhx01/multilingual-pdf2text)

shahrukhx01 / multilingual-pdf2text

A python library for extracting text from PDFs without losing the formatting of the PDF content.

☆78

Alternatives and similar repositories for multilingual-pdf2text

Users that are interested in multilingual-pdf2text are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

microsoft / dataflow2text
View on GitHub
Code for "The Whole Truth and Nothing But the Truth: Faithful and Controllable Dialogue Response Generation with Dataflow Transduction an…
☆10Apr 30, 2024Updated 2 years ago
karndeb / Arxiv-Neural-Search
View on GitHub
Neural Search System on Arxiv AI/ML Papers
☆54Aug 4, 2021Updated 4 years ago
sahyagiri / DistinctKeywords
View on GitHub
semantically distinct key phrase extraction using hilbert hashes.
☆51Feb 28, 2022Updated 4 years ago
megagonlabs / tagruler
View on GitHub
Data programming by demonstration for information extraction and span annotation
☆34Sep 9, 2021Updated 4 years ago
md-experiments / elastic_transformers
View on GitHub
Making BERT stretchy. Semantic Elasticsearch with Sentence Transformers
☆160Sep 25, 2020Updated 5 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Wadaboa / ner-annotator
View on GitHub
GUI useful to manually annotate text for Named Entity Recognition purposes
☆14Jun 22, 2023Updated 3 years ago
DevinJake / NS-CQA
View on GitHub
NS-CQA: the model of the JWS paper 'Less is More: Data-Efficient Complex Question Answering over Knowledge Bases.' This work has been acc…
☆22Jan 6, 2021Updated 5 years ago
UB-Mannheim / bbw
View on GitHub
Entity linking, entity typing and relation extraction: Matching CSV to a Wikibase instance (e.g., Wikidata) via Meta-lookup
☆71Jun 9, 2025Updated last year
PyThaiNLP / thai-g2p-wiktionary-corpus
View on GitHub
Thai Grapheme to Phoneme (G2P) Wiktionary Corpus
☆13Jul 25, 2022Updated 4 years ago
Lucaterre / spacyfishing
View on GitHub
A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidata
☆173Nov 7, 2022Updated 3 years ago
allenai / ACCoRD
View on GitHub
☆19May 13, 2022Updated 4 years ago
Layout-Parser / annotation-service
View on GitHub
☆20Jul 22, 2021Updated 5 years ago
PrithivirajDamodaran / Gramformer
View on GitHub
A framework for detecting, highlighting and correcting grammatical errors on natural language text. Created by Prithiviraj Damodaran. Ope…
☆1,586Feb 15, 2023Updated 3 years ago
peterbhase / ExplanationRoles
View on GitHub
Code for paper "When Can Models Learn From Explanations? A Formal Framework for Understanding the Roles of Explanation Data"
☆14Feb 16, 2021Updated 5 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
tejasvaidhyadev / NER_Lab_Protocols
View on GitHub
Domain-specific BERT representation for Named Entity Recognition of lab protocol
☆29Dec 25, 2020Updated 5 years ago
k-m-irfan / mediapipe_FaceMesh
View on GitHub
Mediapipe Face Mesh
☆14Jun 24, 2022Updated 4 years ago
blengerich / explainable-cnn
View on GitHub
Towards Visual Explanations for Convolutional Neural Networks via Input Resampling
☆13Aug 16, 2017Updated 8 years ago
sapped / flip
View on GitHub
Code for my personal site, flip.rip
☆11Mar 20, 2021Updated 5 years ago
simonguenther / AWS_Machine_Learning_Specialty_MLS-C01
View on GitHub
My detailed experience of taking Amazon's Machine Learning Specialty exam
☆15Aug 30, 2021Updated 4 years ago
openzipkin / pyramid_zipkin-example
View on GitHub
See how much time python services spend on an http request
☆14Feb 26, 2019Updated 7 years ago
giguru / converse
View on GitHub
☆11Oct 14, 2021Updated 4 years ago
amrrs / aitextgen_streamlit
View on GitHub
Streamlit-based Web App for Ai Text Generation based on GPT-2 Models from HuggingFace Model Hub using Python library aitextgen
☆27Nov 26, 2020Updated 5 years ago
naver-ai / talebrush
View on GitHub
The official source code for TaleBrush (CHI 2022)
☆17Jul 13, 2022Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
cadobe / bison
View on GitHub
source code of bison
☆26Jul 20, 2020Updated 6 years ago
MaartenGr / PolyFuzz
View on GitHub
Fuzzy string matching, grouping, and evaluation.
☆801Jul 10, 2025Updated last year
theblackcat102 / language-models-are-knowledge-graphs-pytorch
View on GitHub
Language models are open knowledge graphs ( non official implementation )
☆170Nov 14, 2020Updated 5 years ago
bhavsarpratik / vaccine_availability
View on GitHub
Get vaccine availability in India
☆25May 16, 2021Updated 5 years ago
naitian / Condolence-Empathy-Online-Communities
View on GitHub
Repository for "Condolence and Empathy in Online Communities", EMNLP 2020
☆10Nov 9, 2020Updated 5 years ago
RelevanceAI / vectorhub
View on GitHub
Vector Hub - Library for easy discovery, and consumption of State-of-the-art models to turn data into vectors. (text2vec, image2vec, vide…
☆560Aug 20, 2024Updated last year
Nardien / KALA
View on GitHub
Official Code Repository for the paper "KALA: Knowledge-Augmented Language Model Adaptation" (NAACL 2022)
☆35Oct 17, 2023Updated 2 years ago
BinWang28 / Sentence-Embedding-S3E
View on GitHub
Efficient Sentence Embedding via Semantic Subspace Analysis
☆14Feb 25, 2020Updated 6 years ago
socialmediaie / pytail
View on GitHub
PyTAIL - Interactive and Incremental Learning of NLP Models with Human in the Loop for Online Data
☆13Dec 3, 2022Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
chen42 / RatsPub
View on GitHub
Using PubMed to find out how a gene contributes to addiction.
☆20Dec 27, 2022Updated 3 years ago
microsoft / Litmus
View on GitHub
AI Assistant for Building Reliable, High-performing and Fair Multilingual NLP Systems
☆48Aug 19, 2022Updated 3 years ago
oceanumeric / EnteRAG
View on GitHub
A RAG that can scale 🧑🏻‍💻
☆11May 28, 2024Updated 2 years ago
insidersolutions / weka-mnb-sentiment-analysis-template-project
View on GitHub
The template project for three way and five way sentiment classification
☆11Nov 16, 2016Updated 9 years ago
minyong-shin / Bloging
View on GitHub
블로그에 업로드된 자료에 대한 코드를 공유하고 있습니다.
☆14Feb 23, 2020Updated 6 years ago
georgetown-cset / ai-relevant-papers
View on GitHub
Replication materials for "Identifying the Development and Application of Artificial Intelligence in Scientific Text"
☆14Feb 18, 2020Updated 6 years ago
koursaros-ai / nboost
View on GitHub
NBoost is a scalable, search-api-boosting platform for deploying transformer models to improve the relevance of search results on differe…
☆673Sep 30, 2020Updated 5 years ago