baulbo/Diard

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/baulbo/Diard)

baulbo / Diard

From document (PDF) or document images to analysis ready semi-structured data.

☆20

Alternatives and similar repositories for Diard

Users that are interested in Diard are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Altabeh / tesseract-ocr-wrapper
View on GitHub
This is a highly efficient python wrapper for tesseract-ocr.
☆26May 19, 2022Updated 4 years ago
baulbo / table-transformer-simple-inference
View on GitHub
Simple table extraction example.
☆10Jun 26, 2022Updated 4 years ago
saichandrareddy1 / oxygenjs
View on GitHub
This a JavaScript Library for the Numerical Javascript and Machine Learning
☆14May 12, 2021Updated 5 years ago
YFChiu / Resources--Databricks-Certified-Associate-Developer-for-Apache-Spark-3.0
View on GitHub
(Python, PySpark)
☆10Nov 15, 2020Updated 5 years ago
h-munakata / Lighthouse-Wrapper-for-Audio-Moment-Retrieval
View on GitHub
☆13Mar 23, 2026Updated 4 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
wutong8023 / SpeechRE
View on GitHub
☆11Nov 11, 2022Updated 3 years ago
snapthat / TF-T5-text-to-text
View on GitHub
This repository demonstrate training T5 transformers using tensorflow 2
☆14Oct 1, 2020Updated 5 years ago
betterstack-community / projects
View on GitHub
Curated list of awesome projects built on top of Better Stack
☆15Mar 15, 2024Updated 2 years ago
XL2248 / SOV-MAS
View on GitHub
The code and data for "Summary-Oriented Vision Modeling for Multimodal Abstractive Summarization"
☆11May 16, 2023Updated 3 years ago
MengboLi / MS-SENet
View on GitHub
☆11Jul 16, 2024Updated 2 years ago
mengshiY / RCSF
View on GitHub
Code for paper "Cross-Domain Slot Filling as Machine Reading Comprehension" in IJCAI 2021
☆11Aug 24, 2021Updated 4 years ago
Sreyan88 / RECAP
View on GitHub
Code for ICASSP 2024 Paper: RECAP: Retrieval-Augmented Audio Captioning
☆16Jun 23, 2024Updated 2 years ago
yangjingyuan / ConstDecoder
View on GitHub
☆11Oct 24, 2022Updated 3 years ago
may- / joeys2t
View on GitHub
Minimalist Speech-to-Text toolkit for educational purposes
☆13Feb 1, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
alexpovel / betterletter
View on GitHub
Substitute alternative spellings of special characters (e.g. German umlauts [ae, oe, ue] and [ss]) with their correct versions (ä, ö, ü, …
☆11Nov 24, 2024Updated last year
dksanyal / SpERT.PL
View on GitHub
Joint Neural Model for Entity & Relation Extraction
☆16Oct 18, 2021Updated 4 years ago
Sreyan88 / LipGER
View on GitHub
Code for InterSpeech 2024 Paper: LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition
☆19Jul 16, 2024Updated 2 years ago
cambridge-mlg / LITE
View on GitHub
Code for "Memory Efficient Meta-Learning with Large Images"
☆11Nov 24, 2021Updated 4 years ago
Speech-Lab-IITM / data2vec-aqc
View on GitHub
Repository having the code and models from the paper: data2vec-aqc: Search for the right Teaching Assistant in the Teacher-Student traini…
☆13Mar 18, 2024Updated 2 years ago
rpeloff / multimodal_one_shot_learning
View on GitHub
Code recipe for "Multimodal One-Shot Learning of Speech and Images"
☆11Nov 21, 2018Updated 7 years ago
guangkun0818 / speech2text
View on GitHub
Speech understanding system training toolkit, including tasks of ASR, SSL, LM, etc.
☆12Feb 12, 2026Updated 5 months ago
vagos / llm-clap
View on GitHub
Generate embeddings for audio files (music, speech, sounds) and text using CLAP with llm
☆22May 15, 2025Updated last year
W-Wu / DEER
View on GitHub
☆12Aug 25, 2023Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
MaitySubhajit / SelfDocSeg
View on GitHub
[ICDAR 2023] SelfDocSeg: A self-supervised vision-based approach towards Document Segmentation (Oral)
☆43Oct 6, 2023Updated 2 years ago
ufal / MLASK
View on GitHub
EACL 2023 paper "MLASK: Multimodal Summarization of Video-based News Articles"
☆11Nov 7, 2023Updated 2 years ago
Janie1996 / MSRFG
View on GitHub
The code for Multi-Scale Receptive Field Graph Model for Emotion Recognition in Conversations
☆11Jan 17, 2023Updated 3 years ago
skit-ai / N-Best-ASR-Transformer
View on GitHub
Code for ACL-IJCNLP 2021 paper "N-Best-ASR-Transformer: Enhancing SLU Performance using Multiple ASR Hypotheses."
☆17Nov 30, 2021Updated 4 years ago
MiuLab / SpokenCSE
View on GitHub
Contrastive Learning for Improving ASR Robustness in Spoken Language Understanding
☆11May 19, 2023Updated 3 years ago
talsperre / LectureSummarizer
View on GitHub
A lecture summarization tool that uses AI and computer vision to summarize and index videos
☆10Dec 8, 2022Updated 3 years ago
ufal / augpt
View on GitHub
DSTC9 Submission
☆16Apr 12, 2021Updated 5 years ago
arkaprabha-majumdar / google-crawler
View on GitHub
☆13Oct 17, 2020Updated 5 years ago
razvan404 / multimodal-speech-emotion-recognition
View on GitHub
Multimodal SER Model meant to be trained on recognising emotions from speech (text + acoustic data). Fine-tuned the DeBERTaV3 model, resp…
☆11Jun 19, 2024Updated 2 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
duyet / google-search-crawler
View on GitHub
Input list of keywords, crawler top (url, title, description) data from Google Search.
☆17Jun 6, 2026Updated last month
Dalia-Sher / Speech-Emotion-Recognition-using-BLSTM-with-Attention
View on GitHub
We present a study of a neural network based method for speech emotion recognition, using audio-only features. In the studied scheme, the…
☆11Jul 24, 2024Updated 2 years ago
rithiksachdev / PostASR-Correction-SLT2024
View on GitHub
☆18Jul 22, 2024Updated 2 years ago
tzyll / ChineseHP
View on GitHub
Dataset for Pinyin Regularization in Error Correction for Chinese Speech Recognition with Large Language Models in Interspeech 2024.
☆16Jul 4, 2024Updated 2 years ago
cruvadom / Prediction_Intervals
View on GitHub
Computing calibrated prediction intervals for neural network regressors
☆10May 28, 2019Updated 7 years ago
isaacOnline / whisper
View on GitHub
Robust Speech Recognition via Large-Scale Weak Supervision
☆13Oct 28, 2023Updated 2 years ago
robd003 / sph2pipe
View on GitHub
provide SPHERE-formatted output as well as RIFF, AU, AIFF and raw
☆14Dec 18, 2021Updated 4 years ago