Leveraging LLMs for Post-OCR Correction of Historical Newspapers
☆15Jun 20, 2024Updated last year
Alternatives and similar repositories for llms_post-ocr_correction
Users that are interested in llms_post-ocr_correction are comparing it to the libraries listed below
- Python course materials for students of the "Computational Linguistics" continuing professional education program at HSE University (2020-2021)☆11Feb 21, 2022Updated 4 years ago
- Collection of useful React components☆10Apr 10, 2019Updated 6 years ago
- Source code for the paper "Post-OCR Document Correction with Large Ensembles of Character Sequence-to-Sequence Models"☆39Dec 2, 2023Updated 2 years ago
- Services and guidelines for normalizing drug and other therapy terms☆13Updated this week
- This is the repo for CROssBARv2 Knowledge Graph data. CROssBARv2 is a heterogeneous general-purpose biomedical KG-based system.☆11Feb 4, 2026Updated 3 weeks ago
- ☆10Aug 3, 2019Updated 6 years ago
- Molecular Data Provider☆10Dec 17, 2025Updated 2 months ago
- Simple tool for generating tokens with open source transformers and/or calculate per-token surprisal.☆14May 23, 2025Updated 9 months ago
- Neural architecture search framework based on reinforcement learning: "A Novel Approach to Detecting Muscle Fatigue Based on sEMG by Using…☆14Nov 22, 2024Updated last year
- Comparing Audio Features for Unsupervised Sound Classification☆10Jun 22, 2022Updated 3 years ago
- ☆11Feb 13, 2024Updated 2 years ago
- ACL'2024-Main: Synergetic Event Understanding: A Collaborative Approach to Cross-Document Event Coreference Resolution with Large Languag…☆12Sep 19, 2025Updated 5 months ago
- ☆11Nov 14, 2021Updated 4 years ago
- Official source code for the paper "Tailored Design of Audio-Visual Speech Recognition Models using Branchformers"☆14Feb 24, 2025Updated last year
- Decodes Compact Disc data from microscope images of a CD's surface☆12Jan 14, 2023Updated 3 years ago
- This repository contains code for classification of sound using spectrograms. We train a CNN to classify the sounds after converting to s…☆10Dec 14, 2018Updated 7 years ago
- Used in M4C feature extraction script: https://github.com/facebookresearch/mmf/blob/project/m4c/projects/M4C/scripts/extract_ocr_frcn_fea…☆13Jan 30, 2020Updated 6 years ago
- ☆15May 16, 2017Updated 8 years ago
- annotation storage backend☆11Apr 3, 2025Updated 10 months ago
- Transform audio files into mel spectrograms for text-to-speech model training☆12Aug 25, 2021Updated 4 years ago
- Hybrid GAN (HiFi-WaveGAN) applied to footsteps sound effects☆12Jul 17, 2023Updated 2 years ago
- Code and Results for the paper: A Revisiting Study of Appropriate Offline Evaluation for Top-𝑁 Recommendation Algorithms.☆11Mar 10, 2022Updated 3 years ago
- THUIR website☆10Feb 23, 2026Updated last week
- ☆10May 11, 2024Updated last year
- ☆10Dec 10, 2021Updated 4 years ago
- Morphological analysis for Udmurt.☆12Feb 17, 2026Updated last week
- Encode an image to sound (WAV file) and view it as a spectrogram. Optimized Python 3 version.☆11Jan 25, 2023Updated 3 years ago
- Python tool for batch visual question answering (BVQA).☆14Sep 18, 2025Updated 5 months ago
- Distribution of word meanings in Wikipedia for English, Italian, French, German and Spanish.☆10Jan 4, 2021Updated 5 years ago
- Know2BIO: A Comprehensive Dual-View Benchmark for Evolving Biomedical Knowledge Graphs☆14Feb 10, 2026Updated 3 weeks ago
- ☆10Jul 21, 2017Updated 8 years ago
- This is the official PyTorch implementation for the paper: "Directed Acyclic Graph Factorization Machines for CTR Prediction via Knowledg…☆14Mar 5, 2023Updated 2 years ago
- Part-of-speech tagging using BERT☆10Nov 14, 2019Updated 6 years ago
- Semantically Search Emojis From the Command Line!☆13Nov 26, 2023Updated 2 years ago
- 1D-CNN models for NAFLD diagnosis and liver fat fraction quantification using radiofrequency ultrasound signals☆12Jun 10, 2020Updated 5 years ago
- Audio generator using WaveGAN☆13Feb 9, 2024Updated 2 years ago
- Generates spectrogram from images☆13Apr 26, 2021Updated 4 years ago
- ☆14Aug 25, 2021Updated 4 years ago
- LLM-only topic extraction and classification☆11Sep 20, 2024Updated last year