A full codebase for replicating the results of Nougat from downloading arXiv dataset to the final evaluation. It also contains a few fixes to the original codebase.
☆11Dec 11, 2023Updated 2 years ago
Alternatives and similar repositories for nougat-replication
Users that are interested in nougat-replication are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Datasets and Evaluation Scripts for CompHRDoc☆57Feb 25, 2025Updated last year
- ☆19Sep 1, 2022Updated 3 years ago
- Mixture of Experts from scratch☆13Apr 12, 2024Updated last year
- Codebase for fine-tuning / evaluating nougat-based image2latex generation models☆160Sep 25, 2024Updated last year
- Pino configuration for Google Cloud Platform. Enabled structured logging!☆19Mar 13, 2026Updated last week
- Patched Next.js to have full logs via pino☆15Mar 19, 2024Updated 2 years ago
- Location Predictor 📍☆16Mar 16, 2026Updated last week
- Official code repository for the main conference paper in EMNLP 2022: SubeventWriter: Iterative Sub-event Sequence Generation with Cohere…☆11Oct 16, 2022Updated 3 years ago
- ☆11Nov 29, 2024Updated last year
- ☆14Jul 6, 2022Updated 3 years ago
- download html paper to word format☆16Nov 16, 2022Updated 3 years ago
- Official code repository for Findings of EMNLP 2022 paper: PseudoReasoner: Leveraging Pseudo Labels for Commonsense Knowledge Base Popula…☆11Oct 18, 2022Updated 3 years ago
- Fork of dhSegment for experiments on visual and textual feature combination.☆15Jan 30, 2021Updated 5 years ago
- Code for EMNLP 2020 paper: Analogous Process Structure Induction for Sub-event Sequence Prediction☆11Oct 19, 2020Updated 5 years ago
- GenFlowChart is a framework that implements flowchart parsing using generative AI. Leveraging SAM for segmentation and OCR for text extra…☆32Jun 5, 2024Updated last year
- Official code repository for the paper: AbsPyramid: Benchmarking the Abstration Ability of Language Models with a Unified Entailment Grap…☆13Oct 30, 2024Updated last year
- This is the code repo for Findings of EMNLP2022 paper: MICO: a multi-alternative contrastive learning framework for commonsense knowledg…☆10Nov 29, 2022Updated 3 years ago
- 🍨 Gelato — From Data Curation to Reinforcement Learning: Building a Strong Grounding Model for Computer-Use Agents☆39Dec 22, 2025Updated 3 months ago
- 番茄简谱后端开源实现☆18Oct 17, 2024Updated last year
- Code and data for EMNLP 2023 paper "Grounding Visual Illusions in Language: Do Vision-Language Models Perceive Illusions Like Humans?"☆15Jan 25, 2024Updated 2 years ago
- Source code for the paper 'Complex Hyperbolic Knowledge Graph Embeddings with Fast Fourier Transform'.☆12Nov 9, 2022Updated 3 years ago
- Code for the ACL2023 paper: CAT: A Contextualized Conceptualization and Instantiation Framework for Commonsense Reasoning (https://aclant…☆11May 9, 2023Updated 2 years ago
- A Modern Text-based User Interface for ChatGPT.☆13Jul 25, 2023Updated 2 years ago
- Simple Inventory CRUD Application using spring-boot - kafka - mongoDB☆16Oct 24, 2024Updated last year
- Code for ACL 2023 paper "A Close Look into the Calibration of Pre-trained Language Models"☆11May 9, 2023Updated 2 years ago
- A simple JS script to register desired course when slots are available, for UM-SJTU JI students.☆12May 9, 2022Updated 3 years ago
- PANiC - PAraphrasing Noun-Compounds☆15Apr 6, 2018Updated 7 years ago
- (WIP) Ams Date Picker - A modern, magical, and unstyled date picker for React. We have your favorite Time Machine and Input Supercharge o…☆32Feb 17, 2023Updated 3 years ago
- Moatless Testbeds allows you to create isolated testbed environments in a Kubernetes cluster where you can apply code changes through git…☆14Apr 9, 2025Updated 11 months ago
- ☆31Jan 17, 2026Updated 2 months ago
- A method for evaluating the high-level coherence of machine-generated texts. Identifies high-level coherence issues in transformer-based …☆11Mar 18, 2023Updated 3 years ago
- Train large COMET (T5-3B/GPT2-XL) with small memory (on 11GB memory GPUs like 1080/2080) using DeepSpeed.☆14Jan 23, 2022Updated 4 years ago
- Flux training codes (lora) for UniTEX☆24Jun 8, 2025Updated 9 months ago
- This project aims to generate syntactichandwritten mathematical expression. The dataset is generated from the CROHME 2014 training set.☆14Feb 24, 2022Updated 4 years ago
- The corresponding code from our paper "Social Commonsense Reasoning with Multi-Head Knowledge Attention (EMNLP 2020)". Do not hesitate to…☆11Jun 12, 2022Updated 3 years ago
- Code for our work "Read, Highlight and Summarize: A Hierarchical Neural Semantic Encoder-based Approach"☆10Oct 28, 2019Updated 6 years ago
- Efficient Expert Pruning for Sparse Mixture-of-Experts Language Models: Enhancing Performance and Reducing Inference Costs☆23Nov 11, 2025Updated 4 months ago
- Historical shortest-path distance querying index by pruned landmark labeling☆10May 24, 2014Updated 11 years ago
- DocBankLoader is a dataset loader for DocBank, and can convert DocBank to the Object Detection models' format.☆25Mar 17, 2021Updated 5 years ago