DS4SD / SemTabNetLinks
Repository for ACL paper: "Statements: Universal Information Extraction from Tables with Large Language Models for ESG KPIs"
☆17Updated last year
Alternatives and similar repositories for SemTabNet
Users that are interested in SemTabNet are comparing it to the libraries listed below
Sorting:
- Create fast graph language models from converted PDF documents for knowledge extraction and Q&A.☆58Updated last year
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆69Updated 2 months ago
- Easy to use, High Performant Knowledge Distillation for LLMs☆97Updated 9 months ago
- Universal text classifier for generative models☆24Updated last year
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆34Updated last year
- entropix style sampling + GUI☆27Updated last year
- Dataset Viber is your chill repo for data collection, annotation and vibe checks.☆47Updated last year
- Using open source LLMs to build synthetic datasets for direct preference optimization☆72Updated last year
- Lightweight continuous batching OpenAI compatibility using HuggingFace Transformers include T5 and Whisper.☆29Updated 10 months ago
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆24Updated last year
- [TACL, EMNLP 2025 Oral] Code, datasets, and checkpoints for the paper "CRAFT Your Dataset: Task-Specific Synthetic Dataset Generation Thr…☆33Updated 2 months ago
- ☆141Updated 5 months ago
- One Line To Build Zero-Data Classifiers in Minutes☆63Updated last year
- Code for the EMNLP'24 paper "Learning to Extract Structured Entities Using Language Models"☆49Updated 10 months ago
- ☆68Updated last year
- This project is a collection of fine-tuning scripts to help researchers fine-tune Qwen 2 VL on HuggingFace datasets.☆77Updated 6 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆51Updated last year
- ☆53Updated 3 months ago
- Python library to use Pleias-RAG models☆68Updated 9 months ago
- ☆120Updated last year
- A framework for evaluating function calls made by LLMs☆40Updated last year
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆150Updated last month
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch☆103Updated last year
- ☆56Updated last year
- ☆54Updated 3 weeks ago
- ☆53Updated 11 months ago
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte…☆84Updated last year
- Efficient few-shot learning with cross-encoders.☆62Updated last year
- Forces DeepSeek R1 models to engage in extended reasoning by intercepting early termination tokens.☆19Updated 11 months ago
- DocLLM: A layout-aware generative language model for multimodal document understanding☆137Updated 2 years ago