DS4SD / SemTabNetLinks

Repository for ACL paper: "Statements: Universal Information Extraction from Tables with Large Language Models for ESG KPIs"

☆13

Alternatives and similar repositories for SemTabNet

Users that are interested in SemTabNet are comparing it to the libraries listed below

Sorting:

docling-project / docling-sdg
A set of tools to create synthetically-generated data from documents
☆20Updated last month
DS4SD / deepsearch-glm
Create fast graph language models from converted PDF documents for knowledge extraction and Q&A.
☆55Updated 5 months ago
LAION-AI / bud-e
A general human-ai interaction platform.
☆15Updated 6 months ago
IBM / torchlogic
torchlogic is a pytorch framework for developing Neuro-Symbolic AI systems and implements Neural Reasoning Networks.
☆10Updated 2 months ago
LAION-AI / Desktop-BUD-E_V1.0
BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…
☆20Updated 9 months ago
ibm-granite / granite-embedding-models
☆29Updated 3 weeks ago
agno-agi / personalized-agentic-rag
☆11Updated last year
ibm-granite / granite-3.3-language-models
Granite 3.3 repository
☆17Updated 3 weeks ago
DS4SD / quackling
Build document-native LLM applications
☆53Updated 10 months ago
robbiemu / llama-gguf-optimize
Scripts and tools for optimizing quantizations in llama.cpp with GGUF imatrices.
☆14Updated 6 months ago
slashml / awesome-finetuning
☆28Updated 10 months ago
kyegomez / OpenStrawberry
An open source replication of the stawberry method that leverages Monte Carlo Search with PPO and or DPO
☆30Updated this week
docling-project / docling-eval
Evaluation framework for document processing models and services.
☆24Updated this week
louisbrulenaudet / ragoon
High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡
☆66Updated 8 months ago
tiiuae / onebitllms
Lightweight toolkit package to train and fine-tune 1.58bit Language models
☆80Updated 2 months ago
wizenheimer / cyyrus
Transform Unstructured Data into Synthetic Datasets
☆27Updated 10 months ago
IBM / InspectorRAGet
The repository contains generative AI analytics platform application code.
☆26Updated 2 months ago
allenai / olmo-cookbook
OLMost every training recipe you need to perform data interventions with the OLMo family of models.
☆36Updated this week
DS4SD / deepsearch-examples
Examples using the Deep Search functionalities
☆81Updated 5 months ago
nova-land / gbnf-compiler
Plug n Play GBNF Compiler for llama.cpp
☆26Updated last year
shivamsanju / ragswift
🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform
☆38Updated last year
docling-project / docling-langchain
Docling LangChain integration
☆32Updated last month
agokrani / distillKitPlus
Easy to use, High Performant Knowledge Distillation for LLMs
☆88Updated 2 months ago
docling-project / docling-ibm-models
☆130Updated last week
Knowledgator / unlimited_classifier
Universal text classifier for generative models
☆24Updated 11 months ago
calcuis / gguf-connector
gguf (GPT-Generated Unified Format) connector
☆19Updated this week
flowaicom / flow-judge
Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte…
☆74Updated 8 months ago
ritabratamaiti / AnyModal
AnyModal is a Flexible Multimodal Language Model Framework for PyTorch
☆100Updated 6 months ago
matthewrenze / jhu-concise-cot
The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models
☆22Updated 7 months ago
huggingface / hf-endpoints-documentation
☆17Updated this week