ucbepic/TWIX

TWIX is an open-source data extraction tool that reconstructs structured data from documents at scale, accurately and at low cost, by inferring the visual template that the documents share.

☆226

Alternatives and similar repositories for TWIX

Users that are interested in TWIX are comparing it to the libraries listed below.

MaximeRivest / attachments
Easiest way to give context to LLMs; Attachments has the ambition to be the general funnel for any files to be transformed into images+te…
☆367 · Jun 10, 2026 · Updated last month
MaximeRivest / moereport
☆19 · Aug 23, 2025 · Updated 10 months ago
ucbepic / docetl
A system for agentic LLM-powered data processing and ETL
☆3,909 · Updated this week
Archelunch / vibe-dspy
☆55 · Aug 22, 2025 · Updated 10 months ago
Ziems / arbor
A framework for optimizing DSPy programs with RL
☆340 · Jan 12, 2026 · Updated 6 months ago
MaximeRivest / dspy-lm-auth
☆32 · Mar 11, 2026 · Updated 4 months ago
the8472 / reapfrog
Rust library for multi-file readahead / dropbehind
☆14 · May 31, 2017 · Updated 9 years ago
ucbepic / BARGAIN
Low-Cost LLM-Powered Data Processing with Theoretical Guarantees
☆42 · Jun 11, 2026 · Updated last month
MaximeRivest / ovllm
☆39 · Aug 4, 2025 · Updated 11 months ago
eyelevelai / groundx-on-prem
A Kubernetes deployable instance of GroundX for document parsing, storage, and search.
☆816 · Updated this week
getolive / olive-cli
olive-cli: a minimal llm-based operating system for engineers packaged as a terminal app.
☆20 · Jun 13, 2025 · Updated last year
swyxio / chrometaboverflow
manage your chrome tab overload in markdown
☆76 · Dec 29, 2025 · Updated 6 months ago
delightful-ai / beads-rs
A distributed work-item database for agent swarms, using git as the sync layer
☆24 · Updated this week
HazAT / pi-find-forks
Scan GitHub forks of your repo for patterns worth upstreaming. A Pi extension.
☆16 · Apr 17, 2026 · Updated 3 months ago
clchinkc / streamlit-editor
Personal project, Generative AI, Streamlit, Python
☆53 · Apr 30, 2025 · Updated last year
MaximeRivest / funnydspy
Vanilla-Python ergonomics on top of DSPy
☆40 · Jun 3, 2025 · Updated last year
drmingler / smart-llm-loader
smart-llm-loader is a lightweight yet powerful Python package that transforms any document into LLM-ready chunks. Spend less time on prep…
☆75 · Nov 14, 2025 · Updated 8 months ago
jxmorris12 / embzip
lossily compress representation vectors using product quantization
☆59 · Oct 28, 2025 · Updated 8 months ago
zhudotexe / redel
ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo)
☆94 · May 11, 2026 · Updated 2 months ago
run-llama / semtools
Semantic search and document parsing tools for the command line
☆1,837 · Mar 11, 2026 · Updated 4 months ago
microsoft / typeagent-py
Structured RAG: ingest, index, query
☆875 · Updated this week
WolframRavenwolf / AI-Hotkeys
Transform your CapsLock into an AI key! This AutoHotkey app puts powerful AI capabilities right at your fingertips, supercharging your Wi…
☆22 · Oct 31, 2025 · Updated 8 months ago
lotus-data / lotus
Optimized Agentic and LLM Bulk Processing Over Your Data
☆1,653 · Jul 3, 2026 · Updated 2 weeks ago
dario-vazquez-albacete / GraphSupplyChain
This project showcases a comprehensive analysis of CO2 emissions in a fictitious cheese manufacturing supply chain using both graph datab…
☆11 · Sep 18, 2024 · Updated last year
langstruct-ai / langstruct
Extract structured data from any content using LLMs.
☆127 · Dec 1, 2025 · Updated 7 months ago
SALT-NLP / SynthesizeMe
☆36 · Jun 10, 2025 · Updated last year
nielsgl / dspy-profiles
DSPy Profile Manager
☆25 · Oct 9, 2025 · Updated 9 months ago
jxnl / mit-lecture
☆10 · Feb 25, 2025 · Updated last year
intertwine / dspy-agent-skills
Production-grade DSPy 3.2.x agent skills + validated end-to-end examples for Claude Code and Codex CLI — fundamentals, evaluation, GEPA, …
☆264 · Jun 20, 2026 · Updated last month
HomebrewML / Olmax
HomebrewNLP in JAX flavour for maintable TPU-Training
☆50 · Jan 20, 2024 · Updated 2 years ago
landing-ai / agentic-doc
Legacy Python library for Agentic Document Extraction (ADE). Use the landingai-ade library for all new projects.
☆2,394 · Mar 24, 2026 · Updated 3 months ago
PolicyEngine / policyengine-taxsim
TAXSIM emulator using the PolicyEngine US federal and state tax calculator
☆18 · Updated this week
kuzudb / baml-kuzu-demo
Demo of knowledge graph creation and Graph RAG with BAML and Kuzu
☆73 · Sep 17, 2025 · Updated 10 months ago
OpenPipe / ART
Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement…
☆10,505 · Updated this week
gepa-ai / gepa
Optimize prompts, code, and more with AI-powered Reflective Optimization
☆5,714 · Updated this week
AnthonyRonning / pi-ax-model-optimization
☆36 · Apr 25, 2026 · Updated 2 months ago
Michaelliv / dripline
💧 Query mode for agents
☆102 · May 11, 2026 · Updated 2 months ago
kostyay / pi-k-excalidraw
Native Excalidraw diagram preview tool for pi — draw and save diagrams from the agent with a live glimpse webview.
☆62 · May 3, 2026 · Updated 2 months ago
mitdbg / palimpzest
A System for Optimized Semantic Computation
☆230 · Updated this week