ucbepic / TWIXLinks
TWIX is an open-source data extraction tool that reconstructs structured data from documents at scale, accurately and at low cost, by inferring the shared underlying visual template across documents
☆189Updated 3 weeks ago
Alternatives and similar repositories for TWIX
Users that are interested in TWIX are comparing it to the libraries listed below
Sorting:
- Easiest way to give context to LLMs; Attachments has the ambition to be the general funnel for any files to be transformed into images+te…☆190Updated this week
- Metadspy: The framework for specifying—not programming—language models☆82Updated this week
- ☆113Updated last week
- Deep Research for your internal data☆327Updated 2 weeks ago
- A user interface for DSPy☆160Updated last month
- Kura is a simple reproduction of the CLIO paper which uses language models to label user behaviour before clustering them based on embedd…☆204Updated this week
- Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. 👨🏻🍳☆313Updated 2 weeks ago
- Terminal-based AI Coding Agent, similar to Claude Code, OpenAI Codex etc. but works with many more LLMs e.g. Gemini, Groq, Deepseek☆135Updated last month
- ☆87Updated 4 months ago
- Helping you select an AI agent framework☆350Updated last week
- Solving data for LLMs - Create quality synthetic datasets!☆149Updated 5 months ago
- ☆105Updated 2 months ago
- Together Open Deep Research☆309Updated 2 months ago
- Claude Deep Research config for Claude Code.☆186Updated 3 months ago
- Optimize Document Retrieval with Fine-Tuned KnowledgeBases☆141Updated 3 months ago
- An experiment in meeting transcription and diarization with just an LLM. Maybe I went a little overboard though☆552Updated 3 weeks ago
- A non-official CLI for Llama Index Parser☆211Updated 11 months ago
- An MCP Server that's also an MCP Client. Useful for letting Claude develop and test MCPs without needing to reset the application.☆120Updated 3 months ago
- ☆86Updated 5 months ago
- ☆98Updated 6 months ago
- Structured information extraction from documents☆315Updated 8 months ago
- 🦄 ai that works - every tuesday 10 AM PST☆109Updated this week
- LLMap solves context extraction for large codebases☆96Updated 3 months ago
- FastAPI wrapper around DSPy☆247Updated last year
- ☆152Updated 6 months ago
- ☆103Updated 5 months ago
- Contains my configuration files☆272Updated 2 weeks ago
- ☆137Updated last week
- Sample applications built on the Graphlit Platform☆75Updated 2 months ago
- A fully customizable and self-hosted sandboxing solution for AI agent code execution and computer use. It features out-of-the-box support…☆493Updated 2 weeks ago