ucbepic / TWIXLinks
TWIX is an open-source data extraction tool that reconstructs structured data from documents at scale, accurately and at low cost, by inferring the shared underlying visual template across documents
☆211Updated 2 months ago
Alternatives and similar repositories for TWIX
Users that are interested in TWIX are comparing it to the libraries listed below
Sorting:
- Easiest way to give context to LLMs; Attachments has the ambition to be the general funnel for any files to be transformed into images+te…☆348Updated 4 months ago
- Deep Research for your internal data☆351Updated 8 months ago
- A user interface for DSPy☆210Updated 4 months ago
- A framework for optimizing DSPy programs with RL☆308Updated 3 weeks ago
- Kura is a simple reproduction of the CLIO paper which uses language models to label user behaviour before clustering them based on embedd…☆417Updated 5 months ago
- ☆114Updated 7 months ago
- Open-source versioning, tracing, and annotation tooling.☆214Updated 2 weeks ago
- FastAPI wrapper around DSPy☆291Updated last year
- Structured information extraction from documents☆318Updated last year
- Minimal agent runtime built with DSPy modules and a thin Python loop. Includes CLI, FastAPI server, and eval harness with OpenAI/Ollama s…☆68Updated last month
- Simple UI for debugging correlations of text embeddings☆305Updated 8 months ago
- A comprehensive 0-to-1 guide for building self-improving LLM applications with DSPy framework☆202Updated 3 weeks ago
- How to build the best search, one step at a time!☆233Updated 2 months ago
- Metadspy: The framework for specifying—not programming—language models☆88Updated 7 months ago
- Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. 👨🏻🍳☆352Updated 8 months ago
- Solving data for LLMs - Create quality synthetic datasets!☆151Updated last year
- Together Open Deep Research☆358Updated 9 months ago
- ☆274Updated 2 weeks ago
- Research repository on interfacing LLMs with Weaviate APIs. Inspired by the Berkeley Gorilla LLM.☆140Updated 5 months ago
- An experiment in meeting transcription and diarization with just an LLM. Maybe I went a little overboard though☆567Updated 2 months ago
- A reimplementation of langgraph's customer support example in Rasa's CALM paradigm and a quantiative evaluation of the 2 approaches☆81Updated 10 months ago
- ☆85Updated 5 months ago
- A tool kit for generating high quality prompts using DSPy GEPA optimizer☆298Updated last week
- Terminal-based AI Coding Agent, similar to Claude Code, OpenAI Codex etc. but works with many more LLMs e.g. Gemini, Groq, Deepseek☆151Updated 9 months ago
- RAG evaluation without the need for "golden answers"☆338Updated last month
- Vibe-coding tools for the LlamaIndex ecosystem☆176Updated 3 months ago
- ☆104Updated last year
- A Kubernetes deployable instance of GroundX for document parsing, storage, and search.☆801Updated last week
- A non-official CLI for Llama Index Parser☆216Updated last year
- Context Engineering Course with DSPy☆214Updated 6 months ago