Structured Data Extractor for AI Agents. Search your documents or the web for specific data and get it back in JSON or Markdown in a single tool call.
☆186Jan 5, 2026Updated 4 months ago
Alternatives and similar repositories for open-extract
Users that are interested in open-extract are comparing it to the libraries listed below.
- A tool to assist in the interpretation of learned features in sparse autoencoders (in particular the four SAEs trained by Joseph Bloom o…☆19Oct 4, 2024Updated last year
- AceParse: A Comprehensive Dataset with Diverse Structured Texts for Academic Literature Parsing☆44Sep 17, 2024Updated last year
- Code and data releases for the paper -- DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory☆62Feb 10, 2025Updated last year
- The first dense retrieval model that can be prompted like an LM☆92May 8, 2025Updated 11 months ago
- Multi AI agent system for financial analysis with CrewAI☆38Updated this week
- Detect and extract tables to markdown and csv☆756Jan 24, 2025Updated last year
- TensorFlow code for our ECCV'24 Workshop paper "LightAvatar: Efficient Head Avatar as Dynamic NeLF"☆31Nov 7, 2024Updated last year
- Initialization using Update Approximation is a Silver Bullet for Extremely Efficient Low-Rank Fine-Tuning☆52Oct 17, 2025Updated 6 months ago
- Every system has its `telos´ — its final cause. This CLI fulfills the purpose of MindsDB's Knowledge Base: to seek, structure, and serve …☆21Dec 19, 2025Updated 4 months ago
- ☆252Oct 16, 2024Updated last year
- Friends of OLMo and their links.☆364Sep 15, 2025Updated 7 months ago
- Legacy project of an analytics platform for LLM-generated content☆439Jul 17, 2025Updated 9 months ago
- A benchmark for testing memorization abilities of LMs☆24Oct 15, 2024Updated last year
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).☆82Mar 18, 2024Updated 2 years ago
- The official implementation for Collaborative Word-based Pre-trained Item Representation for Transferable Recommendation.☆25Jan 30, 2024Updated 2 years ago
- [NeurIPS 2023] GenS: Generalizable Neural Surface Reconstruction from Multi-View Images☆39Jul 22, 2024Updated last year
- 👩🏻🔬🧪SciTonic is a highly adaptive technical operator of agents that can produce complex analyses on technical data with high perfor…☆55Jul 19, 2025Updated 9 months ago
- ☆22Mar 23, 2025Updated last year
- CLaMP 2: Multimodal Music Information Retrieval Across 101 Languages Using Large Language Models [NAACL 2025]☆64Feb 28, 2025Updated last year
- The repository for the paper "Distance between Relevant Information Pieces Causes Bias in Long-Context LLMs"☆14Dec 16, 2024Updated last year
- Towards Medical Small Language Models with Self-Evolved Slow Thinking☆88Nov 11, 2025Updated 5 months ago
- A set of tools for building AI coders☆16Sep 9, 2024Updated last year
- Statewide Visual Geolocalization in the Wild (ECCV 2024)☆74Dec 2, 2024Updated last year
- A swarm of LLM agents that will help you test, document, and productionize your code!☆18Apr 27, 2026Updated last week
- 🐮📢 The first AI voice assistant that interrupts *you*☆148Sep 6, 2024Updated last year
- Official Code for paper "Towards Efficient and Effective Unlearning of Large Language Models for Recommendation" (Frontiers of Computer S…☆38Jul 19, 2024Updated last year
- This repository holds enhanced Agents, built for the Microsoft AutoGen Framework. Debuting with a MemoryEnabledAgent with improvements in…☆116Oct 12, 2023Updated 2 years ago
- SnapDocs - A Modern, Open-Source Document Workspace☆25Sep 7, 2025Updated 7 months ago
- Packages whisper.cpp into pre-built, pip-installable wheels, for macOS and Linux.☆178Jun 10, 2024Updated last year
- ☆42Feb 8, 2026Updated 2 months ago
- ☆210May 28, 2025Updated 11 months ago
- The official GitHub page for the survey paper "Foundation Models for Music: A Survey".☆223Sep 4, 2024Updated last year
- AI agent simulation framework☆67Mar 25, 2026Updated last month
- Enrich data by prompting LLMs☆15Apr 1, 2026Updated last month
- Implementing a scalable content team using AI involves creating a framework that blends the strengths of AI technologies with the creativ…☆28Jun 29, 2024Updated last year
- AI-based search done right☆20Dec 25, 2025Updated 4 months ago
- Gradio UI to load crewAI configuration from excel xls and generate the python code. The source of the crews is in the xls. It allows for …☆11Oct 17, 2025Updated 6 months ago
- ☆88Oct 28, 2024Updated last year
- Building a Legal Case Search Engine Using Qdrant, Llama 3, LangChain and Exploring Different Filtering Techniques☆16Jul 6, 2024Updated last year