herniqeu / extract0Links
Extract-0: A Specialized Language Model for Document Information
☆127Updated 4 months ago
Alternatives and similar repositories for extract0
Users that are interested in extract0 are comparing it to the libraries listed below
Sorting:
- Lightweight Nearest Neighbors with Flexible Backends☆333Updated last month
- Unified Schema-Based Information Extraction☆714Updated last week
- Parse vision is an open source tool to visualise what OCR is parsing in a PDF document to help developers and product teams identify if t…☆85Updated last year
- Official implementation of "WhisperNER: Unified Open Named Entity and Speech Recognition"☆200Updated 11 months ago
- ☆85Updated 5 months ago
- ☆171Updated last week
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch☆103Updated last year
- Pixelagent — Multimodal stateful agents☆224Updated 8 months ago
- ☆198Updated 6 months ago
- Sculpt: Structuring unstructured data with LLMs☆38Updated 4 months ago
- Small python package to measure OCR quality and other related metrics.☆26Updated last year
- ☆120Updated 6 months ago
- Montelimar - Extract text from anywhere☆87Updated 4 months ago
- Example repo showcasing model training and deployment with distil claude cli skill☆51Updated 3 weeks ago
- DSPydantic: Auto-Optimize Your Prompts and Pydantic Models with DSPy☆244Updated last week
- A mcp server that uses the Osmosis-Apply-1.7B model to apply code merges☆53Updated 7 months ago
- Jupyter Notebooks and an R Notebook for encoding Pokémon embeddings and creating data visualizations.☆20Updated last year
- 🔥 LitLytics - an affordable, simple analytics platform that leverages LLMs to automate data analysis☆103Updated last year
- Python library to use Pleias-RAG models☆68Updated 9 months ago
- This repository contains a Retrieval-Augmented Generation (RAG) framework developed in C++ for high performance and scalability, with CUD…☆115Updated 5 months ago
- Extract structured data from any content using LLMs.☆109Updated 2 months ago
- Enhancing LLMs with LoRA☆206Updated 3 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆69Updated 2 months ago
- MLX-based QA pair generator and LLM finetuning tool in Streamlit☆42Updated 3 months ago
- This is the official PyTorch implementation for our NAACL 2024 paper: "AnchorAL: Computationally Efficient Active Learning for Large and …☆21Updated 9 months ago
- Fast Streaming TTS with Orpheus + WebRTC (with FastRTC)☆348Updated 9 months ago
- ☆107Updated 3 months ago
- One Line To Build Zero-Data Classifiers in Minutes☆63Updated last year
- frozen-in-time version of our Paper Finder agent for reproducing evaluation results☆224Updated 5 months ago
- Hierarchical topic segmentation of meeting transcripts using embeddings and divisive clustering.☆54Updated last year