huggingface / finepdfs
Codebase for FinePDFs
☆176Updated last month
Alternatives and similar repositories for finepdfs
Users who are interested in finepdfs are comparing it to the libraries listed below.
- frozen-in-time version of our Paper Finder agent for reproducing evaluation results☆225Updated 5 months ago
- ☆107Updated 10 months ago
- Verifiers for LLM Reinforcement Learning☆82Updated 5 months ago
- ☆238Updated 2 months ago
- Benchmark and optimize LLM inference across frameworks with ease☆161Updated 4 months ago
- AI Agent that researches the lives of historical figures and extracts events into structured JSON timelines using LangGraph multi-agent o…☆227Updated 3 months ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆261Updated this week
- [ICLR2026] Test-Time Scaling with Reflective Generative Model☆302Updated 2 weeks ago
- Real-Time Detection of Hallucinated Entities in Long-Form Generation☆278Updated 2 months ago
- Extract structured data from any content using LLMs.☆109Updated 2 months ago
- A set of tools to create synthetically-generated data from documents☆39Updated 5 months ago
- How to build the best search, one step at a time!☆233Updated 2 months ago
- ScreenSuite - The most comprehensive benchmarking suite for GUI Agents!☆137Updated 4 months ago
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models☆115Updated 10 months ago
- Code to accompany the Universal Deep Research paper (https://arxiv.org/abs/2509.00244)☆460Updated 5 months ago
- DeepDive: Advancing Deep Search Agents with Knowledge Graphs and Multi-Turn RL☆281Updated 4 months ago
- Simple UI for debugging correlations of text embeddings☆305Updated 8 months ago
- Train LLM on Hugging Face infra☆67Updated 2 months ago
- Data recipes and robust infrastructure for training AI agents☆94Updated this week
- ☆274Updated 3 weeks ago
- alphaxiv open source alternative☆109Updated 8 months ago
- Data Synthesis for Deep Research Based on Semi-Structured Data☆198Updated last month
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte…☆84Updated last year
- Deep research agents using MiniMax M2.1 interleaved thinking☆196Updated last month
- Context Engineering Course with DSPy☆214Updated 6 months ago
- XTR/WARP (SIGIR'25) is an extremely fast and accurate retrieval engine based on Stanford's ColBERTv2/PLAID and Google DeepMind's XTR.☆181Updated 9 months ago
- Inference, Fine Tuning and many more recipes with Gemma family of models☆279Updated 6 months ago
- The code repository of the paper: Competition and Attraction Improve Model Fusion☆169Updated 5 months ago
- Code for our paper PAPILLON: PrivAcy Preservation from Internet-based and Local Language MOdel ENsembles☆61Updated 9 months ago
- Streamline on-policy/off-policy distillation workflows in a few lines of code☆95Updated this week