dsdanielpark / arxiv2textLinks
Converting PDF files to text, mainly with a focus on arXiv papers.
β24Updated last year
Alternatives and similar repositories for arxiv2text
Users that are interested in arxiv2text are comparing it to the libraries listed below
Sorting:
- Explore the use of DSPy for extracting features from PDFs πβ52Updated last year
- β54Updated 2 weeks ago
- Codebase accompanying the Summary of a Haystack paper.β80Updated last year
- ππ§ Easily experiment with popular language agents across diverse reasoning/decision-making benchmarks!β53Updated 6 months ago
- EMNLP 2024 "Re-reading improves reasoning in large language models". Simply repeating the question to get bidirectional understanding forβ¦β28Updated last year
- Lightweight demos for finetuning LLMs. Powered by π€ transformers and open-source datasets.β77Updated last year
- Source code of the paper: RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question Answering [Fβ¦β68Updated last year
- Small and Efficient Mathematical Reasoning LLMsβ73Updated 2 years ago
- The first dense retrieval model that can be prompted like an LMβ90Updated 8 months ago
- HuggingChat like UI in Gradioβ70Updated 2 years ago
- β129Updated last year
- β84Updated 2 years ago
- Using open source LLMs to build synthetic datasets for direct preference optimizationβ72Updated last year
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignmentβ61Updated last year
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)β75Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absoluteβ¦β51Updated last year
- β55Updated last year
- Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners"β120Updated 3 months ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)β92Updated last year
- β75Updated last year
- β61Updated last year
- β59Updated last year
- Official code for ACL 2023 (short, findings) paper "Recursion of Thought: A Divide and Conquer Approach to Multi-Context Reasoning with Lβ¦β45Updated 2 years ago
- Weekly visualization report of Open LLM model performance based on 4 metrics.β86Updated 2 years ago
- LLM reads a paper and produce a working prototypeβ60Updated 9 months ago
- LLM-Training-API: Including Embeddings & ReRankers, mergekit, LaserRMTβ27Updated last year
- Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Modelsβ101Updated 2 years ago
- Jina VDR is a multilingual, multi-domain benchmark for visual document retrievalβ38Updated 5 months ago
- Mixing Language Models with Self-Verification and Meta-Verificationβ112Updated last year
- [EMNLP 2024] A Retrieval Benchmark for Scientific Literature Searchβ102Updated last year