dsdanielpark / arxiv2text
Converts PDF files to text, with a focus on arXiv papers.
☆23 · Updated last year
Alternatives and similar repositories for arxiv2text
Users who are interested in arxiv2text compare it to the libraries listed below.
- Jina VDR is a multilingual, multi-domain benchmark for visual document retrieval ☆34 · Updated 3 months ago
- Source code of the paper: RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question Answering [F… ☆69 · Updated last year
- Codebase accompanying the Summary of a Haystack paper. ☆79 · Updated last year
- Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models ☆99 · Updated 2 years ago
- Open Implementations of LLM Analyses ☆107 · Updated last year
- ☆51 · Updated last year
- Explore the use of DSPy for extracting features from PDFs 🔎 ☆49 · Updated last year
- Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners" ☆117 · Updated last month
- Official code for ACL 2023 (short, findings) paper "Recursion of Thought: A Divide and Conquer Approach to Multi-Context Reasoning with L… ☆45 · Updated 2 years ago
- Small and Efficient Mathematical Reasoning LLMs ☆72 · Updated last year
- EMNLP 2024 "Re-reading improves reasoning in large language models". Simply repeating the question to get bidirectional understanding for… ☆27 · Updated 11 months ago
- Code for "Democratizing Reasoning Ability: Tailored Learning from Large Language Model", EMNLP 2023 ☆36 · Updated last year
- Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first app… ☆170 · Updated last year
- ☆80 · Updated 2 weeks ago
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets. ☆78 · Updated last year
- Weekly visualization report of Open LLM model performance based on 4 metrics. ☆86 · Updated last year
- Repository for “PlanRAG: A Plan-then-Retrieval Augmented Generation for Generative Large Language Models as Decision Makers”, NAACL24 ☆151 · Updated last year
- My implementation of "Algorithm of Thoughts: Enhancing Exploration of Ideas in Large Language Models" ☆99 · Updated 2 years ago
- ☆120 · Updated last year
- The first dense retrieval model that can be prompted like an LM ☆89 · Updated 6 months ago
- ☆78 · Updated last year
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025) ☆91 · Updated 10 months ago
- Sakura-SOLAR-DPO: Merge, SFT, and DPO ☆116 · Updated last year
- Using open source LLMs to build synthetic datasets for direct preference optimization ☆69 · Updated last year
- Mixing Language Models with Self-Verification and Meta-Verification ☆110 · Updated 11 months ago
- Track the progress of LLM context utilisation ☆55 · Updated 7 months ago
- ☆58 · Updated last year
- Alpaca-lora for huggingface implementation using Deepspeed and FullyShardedDataParallel ☆24 · Updated 2 years ago
- ☆51 · Updated last year
- ☆55 · Updated last year