dsdanielpark / arxiv2textLinks
Converting PDF files to text, mainly with a focus on arXiv papers.
β23Updated last year
Alternatives and similar repositories for arxiv2text
Users that are interested in arxiv2text are comparing it to the libraries listed below
Sorting:
- Codebase accompanying the Summary of a Haystack paper.β79Updated last year
- Lightweight demos for finetuning LLMs. Powered by π€ transformers and open-source datasets.β78Updated last year
- Source code of the paper: RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question Answering [Fβ¦β69Updated last year
- Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Modelsβ100Updated 2 years ago
- Large-language Model Evaluation framework with Elo Leaderboard and A-B testingβ52Updated last year
- Small and Efficient Mathematical Reasoning LLMsβ72Updated last year
- Explore the use of DSPy for extracting features from PDFs πβ49Updated last year
- Mixing Language Models with Self-Verification and Meta-Verificationβ110Updated last year
- β85Updated 2 years ago
- β75Updated last year
- LLM reads a paper and produce a working prototypeβ60Updated 8 months ago
- Jina VDR is a multilingual, multi-domain benchmark for visual document retrievalβ37Updated 4 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absoluteβ¦β51Updated last year
- Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners"β118Updated 2 months ago
- β58Updated last year
- β81Updated last month
- Official code for ACL 2023 (short, findings) paper "Recursion of Thought: A Divide and Conquer Approach to Multi-Context Reasoning with Lβ¦β45Updated 2 years ago
- EMNLP 2024 "Re-reading improves reasoning in large language models". Simply repeating the question to get bidirectional understanding forβ¦β27Updated last year
- π§ Compare how Agent systems perform on several benchmarks. ππβ102Updated 4 months ago
- Code for "Democratizing Reasoning Ability: Tailored Learning from Large Language Model", EMNLP 2023β36Updated 2 years ago
- Code for EMNLP 2024 paper "Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning"β56Updated last year
- β78Updated last year
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)β75Updated last year
- β129Updated last year
- Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first appβ¦β170Updated last year
- ReBase: Training Task Experts through Retrieval Based Distillationβ29Updated 10 months ago
- β20Updated 8 months ago
- Retrieval Augmented Generation Generalized Evaluation Datasetβ59Updated 5 months ago
- A set of utilities for running few-shot prompting experiments on large-language modelsβ126Updated 2 years ago
- β67Updated 8 months ago