dsdanielpark / arxiv2textLinks
Converting PDF files to text, mainly with a focus on arXiv papers.
☆24Updated last year
Alternatives and similar repositories for arxiv2text
Users that are interested in arxiv2text are comparing it to the libraries listed below
Sorting:
- Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models☆101Updated 2 years ago
- Source code of the paper: RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question Answering [F…☆67Updated last year
- Small and Efficient Mathematical Reasoning LLMs☆73Updated 2 years ago
- Official code for ACL 2023 (short, findings) paper "Recursion of Thought: A Divide and Conquer Approach to Multi-Context Reasoning with L…☆45Updated 2 years ago
- LLM reads a paper and produce a working prototype☆60Updated 10 months ago
- Alpaca-lora for huggingface implementation using Deepspeed and FullyShardedDataParallel☆24Updated 2 years ago
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆77Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆51Updated last year
- Jina VDR is a multilingual, multi-domain benchmark for visual document retrieval☆38Updated 6 months ago
- ☆84Updated 2 years ago
- Codebase accompanying the Summary of a Haystack paper.☆80Updated last year
- ☆54Updated 3 weeks ago
- ☆78Updated 2 years ago
- Explore the use of DSPy for extracting features from PDFs 🔎☆52Updated last year
- Benchmark baseline for retrieval qa applications☆119Updated last year
- ☆82Updated 3 months ago
- 🔧 Compare how Agent systems perform on several benchmarks. 📊🚀☆103Updated 6 months ago
- Using open source LLMs to build synthetic datasets for direct preference optimization☆72Updated last year
- My implementation of "Algorithm of Thoughts: Enhancing Exploration of Ideas in Large Language Models"☆100Updated 2 years ago
- HuggingChat like UI in Gradio☆70Updated 2 years ago
- Weekly visualization report of Open LLM model performance based on 4 metrics.☆86Updated 2 years ago
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆75Updated last year
- LLM-Training-API: Including Embeddings & ReRankers, mergekit, LaserRMT☆27Updated last year
- ☆59Updated last year
- EMNLP 2024 "Re-reading improves reasoning in large language models". Simply repeating the question to get bidirectional understanding for…☆28Updated last year
- A set of utilities for running few-shot prompting experiments on large-language models☆126Updated 2 years ago
- ☆63Updated last year
- Verifiers for LLM Reinforcement Learning☆80Updated 9 months ago
- Finetune Falcon, LLaMA, MPT, and RedPajama on consumer hardware using PEFT LoRA☆104Updated 8 months ago
- Mixing Language Models with Self-Verification and Meta-Verification☆112Updated last year