dsdanielpark / arxiv2text
Converting PDF files to text, mainly with a focus on arXiv papers.
☆16 Updated last year
Alternatives and similar repositories for arxiv2text:
Users who are interested in arxiv2text are comparing it to the libraries listed below.
- Explore the use of DSPy for extracting features from PDFs ☆39 Updated last year
- LLM reads a paper and produces a working prototype ☆51 Updated 2 weeks ago
- Writing Blog Posts with Generative Feedback Loops! ☆47 Updated last year
- ☆37 Updated last year
- Official code for ACL 2023 (short, findings) paper "Recursion of Thought: A Divide and Conquer Approach to Multi-Context Reasoning with L… ☆43 Updated last year
- A Flask extension to manage Langchain chat memory and document stores in Flask apps. ☆70 Updated last year
- ☆45 Updated 6 months ago
- ☆20 Updated 9 months ago
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer ☆41 Updated last year
- Easily experiment with popular language agents across diverse reasoning/decision-making benchmarks! ☆51 Updated 2 weeks ago
- A framework for high-fidelity retrieval augmented generation in industrial knowledge bases. Integrates jargon identification, context rec… ☆29 Updated 7 months ago
- Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform ☆84 Updated 2 weeks ago
- ☆24 Updated 6 months ago
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets. ☆73 Updated 5 months ago
- Simple playground chat app that interacts with OpenAI's functions with memory and custom tools. ☆18 Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute… ☆49 Updated 8 months ago
- Uses a Gradio interface to stream coding related responses from local and cloud based large language models. Pulls context from GitHub Re… ☆20 Updated 2 weeks ago
- ☆11 Updated last year
- Alpaca-lora for huggingface implementation using Deepspeed and FullyShardedDataParallel ☆24 Updated last year
- ☆74 Updated last year
- ☆24 Updated last year
- One Line To Build Zero-Data Classifiers in Minutes ☆36 Updated 6 months ago
- Source code for our paper: "SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals". ☆65 Updated 9 months ago
- EMNLP 2024 "Re-reading improves reasoning in large language models". Simply repeating the question to get bidirectional understanding for… ☆25 Updated 3 months ago
- LLM-Training-API: Including Embeddings & ReRankers, mergekit, LaserRMT ☆27 Updated last year
- Experimental sampler to make LLMs more creative