dsdanielpark / arxiv2text
Converting PDF files to text, mainly with a focus on arXiv papers.
☆14 · Updated 10 months ago
Alternatives and similar repositories for arxiv2text:
Users who are interested in arxiv2text are comparing it to the libraries listed below.
- Explore the use of DSPy for extracting features from PDFs ☆37 · Updated 10 months ago
- ☆37 · Updated last year
- A deep-dive into HyDE for Advanced LLM RAG + Introducing AutoHyDE, a semi-supervised framework to improve the effectiveness, covera… ☆30 · Updated 9 months ago
- Alpaca-lora for huggingface implementation using Deepspeed and FullyShardedDataParallel ☆24 · Updated last year
- ☆43 · Updated 3 months ago
- Writing Blog Posts with Generative Feedback Loops! ☆46 · Updated 9 months ago
- C++ inference wrappers for running blazing fast embedding services on your favourite serverless like AWS Lambda. By Prithivi Da, PRs welc… ☆20 · Updated 10 months ago
- Code and Dataset for Learning to Solve Complex Tasks by Talking to Agents ☆23 · Updated 2 years ago
- LLM-Training-API: Including Embeddings & ReRankers, mergekit, LaserRMT ☆26 · Updated 11 months ago
- ☆12 · Updated last year
- HuggingChat like UI in Gradio ☆69 · Updated last year
- Sakura-SOLAR-DPO: Merge, SFT, and DPO ☆116 · Updated last year
- Miscellaneous codes and writings for MLOps ☆11 · Updated this week
- Medical Mixture of Experts LLM using Mergekit. ☆20 · Updated 10 months ago
- ☆74 · Updated last year
- Evaluate your LLM apps, RAG pipeline, any generated text, and more! ☆0 · Updated 8 months ago
- Simple Model Similarities Analysis ☆21 · Updated 11 months ago
- Use OpenAI with HuggingChat by emulating the text_generation_inference_server ☆45 · Updated last year
- DSPy program/pipeline inspector widget for Jupyter/VSCode Notebooks. ☆31 · Updated 11 months ago
- [NAACL 2024] Official repository for "KTRL+F: Knowledge-Augmented In-Document Search" ☆23 · Updated 3 months ago
- Official code for ACL 2023 (short, findings) paper "Recursion of Thought: A Divide and Conquer Approach to Multi-Context Reasoning with L… ☆42 · Updated last year
- CRMArena: Understanding the Capacity of LLM Agents to Perform Professional CRM Tasks in Realistic Environments ☆40 · Updated this week
- Using open source LLMs to build synthetic datasets for direct preference optimization ☆49 · Updated 10 months ago
- Awesome series for LLMOps ☆39 · Updated 5 months ago
- manage histories of LLM applied applications ☆88 · Updated last year
- Very minimal (and stateless) agent framework ☆41 · Updated this week
- "Learning-based One-line intelligence Owner Network Connectivity Tool" ☆15 · Updated last year
- Chat to LLaMa 2 that also provides responses with reference documents over vector database. Locally available model using GPTQ 4bit quant… ☆30 · Updated last year
- A framework for high-fidelity retrieval augmented generation in industrial knowledge bases. Integrates jargon identification, context rec… ☆28 · Updated 5 months ago
- Experimental sampler to make LLMs more creative ☆30 · Updated last year