dsdanielpark/arxiv2text

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/dsdanielpark/arxiv2text)

dsdanielpark / arxiv2text

Converting PDF files to text, mainly with a focus on arXiv papers.

☆25

Alternatives and similar repositories for arxiv2text

Users that are interested in arxiv2text are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

dsdanielpark / open-interview
View on GitHub
Open Interview automates technical Q&A generation from resumes, offers document and audio outputs, and customizable settings for efficien…
☆19May 8, 2024Updated 2 years ago
myeonghak / kobert-multi-label-VOC-classifier
View on GitHub
pretrained kobert를 사용한 multi-label VOC(Voice of Customers) 태그 분류 모델
☆15Apr 25, 2022Updated 4 years ago
dsdanielpark / hf-transllm
View on GitHub
LLMtranslator translates and generates text in multiple languages.
☆45May 10, 2024Updated 2 years ago
seanchatmangpt / rdddy
View on GitHub
Reactive DDD with DSPy
☆23Feb 24, 2024Updated 2 years ago
dsdanielpark / co-coder
View on GitHub
Co-Coder is a Python package that streamlines error debugging from Open AI chat GPT and Google Bard by providing hints, example code, and…
☆45May 22, 2023Updated 3 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
dsdanielpark / gpt2-bert-medical-qa-chat
View on GitHub
Medical domain-focused GPT-2 fine-tuning, optimization, and lightweighting research repository (compared to GPT-4).
☆38Mar 13, 2024Updated 2 years ago
zwhe99 / LLM-MT-Eval
View on GitHub
{DeepL, Google, WMT-Best, davinci-003, turbo, gpt-4} × {En-De, En-Cs, En-Ru, En-Zh, De-Fr, En-Ja, Uk-En, Uk-Cs, En-Hr, En-Ha, En-Is}
☆14Jun 18, 2023Updated 3 years ago
dsdanielpark / open-llm-datasets
View on GitHub
Repository for organizing datasets and papers used in Open LLM.
☆100Jul 6, 2023Updated 3 years ago
S1M0N38 / dspy-arxiv
View on GitHub
Explore the use of DSPy for extracting features from PDFs 🔎
☆54Mar 1, 2024Updated 2 years ago
Skytliang / SpyGame
View on GitHub
SpyGame: An interactive multi-agent framework to evaluate intelligence with large language models :D
☆15Nov 9, 2023Updated 2 years ago
krypticmouse / dspy-docs
View on GitHub
Official Documentation for DSPy Library
☆25Updated this week
wxjiao / Pre-CODE
View on GitHub
Implementation of our paper "Exploiting Unsupervised Data for Emotion Recognition in Conversations" in the Findings of EMNLP-2020.
☆13Nov 17, 2020Updated 5 years ago
Tonic-AI / EasyAGI
View on GitHub
🌟EasyAGI : A generalist agent that can go online and accomplish complex tasks.
☆30Dec 12, 2023Updated 2 years ago
kenchan0226 / FineGrainedFact
View on GitHub
Official implementation of the ACL Findings 2023 paper: Interpretable Automatic Fine-grained Inconsistency Detection in Text Summarizatio…
☆15Jan 25, 2024Updated 2 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
TroyDoesAI / AI_Research
View on GitHub
My Gen AI research
☆11Jun 3, 2024Updated 2 years ago
Chen-Wang-CUHK / Training-Free-and-Ref-Free-Summ-Evaluation
View on GitHub
The source code of our ACL paper "A Training-free and Reference-free Summarization Evaluation Metric via Centrality-weighted Relevance an…
☆14May 6, 2023Updated 3 years ago
hexuandeng / Mono4SiMT
View on GitHub
The implementation for our paper, "Improving Simultaneous Machine Translation with Monolingual Data," accepted to AAAI 2023. 🎉
☆12Jul 19, 2023Updated 3 years ago
AIAnytime / Create-Vector-Store-from-Scratch
View on GitHub
Create Vector Store from Scratch in pure Python.
☆13Dec 15, 2023Updated 2 years ago
heiko-hotz / multimodal-live-api-web-console
View on GitHub
A react-based starter app for using the Multimodal Live API over websockets with Gemini
☆17Dec 22, 2024Updated last year
yongchanghao / multi-task-nat
View on GitHub
☆11Jul 17, 2021Updated 5 years ago
multimodal-art-projection / Megatron-LM-NEO
View on GitHub
☆40May 9, 2024Updated 2 years ago
siyan-sylvia-li / arxivParser
View on GitHub
☆18Sep 21, 2023Updated 2 years ago
davanstrien / huggingface-tldr
View on GitHub
Experimental tl;dr summaries for datasets on the Hugging Face Hub!
☆10Apr 4, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Antony90 / arxiv-discord
View on GitHub
arXiv-Chat: An AI research assistant and Discord bot
☆13Jul 16, 2023Updated 3 years ago
dsdanielpark / open-llm-leaderboard-report
View on GitHub
Weekly visualization report of Open LLM model performance based on 4 metrics.
☆86Dec 14, 2023Updated 2 years ago
Atrewin / SignXmDA
View on GitHub
This is the official code repository for the paper 'Cross-modality Data Augmentation for End-to-End Sign Language Translation'. Accepted…
☆16Oct 18, 2023Updated 2 years ago
jiah-li / magic
View on GitHub
The repo for paper: Exploiting the Index Gradients for Optimization-Based Jailbreaking on Large Language Models.
☆15Dec 16, 2024Updated last year
weaviate-tutorials / Hurricane
View on GitHub
Writing Blog Posts with Generative Feedback Loops!
☆52Mar 19, 2024Updated 2 years ago
xingjian-zhang / massw
View on GitHub
MASSW is a comprehensive text dataset on Multi-Aspect Summarization of Scientific Workflows. MASSW includes more than 152,000 peer-review…
☆22May 16, 2025Updated last year
shuo-git / InfECE
View on GitHub
☆20Dec 31, 2020Updated 5 years ago
filipopo / undetected-chromedriver-lambda
View on GitHub
A minimal working example of using undetected-chromedriver on AWS Lambda with Selenium and Docker
☆18Aug 12, 2025Updated 11 months ago
myeonghak / Transformer-product-categorization
View on GitHub
트랜스포머 블록을 활용한 상품명 자연어처리 기반 카테고리 분류 모델
☆10Dec 5, 2022Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Coding-Crashkurse / Applied-Advanced-RAG
View on GitHub
☆24Jan 28, 2024Updated 2 years ago
florianjuengermann / query-god
View on GitHub
QueryGod lets you interact with any API or database using natural language. Writing simple prompts you can chain together the execution o…
☆15Dec 11, 2022Updated 3 years ago
adiekaye / very-simple-vector-database
View on GitHub
A Very Simple Vector Database
☆15May 1, 2023Updated 3 years ago
SunbowLiu / SurfaceFusion
View on GitHub
Understanding and Improving Encoder Layer Fusion in Sequence-to-Sequence Learning (ICLR 2021)
☆24Mar 18, 2021Updated 5 years ago
umrlastig / tracklib
View on GitHub
Tracklib library provide a variety of tools, operators and functions to manipulate GPS trajectories
☆18Jul 5, 2026Updated 3 weeks ago
acheong08 / gpt4
View on GitHub
The real GPT-4 with image access (You probably don't have access)
☆12Mar 17, 2023Updated 3 years ago
nuclia / nuclia-eval
View on GitHub
Library for evaluating RAG using Nuclia's models
☆18Jul 31, 2024Updated last year