Converting PDF files to text, mainly with a focus on arXiv papers.
☆24Feb 19, 2024Updated 2 years ago
Alternatives and similar repositories for arxiv2text
Users that are interested in arxiv2text are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- pretrained kobert를 사용한 multi-label VOC(Voice of Customers) 태그 분류 모델☆15Apr 25, 2022Updated 4 years ago
- LLMtranslator translates and generates text in multiple languages.☆45May 10, 2024Updated 2 years ago
- Medical domain-focused GPT-2 fine-tuning, optimization, and lightweighting research repository (compared to GPT-4).☆38Mar 13, 2024Updated 2 years ago
- A bespoke time‑first language + toolchain for hybrid Neuromorphic - classical systems☆11Feb 4, 2026Updated 4 months ago
- Official Documentation for DSPy Library☆24Updated this week
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- 💵 Code for Less is More for Long Document Summary Evaluation by LLMs (Wu*, Iso* et al; EACL 2024)☆11Feb 22, 2024Updated 2 years ago
- {DeepL, Google, WMT-Best, davinci-003, turbo, gpt-4} × {En-De, En-Cs, En-Ru, En-Zh, De-Fr, En-Ja, Uk-En, Uk-Cs, En-Hr, En-Ha, En-Is}☆14Jun 18, 2023Updated 2 years ago
- Implementation of our paper "Scaling Back-Translation with Domain Text Generation for Sign Language Gloss Translation". Accepted in EACL …☆11May 22, 2023Updated 3 years ago
- SpyGame: An interactive multi-agent framework to evaluate intelligence with large language models :D☆15Nov 9, 2023Updated 2 years ago
- Implementation of our paper "Exploiting Unsupervised Data for Emotion Recognition in Conversations" in the Findings of EMNLP-2020.☆13Nov 17, 2020Updated 5 years ago
- brings autocomplete to Quill Placeholder module☆12Sep 28, 2018Updated 7 years ago
- Official implementation of the ACL Findings 2023 paper: Interpretable Automatic Fine-grained Inconsistency Detection in Text Summarizatio…☆14Jan 25, 2024Updated 2 years ago
- kNN-TL: k-Nearest-Neighbor Transfer Learning for Low-Resource Neural Machine Translation (ACL2023)☆11Jul 26, 2023Updated 2 years ago
- Source Code for "Adapters for Enhanced Modeling of Multilingual Knowledge and Text"☆12Oct 28, 2022Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- The implementation for our paper, "Improving Simultaneous Machine Translation with Monolingual Data," accepted to AAAI 2023. 🎉☆12Jul 19, 2023Updated 2 years ago
- My Gen AI research☆11Jun 3, 2024Updated 2 years ago
- 🌟EasyAGI : A generalist agent that can go online and accomplish complex tasks.☆30Dec 12, 2023Updated 2 years ago
- Create Vector Store from Scratch in pure Python.☆13Dec 15, 2023Updated 2 years ago
- Python API for Science Parse☆13Mar 27, 2021Updated 5 years ago
- A fully autonomous AI artist☆19Jun 19, 2023Updated 2 years ago
- ☆12Jan 25, 2025Updated last year
- Leveraging DSPy for AI-driven task understanding and solution generation, the Self-Discover Framework automates problem-solving through r…☆74Nov 4, 2025Updated 7 months ago
- Helm Chart for Fastapi Deployment☆10May 18, 2020Updated 6 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Reasoning-based Evaluation and Ranking of Translations.☆20Jun 2, 2026Updated last week
- Experimental tl;dr summaries for datasets on the Hugging Face Hub!☆10Apr 4, 2024Updated 2 years ago
- ☆18Sep 21, 2023Updated 2 years ago
- Crawls emails from web pages☆10Jun 16, 2024Updated last year
- Yahoo! Finance next gen python 3 / pandas market data downloader☆11Feb 22, 2026Updated 3 months ago
- Official PyTorch implementation of "Neural Relation Graph: A Unified Framework for Identifying Label Noise and Outlier Data" (NeurIPS'23)☆15Dec 4, 2023Updated 2 years ago
- Do Multilingual Language Models Think Better in English?☆42Aug 3, 2023Updated 2 years ago
- arXiv-Chat: An AI research assistant and Discord bot☆13Jul 16, 2023Updated 2 years ago
- Sample project with a very simple API build with Django Rest Framework to illustrate the use of AWS Fargate and Aurora Serverless with po…☆13Apr 21, 2023Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆13Oct 31, 2025Updated 7 months ago
- A tutorial for building Web Services in Rust with Actix-Web, SQLx, and PostgreSQL☆13Mar 20, 2024Updated 2 years ago
- ☆13Nov 11, 2022Updated 3 years ago
- A scraper for bioRxiv☆12Apr 12, 2023Updated 3 years ago
- Writing Blog Posts with Generative Feedback Loops!☆51Mar 19, 2024Updated 2 years ago
- Detecting topic clusters in arXiv ML papers.☆14Oct 10, 2020Updated 5 years ago
- Understanding and Improving Encoder Layer Fusion in Sequence-to-Sequence Learning (ICLR 2021)☆24Mar 18, 2021Updated 5 years ago