braun-steven / arxiv-downloaderLinks
A command line interface to download PDF files from https://arxiv.org.
☆62Updated 4 months ago
Alternatives and similar repositories for arxiv-downloader
Users that are interested in arxiv-downloader are comparing it to the libraries listed below
Sorting:
- ☆78Updated 3 months ago
- ☆145Updated 11 months ago
- Safety Score for Pre-Trained Language Models☆96Updated 2 years ago
- Ensembling Hugging Face transformers made easy☆61Updated 3 years ago
- Converting PDF files to text, mainly with a focus on arXiv papers.☆24Updated last year
- A framework for few-shot evaluation of autoregressive language models.☆105Updated 2 years ago
- evolve llm training instruction, from english instruction to any language.☆119Updated 2 years ago
- BLOOM+1: Adapting BLOOM model to support a new unseen language☆74Updated last year
- Benchmarking library for RAG☆252Updated 3 months ago
- An instruction-based benchmark for text improvements.☆143Updated 3 years ago
- [AAAI 2024] Investigating the Effectiveness of Task-Agnostic Prefix Prompt for Instruction Following☆78Updated last year
- Repository for analysis and experiments in the BigCode project.☆128Updated last year
- ☆29Updated last year
- Dataset from the paper "Mintaka: A Complex, Natural, and Multilingual Dataset for End-to-End Question Answering" (COLING 2022)☆117Updated 3 years ago
- [NAACL 2024] Official repository for "KTRL+F: Knowledge-Augmented In-Document Search"☆23Updated last year
- Retrieval Augmented Generation Generalized Evaluation Dataset☆59Updated 5 months ago
- Repository for the "Understanding and Mitigating Language Confusion in LLMs" paper☆31Updated last year
- MultilingualSIFT: Multilingual Supervised Instruction Fine-tuning☆96Updated 2 years ago
- ☆102Updated 3 years ago
- The official code of TACL 2021, "Did Aristotle Use a Laptop? A Question Answering Benchmark with Implicit Reasoning Strategies".☆82Updated 3 years ago
- Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback☆96Updated 2 years ago
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers☆137Updated last year
- Python tools for processing the stackexchange data dumps into a text dataset for Language Models☆86Updated 2 years ago
- Sakura-SOLAR-DPO: Merge, SFT, and DPO☆116Updated 2 years ago
- Datasets collection and preprocessings framework for NLP extreme multitask learning☆189Updated 6 months ago
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.☆96Updated 2 years ago
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…☆49Updated 2 years ago
- Tk-Instruct is a Transformer model that is tuned to solve many NLP tasks by following instructions.☆183Updated 3 years ago
- Alpaca-lora for huggingface implementation using Deepspeed and FullyShardedDataParallel☆24Updated 2 years ago
- Code for Multilingual Eval of Generative AI paper published at EMNLP 2023☆71Updated last year