braun-steven / arxiv-downloaderLinks
A command line interface to download PDF files from https://arxiv.org.
☆57Updated 2 weeks ago
Alternatives and similar repositories for arxiv-downloader
Users that are interested in arxiv-downloader are comparing it to the libraries listed below
Sorting:
- Ensembling Hugging Face transformers made easy☆63Updated 2 years ago
- Safety Score for Pre-Trained Language Models☆96Updated last year
- An instruction-based benchmark for text improvements.☆141Updated 2 years ago
- ☆75Updated 4 years ago
- [AAAI 2024] Investigating the Effectiveness of Task-Agnostic Prefix Prompt for Instruction Following☆78Updated 11 months ago
- ☆29Updated last year
- Alpaca-lora for huggingface implementation using Deepspeed and FullyShardedDataParallel☆24Updated 2 years ago
- Minimal code to train a Large Language Model (LLM).☆172Updated 3 years ago
- ☆138Updated 7 months ago
- [ICLR 2023] Guess the Instruction! Flipped Learning Makes Language Models Stronger Zero-Shot Learners☆116Updated 2 months ago
- 👻 Code and benchmark for our EMNLP 2023 paper - "FANToM: A Benchmark for Stress-testing Machine Theory of Mind in Interactions"☆55Updated last year
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.☆94Updated 2 years ago
- A framework for few-shot evaluation of autoregressive language models.☆105Updated 2 years ago
- SILO Language Models code repository☆81Updated last year
- [ACL 2024] LangBridge: Multilingual Reasoning Without Multilingual Supervision☆92Updated 10 months ago
- Benchmarking library for RAG☆226Updated last month
- Pretraining Efficiently on S2ORC!☆167Updated 10 months ago
- Datasets collection and preprocessings framework for NLP extreme multitask learning☆186Updated 2 months ago
- Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)*☆87Updated last year
- Code accompanying the paper Pretraining Language Models with Human Preferences☆180Updated last year
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…☆49Updated last year
- Helper scripts and notes that were used while porting various nlp models☆47Updated 3 years ago
- Pipeline for pulling and processing online language model pretraining data from the web