valayDave / arxiv-miner
arxiv_miner is a toolkit for mining research papers on CS ArXiv.
☆137 Updated last year
Alternatives and similar repositories for arxiv-miner
Users interested in arxiv-miner are comparing it to the libraries listed below.
- A diff tool for language models ☆44 Updated last year
- A deep learning library based on Pytorch focussed on low resource language research and robustness ☆70 Updated 3 years ago
- Intelligence Task Ontology (ITO) ☆74 Updated 2 years ago
- Functional deep learning ☆108 Updated 2 years ago
- A case study of efficient training of large language models using commodity hardware. ☆68 Updated 3 years ago
- HetSeq: Distributed GPU Training on Heterogeneous Infrastructure ☆106 Updated 2 years ago
- Annotate python source code ☆69 Updated 5 years ago
- API Client for paperswithcode.com ☆184 Updated last year
- ☆104 Updated 4 years ago
- Ludwig benchmark ☆19 Updated 3 years ago
- The Python library with command line tools to interact with Dynabench (https://dynabench.org/), such as uploading models. ☆55 Updated 3 years ago
- Helper scripts and notes that were used while porting various nlp models ☆48 Updated 3 years ago
- AI Assistant for Building Reliable, High-performing and Fair Multilingual NLP Systems ☆47 Updated 3 years ago
- Evaluation suite for large-scale language models. ☆128 Updated 4 years ago
- Python Research Framework ☆106 Updated 2 years ago
- Automatically check mismatch between code and comments using AI and ML ☆53 Updated 4 years ago
- arXiv plain text extraction ☆41 Updated 2 years ago
- Open Source Annotation Tools for Computer Vision and NLP tasks ☆53 Updated 4 years ago
- ☆156 Updated 3 weeks ago
- Reads arXiv papers using Text-to-Speech ☆62 Updated 2 years ago
- Distributed skorch on Ray Train ☆58 Updated 3 years ago
- The "tl;dr" on a few notable transformer papers (pre-2022). ☆190 Updated 2 years ago
- Yet another mini autodiff system for educational purposes ☆30 Updated 10 months ago
- Model Validation Toolkit is a collection of tools to assist with validating machine learning models prior to deploying them to production… ☆29 Updated last year
- Generating Training Data Made Easy ☆43 Updated 5 years ago
- An implementation of Additive Attention ☆150 Updated 3 years ago
- Scripts to convert datasets from various sources to Hugging Face Datasets. ☆57 Updated 2 years ago
- Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and te… ☆44 Updated last year
- Streamlit demo app to demonstrate the features of transformers interpret with multiple models. ☆25 Updated 4 years ago
- Large dataset storage format for Pytorch