valayDave / arxiv-minerLinks
arxiv_miner is a toolkit for mining research papers on CS ArXiv.
☆137Updated last year
Alternatives and similar repositories for arxiv-miner
Users that are interested in arxiv-miner are comparing it to the libraries listed below
Sorting:
- Functional deep learning☆108Updated 2 years ago
- A diff tool for language models☆43Updated last year
- HetSeq: Distributed GPU Training on Heterogeneous Infrastructure☆106Updated 2 years ago
- ☆103Updated 4 years ago
- A case study of efficient training of large language models using commodity hardware.☆68Updated 3 years ago
- API Client for paperswithcode.com☆183Updated last year
- 🤗 Disaggregators: Curated data labelers for in-depth analysis.☆66Updated 2 years ago
- Distributed skorch on Ray Train☆58Updated 2 years ago
- Generating Training Data Made Easy☆43Updated 5 years ago
- Python Research Framework☆106Updated 2 years ago
- Library that contains implementations of machine learning components in the hyperbolic space☆141Updated last year
- Intelligence Task Ontology (ITO)☆74Updated 2 years ago
- arXiv plain text extraction☆41Updated 2 years ago
- Vectorizers for a range of different data types☆102Updated 6 months ago
- Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and te…☆42Updated last year
- ☆156Updated last month
- Evaluation suite for large-scale language models.☆127Updated 4 years ago
- A deep learning library based on Pytorch focussed on low resource language research and robustness☆70Updated 3 years ago
- The Python library with command line tools to interact with Dynabench(https://dynabench.org/), such as uploading models.☆55Updated 3 years ago
- AI Assistant for Building Reliable, High-performing and Fair Multilingual NLP Systems☆47Updated 3 years ago
- The "tl;dr" on a few notable transformer papers (pre-2022).☆190Updated 2 years ago
- Model Validation Toolkit is a collection of tools to assist with validating machine learning models prior to deploying them to production…☆29Updated last year
- Introduction to Data-Centric AI, MIT IAP 2023 🤖☆103Updated 2 months ago
- Ludwig benchmark☆19Updated 3 years ago
- A lightweight wrapper for PyTorch that provides a simple declarative API for context switching between devices, distributed modes, mixed- …☆67Updated 2 years ago
- AI Data Management & Evaluation Platform☆216Updated last year
- Large dataset storage format for Pytorch☆45Updated 4 years ago
- Annotate python source code☆69Updated 5 years ago
- SPEAR: Programmatically label and build training data quickly.☆108Updated last year
- SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields extraction, tokenization, prompting, batchi…☆33Updated last year