arXiv / arxiv-search
arXiv Search UI & APIs
☆103Updated last month
Alternatives and similar repositories for arxiv-search:
Users that are interested in arxiv-search are comparing it to the libraries listed below
- Flask app for article abstract and listing pages☆125Updated this week
- Pilot project to render HTML5 from arXiv LaTeX sources☆112Updated 5 years ago
- Supporting libraries for templates and arXiv services☆34Updated this week
- Tools to construct and process webgraphs from Common Crawl data☆85Updated 2 weeks ago
- A robust web archive analytics toolkit☆98Updated 2 months ago
- A turnkey command for converting a LaTeX source to ar5iv-style HTML☆59Updated 11 months ago
- Atom and RSS feeds for arXiv articles☆13Updated this week
- arXiv plain text extraction☆41Updated 2 years ago
- A general purpose processing framework for corpora of scientific documents☆58Updated 9 months ago
- Various Jupyter notebooks about Common Crawl data☆50Updated 2 weeks ago
- Providing references and citations on abstract pages for the arXiv☆129Updated 2 years ago
- The AI Knowledge Editor☆183Updated 2 years ago
- Statistics of Common Crawl monthly archives mined from URL index files☆170Updated last week
- ☆89Updated 2 years ago
- Indri search implementation on top of Lucene search engine☆34Updated 11 months ago
- SLING - A natural language frame semantics parser☆163Updated this week
- search interface for scholarly works☆83Updated 6 months ago
- Safety Score for Pre-Trained Language Models☆94Updated last year
- Python tools for processing the stackexchange data dumps into a text dataset for Language Models☆81Updated last year
- Two Automatic code completion IDE extensions for @JetBrains and @microsoft/vscode based on Transformer-based large language models for so…☆55Updated 10 months ago
- ☆36Updated last year
- Scientific articles using or citing Common Crawl data☆13Updated 2 weeks ago
- PostText is a QA system for querying your text data. When appropriate structured views are in place, PostText is good at answering querie…☆31Updated last year
- The official code repo for "Sub-Sentence Encoder: Contrastive Learning of Propositional Semantic Representations".☆79Updated last year
- SOftware Metadata Extraction Framework: A tool for automatically extracting relevant software information from readme files☆50Updated this week
- Python API for https://vespa.ai, the open big data serving engine☆113Updated this week
- OVALChat is a customizable Web app aimed at conducting user studies with chatbots☆29Updated last year
- distill chatGPT coding ability into small model (1b)☆28Updated last year
- ☆15Updated 10 months ago
- Code for the curation of The Stack v2 and StarCoder2 training data☆95Updated 10 months ago