☆33May 23, 2023Updated 2 years ago
Alternatives and similar repositories for commoncrawl_downloader
Users that are interested in commoncrawl_downloader are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Downloads 2020 English Wikipedia articles as plaintext☆27Mar 25, 2023Updated 3 years ago
- ☆16Mar 25, 2022Updated 4 years ago
- ☆95Jul 16, 2022Updated 3 years ago
- Source code to "SliTraNet: Automatic Detection of Slide Transitions in Lecture Videos using Convolutional Neural Networks"☆10Dec 17, 2023Updated 2 years ago
- ☆25Aug 18, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Script for downloading GitHub.☆13Sep 24, 2020Updated 5 years ago
- Python Research Framework☆107Nov 3, 2022Updated 3 years ago
- ☆78Dec 7, 2023Updated 2 years ago
- StyleGAN2 - Official TensorFlow Implementation☆12Jul 15, 2020Updated 5 years ago
- The OpenLH is a Liquid handling system based on an available robotic arm platform (uARM swift Pro) which allows for creative exploration …☆23Jun 20, 2024Updated last year
- A simple, minimalist writing theme for Typora☆15Jan 20, 2026Updated 3 months ago
- [CIKM 2023 Oral] This is the code repo for our CIKM‘23 paper "Text Matching Improves Sequential Recommendation by Reducing Popularity Bia…☆40Mar 17, 2024Updated 2 years ago
- A GPT-powered AI auto scraper for websites. AI Web Scraping made easy.☆14Jun 26, 2023Updated 2 years ago
- ☆1,650Apr 27, 2023Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- "FiD-ICL: A Fusion-in-Decoder Approach for Efficient In-Context Learning" (ACL 2023)☆15Jul 24, 2023Updated 2 years ago
- ☆28Nov 28, 2024Updated last year
- A PyTorch toolbox for domain adaptation, domain generalization, federated learning DA/DG, active learning DA/DG, ALDG and semi-supervised…☆11Jan 10, 2022Updated 4 years ago
- Implementation of Marge, Pre-training via Paraphrasing, in Pytorch☆76Jan 14, 2021Updated 5 years ago
- ☆12May 17, 2022Updated 3 years ago
- ZYN: Zero-Shot Reward Models with Yes-No Questions☆35Aug 15, 2023Updated 2 years ago
- [EMNLP 2025 Findings] Familiarity-aware Evidence Compression for Retrieval Augmented Generation☆15Aug 20, 2025Updated 8 months ago
- Evaluation tools shared across anserini, pyserini, and pygaggle☆36Apr 25, 2026Updated 2 weeks ago
- ☆19Mar 23, 2025Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Code for COLING22 paper, DPTDR: Deep Prompt Tuning for Dense Passage Retrieval☆26Aug 7, 2023Updated 2 years ago
- Script for downloading GitHub.☆99Jul 1, 2024Updated last year
- Web archiving utility library☆11Updated this week
- ## Step 1 - Scraping Complete your initial scraping using Jupyter Notebook, BeautifulSoup, Pandas, and Requests/Splinter. * Create a Ju…☆11Dec 22, 2021Updated 4 years ago
- Helper to use Plotly in SvelteKit☆18Jul 12, 2022Updated 3 years ago
- code for Preprint paper at Arxiv: MoT: Pre-thinking and Recalling Enable ChatGPT to Self-Improve with Memory-of-Thoughts☆24Nov 29, 2023Updated 2 years ago
- StyleGAN2 - Official TensorFlow Implementation☆25Sep 5, 2020Updated 5 years ago
- ::A tool to abbreviate scientific paper contents using ChatGPT::☆13Nov 20, 2023Updated 2 years ago
- Data and preprocessing scripts for SemEval 2022 Task 2: Multilingual Idiomaticity Detection and Sentence Embedding☆15Feb 3, 2022Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆11Mar 28, 2024Updated 2 years ago
- This repository contains generic information about open-source ventilator applications.☆21Jun 11, 2020Updated 5 years ago
- ML-powered news classifier: categorise articles via REST API. Built with scikit-learn, model comparison with VotingClassifier, and a REST…☆14Updated this week
- Zeobuilder is an extensible GUI-toolkit for molecular model construction.☆13Feb 15, 2019Updated 7 years ago
- ☆11Jun 21, 2024Updated last year
- Resources to get started building Neo4j Desktop Graph Apps☆20Apr 1, 2023Updated 3 years ago
- A dataset of alignment research and code to reproduce it☆78Jun 22, 2023Updated 2 years ago