JaderDias / download-from-common-crawlView external linksLinks
☆24Mar 20, 2024Updated last year
Alternatives and similar repositories for download-from-common-crawl
Users that are interested in download-from-common-crawl are comparing it to the libraries listed below
Sorting:
- Applying Reinforcement Learning from Human Feedback to language models to teach them to write short story responses to writing prompts.☆14May 5, 2022Updated 3 years ago
- A News Article Collection Library☆22Mar 31, 2023Updated 2 years ago
- terminally online☆37Updated this week
- AIS is an evaluation framework for assessing whether the output of natural language models only contains information about the external w…☆31Jan 14, 2023Updated 3 years ago
- mReasoner is a unified computational implementation of the model theory of thinking and reasoning☆13Aug 17, 2023Updated 2 years ago
- ☆16Updated this week
- Containerfile for the Vanilla OS Desktop+Nvidia image.☆16Feb 5, 2026Updated last week
- AI Liquidity Management Agent☆13Jan 19, 2026Updated 3 weeks ago
- Security research organization dedicated to finding low hanging, critical, vulnerabilities.☆15May 12, 2022Updated 3 years ago
- ☆10Jul 6, 2023Updated 2 years ago
- ☆12Sep 27, 2024Updated last year
- private repository checkout action via github apps☆11Dec 28, 2022Updated 3 years ago
- Fake NEWS detector using LIAR dataset.☆11Aug 19, 2019Updated 6 years ago
- Discord Docsbot, Built on bgent☆11Jun 17, 2024Updated last year
- Code that drives the public web-based tools for the Media Cloud Online News Archive and Directory.☆11Updated this week
- Wikimedia Enterprise - client SDK in Python☆20Nov 11, 2025Updated 3 months ago
- Code and data for the Walert large language model-based chatbot☆12Aug 14, 2025Updated 6 months ago
- scrape web content into readable markdown for llms and human readers☆10Feb 19, 2024Updated last year
- C4RepSet: Representative Subset from C4 data for Training Pre-trained LMs☆11Jan 13, 2023Updated 3 years ago
- Simulated user for TREC 2016-2017 Dynamic Domain track☆10Dec 27, 2017Updated 8 years ago
- Python wrapper around Yossi Rubner's Earth Mover's Distance implementation (http://ai.stanford.edu/~rubner/emd/default.htm)☆22Jul 9, 2015Updated 10 years ago
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification☆11Aug 12, 2023Updated 2 years ago
- An Easy Annotation Tool for Natural Language Processing☆11May 17, 2024Updated last year
- ☆14May 6, 2018Updated 7 years ago
- parse_mediawiki_dump clone☆11Mar 22, 2025Updated 10 months ago
- Headless agent for test driven relevancy with Quepid.com☆11Mar 6, 2024Updated last year
- Blazing fast signature detection☆11Sep 5, 2022Updated 3 years ago
- ☆11May 6, 2025Updated 9 months ago
- ☆10Apr 12, 2024Updated last year
- Poetry Corpora Annotated on Aesthetic Emotions☆12Aug 2, 2022Updated 3 years ago
- Feature Selection using Simulated Annealing☆11Aug 10, 2022Updated 3 years ago
- Via Text Density Simple Web Crawler With Go☆13Mar 19, 2023Updated 2 years ago
- R library for common information retrieval metrics☆14Jun 5, 2023Updated 2 years ago
- A python implementation of discrete optimal transport with a Tsallis entropy regularization.☆14Oct 23, 2023Updated 2 years ago
- GitHub Action to approve pull requests securely☆12Updated this week
- an experimental implementation of Burrow's delta in Python 3☆12Jun 6, 2017Updated 8 years ago
- Associated blog post - https://tristanrhodes.com/blog/Adventures-in-Algorithmic-Trading-on-the-Runescape-Grand-Exchange☆10Oct 14, 2024Updated last year
- Generate Software Bill of Materials for R Things☆19Feb 9, 2024Updated 2 years ago
- Run greatexpectations.io on ANY SQL Engine using REST API. Supported by FastAPI, Pydantic and SQLAlchemy as best data quality tool☆14Dec 12, 2025Updated 2 months ago