noanabeshima / github-downloader
Script for downloading GitHub.
☆13Updated 5 years ago
Alternatives and similar repositories for github-downloader
Users who are interested in github-downloader are comparing it to the repositories listed below.
- Hugging Face and Pyserini interoperability☆19Updated 2 years ago
- ☆26Updated last year
- A client library for LAION's effort to filter CommonCrawl with CLIP, building a large scale image-text dataset.☆32Updated 2 years ago
- Stuff related to scraping the Code Review StackExchange☆12Updated 3 years ago
- This repository contains all the code for collecting large scale amounts of code from GitHub.☆110Updated 2 years ago
- Script for downloading GitHub.☆98Updated last year
- ☆23Updated last year
- The data and implementation for the experiments in the paper "Flows: Building Blocks of Reasoning and Collaborating AI".☆31Updated last year
- URL downloader supporting checkpointing and continuous checksumming.☆19Updated 2 years ago
- Code for "The Whole Truth and Nothing But the Truth: Faithful and Controllable Dialogue Response Generation with Dataflow Transduction an…☆11Updated last year
- Extracts iframes or keyframes from a video file, through the command line or from inside python.☆18Updated 3 years ago
- examples and guides to using Nomic Atlas☆37Updated 9 months ago
- ☆44Updated 3 years ago
- Downloads 2020 English Wikipedia articles as plaintext☆26Updated 2 years ago
- ☆44Updated last year
- Everything for the Paper: 'Evoke: Evoking Critical Thinking Abilities in LLMs via Reviewer-Author Prompt Editing'☆19Updated 2 years ago
- One stop shop for all things carp☆59Updated 3 years ago
- Create soft prompts for fairseq 13B dense, GPT-J-6B and GPT-Neo-2.7B for free in a Google Colab TPU instance☆28Updated 2 years ago
- Python tools for processing the stackexchange data dumps into a text dataset for Language Models☆86Updated 2 years ago
- Minetest is an open source voxel game engine with easy modding and game creation☆69Updated last year
- Intuitive graphical representation of source code☆14Updated 2 years ago
- Transformer-based approaches for an efficient docstrings generation on a piece of Python's code.☆17Updated this week
- Repository for opt-out requests.☆10Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMs☆12Updated 2 years ago
- Download, parse, and filter data from Court Listener, part of the FreeLaw projects. Data-ready for The-Pile.☆15Updated 2 years ago
- GenieNLP: A versatile codebase for any NLP task☆89Updated last year
- Neural search engine for discovering semantically similar Python repositories on GitHub☆29Updated last year
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he…☆31Updated 2 years ago
- FactNews is the first dataset to predict sentence-level factuality of news reporting. Furthermore, we provide baseline results for sentenc…☆11Updated 7 months ago
- YT_subtitles - extracts subtitles from YouTube videos to raw text for Language Model training☆45Updated 5 years ago