noanabeshima / github-downloaderLinks
Script for downloading GitHub.
☆13Updated 5 years ago
Alternatives and similar repositories for github-downloader
Users that are interested in github-downloader are comparing it to the libraries listed below
Sorting:
- Hugging Face and Pyserini interoperability☆19Updated 2 years ago
- FactNews is the first dataset to predict sentence-level factuality of news reporting. Furthemore, we provide baseline results for sentenc…☆10Updated 6 months ago
- ☆26Updated last year
- Stuff related to scraping the Code Review StackExchange☆12Updated 2 years ago
- GenieNLP: A versatile codebase for any NLP task☆88Updated last year
- Everything for the Paper: 'Evoke: Evoking Critical Thinking Abilities in LLMs via Reviewer-Author Prompt Editing'☆19Updated 2 years ago
- examples and guides to using Nomic Atlas☆37Updated 8 months ago
- Code for "The Whole Truth and Nothing But the Truth: Faithful and Controllable Dialogue Response Generation with Dataflow Transduction an…☆11Updated last year
- Downloads 2020 English Wikipedia articles as plaintext☆25Updated 2 years ago
- URL downloader supporting checkpointing and continuous checksumming.☆19Updated 2 years ago
- This repository contains all the code for collecting large scale amounts of code from GitHub.☆110Updated 2 years ago
- ☆14Updated 2 years ago
- Script for downloading GitHub.☆97Updated last year
- ☆23Updated 10 months ago
- The data and implementation for the experiments in the paper "Flows: Building Blocks of Reasoning and Collaborating AI".☆31Updated last year
- A forest of autonomous agents.☆19Updated 10 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆11Updated 2 years ago
- Download, parse, and filter data from Court Listener, part of the FreeLaw projects. Data-ready for The-Pile.☆15Updated 2 years ago
- YT_subtitles - extracts subtitles from YouTube videos to raw text for Language Model training☆45Updated 5 years ago
- A swarm of LLM agents that will help you test, document, and productionize your code!☆17Updated 2 weeks ago
- Code for constructing TLDR corpus from Reddit dataset☆27Updated 4 years ago
- Intuitive graphical representation of source code☆13Updated 2 years ago
- A client library for LAION's effort to filter CommonCrawl with CLIP, building a large scale image-text dataset.☆32Updated 2 years ago
- GPU Environment Management for Visual Studio Code☆39Updated 2 years ago
- ☆32Updated 2 years ago
- ☆27Updated 2 weeks ago
- ☆44Updated last year
- ☆92Updated 3 years ago
- Contains the model patches and the eval logs from the passing swe-bench-lite run.☆10Updated last year
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset☆23Updated last year