Code for constructing TLDR corpus from Reddit dataset
☆27Nov 23, 2021Updated 4 years ago
Alternatives and similar repositories for webis-tldr-17-corpus
Users who are interested in webis-tldr-17-corpus are comparing it to the libraries listed below.
- ☆15Nov 20, 2025Updated 7 months ago
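- A repository for collecting and classifying Japanese-language literature on grammatical error correction☆13Apr 17, 2025Updated last year
- A furigana corpus dataset created from Aozora Bunko and Sapie braille data☆22Jan 17, 2024Updated 2 years ago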
- ☆18Jun 21, 2024Updated 2 years ago
- A Symfony 4 & 5 bundle that provides some common parts of web-based tools running on Wikimedia's Toolforge. Maintained by the Wikimedia F…☆15Jan 18, 2025Updated last year
- A simple open source web indexer + search engine☆18Aug 25, 2018Updated 7 years ago
- A tool for extracting plain text from Wikipedia dumps☆15Oct 3, 2019Updated 6 years ago
- The Python 3 library for Multi-Criteria Decision Analysis.☆12Jun 22, 2026Updated last week
- Easy & Pretrained SOTA Deep Learning for RNA strings☆12Apr 15, 2022Updated 4 years ago
- Easier error handling for Golang☆10Aug 17, 2022Updated 3 years ago
- Targeted language identifier, based on FastText and Hunspell.☆38Sep 4, 2025Updated 9 months ago
- Evaluating major LLMs on the Abstraction and Reasoning Corpus☆17Nov 9, 2023Updated 2 years ago
- EMNLP BlackBox NLP 2020: Searching for a Search Method: Benchmarking Search Algorithms for Generating NLP Adversarial Examples☆26Oct 11, 2020Updated 5 years ago
- The Concept Bottleneck Shift Detection (CBSD) methods for explaining and detecting various dataset shifts.☆14Jun 22, 2021Updated 5 years ago
- Visual Hash for matching copies of visually similar images.☆16Mar 17, 2025Updated last year
- Companies' House API☆11Oct 22, 2021Updated 4 years ago
- Combining encoder-based language models☆11Nov 11, 2021Updated 4 years ago
- Scrape financial terms from Investopedia☆12Sep 7, 2018Updated 7 years ago
- ☆13Sep 8, 2021Updated 4 years ago
- Bruno is a speech-to-text-to-AI tool designed to facilitate collaborative learning in groupwork. The prototype application listens to hum…☆20Jun 10, 2024Updated 2 years ago
- simple kv store for streams☆36Mar 14, 2013Updated 13 years ago
- Indexing project where we index a portion of the web using spark, hadoop and cassandra.☆22Oct 30, 2019Updated 6 years ago
- A variational auto-encoder (VAE) framework with a new type of prior, the "Variational Mixture of Posteriors" prior, or VampPrior for short.☆10Apr 7, 2021Updated 5 years ago
- Resources for Tutorial on Neuro-Symbolic Representations for IR☆15Jul 23, 2023Updated 2 years ago
- An implementation of the Equivariant Graph Neural Network (EGNN) layer type for DGL-PyTorch.☆15Dec 27, 2022Updated 3 years ago
- statically generated weekly digest of articles read in Pocket☆10May 14, 2019Updated 7 years ago
- A nuxt module to expose Vuex state in the browser URL for easy sharing☆12Aug 28, 2017Updated 8 years ago
- An AI assistant for open-source communities☆19Jul 17, 2023Updated 2 years ago
- A stylesheet based on Richard Rutter's book Web Typography.☆10Dec 6, 2018Updated 7 years ago
- Results for intent classification benchmark (Botfuel, DialogFlow, Luis, Watson, RASA, Recast, Snips)☆11Jun 1, 2018Updated 8 years ago
- "Moshpit SGD: Communication-Efficient Decentralized Training on Heterogeneous Unreliable Devices", official implementation☆30Feb 4, 2025Updated last year
- ☆15Oct 4, 2024Updated last year
- ☆16Nov 25, 2024Updated last year
- ☆15Sep 30, 2022Updated 3 years ago
- Character-level conversion between Hebrew text and Latin transliteration using deep learning - a demonstration of seq2seq training.☆16Jun 27, 2023Updated 3 years ago
- Library for fast text representation and classification.☆10Apr 17, 2022Updated 4 years ago
- ☆12Apr 29, 2022Updated 4 years ago
- [WWW 2026] 🕸 GlotWeb: Web Indexing for Minority Languages☆17Apr 14, 2026Updated 2 months ago
- A menu and CLI based console program to play and write songs for the PC Speaker☆15Aug 1, 2019Updated 6 years ago