Dump the text of the Gigaword dataset into a single file, for use with language modeling (and other!) toolkits
☆23 · Sep 23, 2017 · Updated 8 years ago
Alternatives and similar repositories for flatten_gigaword
Users that are interested in flatten_gigaword are comparing it to the libraries listed below.
- Language model evaluation for morality and causality ☆20 · Nov 14, 2023 · Updated 2 years ago
- ☆13 · Jul 8, 2020 · Updated 5 years ago
- jiant-dev ☆28 · Dec 17, 2020 · Updated 5 years ago
- Fastened CROWN: Tightened Neural Network Robustness Certificates ☆10 · Feb 10, 2020 · Updated 6 years ago
- Tinker is a parallel-by-default File/Directory Management System with additional interface to NLP and ML libraries ☆10 · Jul 21, 2017 · Updated 8 years ago
- 📄🕸️ Generalizing Cross-Document Event Coreference Resolution Across Multiple Corpora ☆10 · May 25, 2022 · Updated 3 years ago
- Repository for AAAI 2018 paper "Using Syntax for Referring Expression Recognition" ☆13 · Oct 7, 2020 · Updated 5 years ago
- Don't just regulate gradients like in Muon, regulate the weights too ☆32 · Jul 30, 2025 · Updated 8 months ago
- Slides for my intro to deep reinforcement learning at Imperial College ☆17 · Apr 8, 2018 · Updated 8 years ago
- A web app for sharing, editing, and commenting on kifus (game records for the board game Go) ☆10 · Jan 22, 2019 · Updated 7 years ago
- Echo Noise Channel for Exact Mutual Information Calculation ☆17 · Jul 17, 2020 · Updated 5 years ago
- Training an n-gram based Language Model using KenLM toolkit for Deep Speech 2 ☆116 · May 20, 2019 · Updated 6 years ago
- Classification Benchmarks for Under-resourced Bengali Language based on Multichannel Convolutional-LSTM Network ☆20 · Jul 26, 2021 · Updated 4 years ago
- Resources for "Simple Speech Representation Learning from Perceptual Data". ☆11 · Sep 18, 2023 · Updated 2 years ago
- Example external repository for interacting with armory. ☆11 · May 2, 2022 · Updated 3 years ago
- The repository for the paper "Predicting in-hospital mortality by combining clinical notes with time-series data" ☆12 · May 23, 2021 · Updated 4 years ago
- Zero-shot Transfer Learning from English to Arabic ☆30 · Jun 22, 2022 · Updated 3 years ago
- ☆14 · Feb 3, 2021 · Updated 5 years ago
- ner using crf++ ☆10 · Mar 24, 2015 · Updated 11 years ago
- Python code and data for the post "Word Segmentation, or Makingsenseofthis" ☆17 · Oct 24, 2022 · Updated 3 years ago
- IndoNLI ☆19 · Dec 4, 2021 · Updated 4 years ago
- A remote Scala code evaluator ☆14 · May 16, 2023 · Updated 2 years ago
- Guide for the slp group on how to use the Grnet cluster ☆11 · Apr 16, 2020 · Updated 5 years ago
- ☆13 · Jun 3, 2019 · Updated 6 years ago
- Heuristic Analysis for NLI Systems ☆130 · Jan 27, 2021 · Updated 5 years ago
- Local lightning-fast semantic code search built for agents ☆41 · Mar 16, 2026 · Updated 3 weeks ago
- Example agents I've built using the LiveKit Agents (https://github.com/livekit/agents) framework ☆20 · May 11, 2024 · Updated last year
- ☆15 · Oct 4, 2024 · Updated last year
- Code to reproduce experiments appearing in the academic paper Lost Relatives of the Gumbel Trick ☆17 · Jun 14, 2017 · Updated 8 years ago
- A library of speech gadgets. ☆14 · Oct 15, 2022 · Updated 3 years ago
- ☆17 · Updated this week
- Library for implementing RNNs with Theano ☆11 · Mar 26, 2015 · Updated 11 years ago
- DeNSe parser in Dependency Parsing as Head Selection (EACL 2017) https://arxiv.org/abs/1606.01280 ☆25 · Apr 27, 2017 · Updated 8 years ago
- Recurrent versus Recursive Approaches Towards Compositionality in Semantic Vector Spaces. ☆13 · Sep 22, 2021 · Updated 4 years ago
- LockManager with deadlock detection for implementing 2PL ☆13 · Mar 13, 2019 · Updated 7 years ago
- Using YouTube to prepare a speech recognition dataset for any language ☆10 · Mar 30, 2021 · Updated 5 years ago
- PyTorch speech2text inference script for the NVidia openseq2seq wav2letter model variant ☆10 · Aug 12, 2019 · Updated 6 years ago
- ☆20 · Jun 6, 2018 · Updated 7 years ago
- Fixes the rotation of the images based on EXIF data ☆15 · Updated this week