Dump the text of the Gigaword dataset into a single file, for use with language modeling (and other!) toolkits
☆23Sep 23, 2017Updated 8 years ago
Alternatives and similar repositories for flatten_gigaword
Users that are interested in flatten_gigaword are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Language model evaluation for morality and causality☆20Nov 14, 2023Updated 2 years ago
- ☆13Jul 8, 2020Updated 5 years ago
- jiant-dev☆29Dec 17, 2020Updated 5 years ago
- Tools for robustness evaluation in interpretability methods☆10Jun 25, 2021Updated 5 years ago
- 📄🕸️ Generalizing Cross-Document Event Coreference Resolution Across Multiple Corpora☆10May 25, 2022Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Repository for AAAI 2018 paper "Using Syntax for Referring Expression Recognition"☆13Oct 7, 2020Updated 5 years ago
- A graphical editor for directed graphs used for Abstract Meaning Representation (AMR)☆13May 9, 2026Updated last month
- Semantic Parser with Execution☆13Dec 8, 2017Updated 8 years ago
- Don't just regulate gradients like in Muon, regulate the weights too☆32Jul 30, 2025Updated 10 months ago
- Slides for my intro to deep reinforcement learning at Imperial College☆17Apr 8, 2018Updated 8 years ago
- An unofficial re-implementation of Graph Structure of Neural Networks (Jiaxuan You · Kaiming He · Jure Leskovec · Saining Xie) ICML 2020☆10Jul 27, 2020Updated 5 years ago
- A web app for sharing, editing, and commenting on kifus (game records for the board game Go)☆10Jan 22, 2019Updated 7 years ago
- Echo Noise Channel for Exact Mutual Information Calculation☆17Jul 17, 2020Updated 5 years ago
- Training an n-gram based Language Model using KenLM toolkit for Deep Speech 2☆116May 20, 2019Updated 7 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Resources for "Simple Speech Representation Learning from Perceptual Data".☆11Sep 18, 2023Updated 2 years ago
- Complete set of English dialect transformation rules and evaluation code☆17Jun 7, 2024Updated 2 years ago
- Code and Data for the ACL21 paper "Modeling Bilingual Conversational Characteristics for Neural Chat Translation"☆12Dec 17, 2021Updated 4 years ago
- ☆20Nov 27, 2025Updated 7 months ago
- ☆14Feb 3, 2021Updated 5 years ago
- ner using crf++☆10Mar 24, 2015Updated 11 years ago
- Python code and data for the post "Word Segmentation, or Makingsenseofthis"☆16Oct 24, 2022Updated 3 years ago
- Guide for the slp group on how to use the Grnet cluster☆11Apr 16, 2020Updated 6 years ago
- ☆13Jun 3, 2019Updated 7 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Heuristic Analysis for NLI Systems☆132Jan 27, 2021Updated 5 years ago
- Example agents I've built using the LiveKit Agents (https://github.com/livekit/agents) framework☆20May 11, 2024Updated 2 years ago
- Word-level language identification for Bangla-English code-mixed social media data, using a BiLSTM with subword embeddings.☆10Aug 13, 2023Updated 2 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- Library for implementing RNNs with Theano☆11Mar 26, 2015Updated 11 years ago
- DeNSe parser in Dependency Parsing as Head Selection (EACL 2017) https://arxiv.org/abs/1606.01280☆25Apr 27, 2017Updated 9 years ago
- Recurrent versus Recursive Approaches Towards Compositionality in Semantic Vector Spaces.☆13Sep 22, 2021Updated 4 years ago
- LockManager with deadlock detection for implementing 2PL☆13Mar 13, 2019Updated 7 years ago
- Implement Overcoming the Lack of Parallel Data in Sentence Compression Katja Filippova and Yasemin Altun Google☆14Nov 22, 2016Updated 9 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- PyTorch speech2text inference script for the NVidia openseq2seq wav2letter model variant☆10Aug 12, 2019Updated 6 years ago
- Hierarchical Condition Category (HCC) Risk Models from the Centers for Medicare and Medicaid Services (CMS) and the Department of Health …☆17May 7, 2017Updated 9 years ago
- Fixes the rotation of the images based on EXIF data☆15Apr 6, 2026Updated 2 months ago
- ☆20Jun 6, 2018Updated 8 years ago
- Yet another web-based presentation library☆17Jul 5, 2019Updated 6 years ago
- RDrop 的 torch版☆16Jul 15, 2021Updated 4 years ago
- A source plugin for Gatsby to source Github data from its GraphQL API for static builds☆18Aug 17, 2021Updated 4 years ago