Dump the text of the Gigaword dataset into a single file, for use with language modeling (and other!) toolkits
☆23Sep 23, 2017Updated 8 years ago
Alternatives and similar repositories for flatten_gigaword
Users that are interested in flatten_gigaword are comparing it to the libraries listed below
Sorting:
- Language model evaluation for morality and causality☆19Nov 14, 2023Updated 2 years ago
- This repo contains the code and results for reproducing the results in the paper: A SIMPLE BUT TOUGH-TO-BEAT BASELINE FOR SENTENCE EMBEDD…☆12Jul 13, 2018Updated 7 years ago
- 通过阅读论文Attention is all you need来复现Transformer模型☆12Aug 5, 2019Updated 6 years ago
- ☆13Jul 8, 2020Updated 5 years ago
- jiant-dev☆28Dec 17, 2020Updated 5 years ago
- Boilerplate Electron Application with Handlebars.js/Material Design CSS☆13Dec 11, 2015Updated 10 years ago
- Tinker is a parallel-by-default File/Directory Management System with additional interface to NLP and ML libraries☆10Jul 21, 2017Updated 8 years ago
- 📄🕸️ Generalizing Cross-Document Event Coreference Resolution Across Multiple Corpora☆10May 25, 2022Updated 3 years ago
- Repository for AAAI 2018 paper "Using Syntax for Referring Expression Recognition"☆13Oct 7, 2020Updated 5 years ago
- A graphical editor for directed graphs used for Abstract Meaning Representation (AMR)☆13Feb 25, 2026Updated 3 weeks ago
- Semantic Parser with Execution☆13Dec 8, 2017Updated 8 years ago
- Don't just regulate gradients like in Muon, regulate the weights too☆31Jul 30, 2025Updated 7 months ago
- An unofficial re-implementation of Graph Structure of Neural Networks (Jiaxuan You · Kaiming He · Jure Leskovec · Saining Xie) ICML 2020☆10Jul 27, 2020Updated 5 years ago
- A web app for sharing, editing, and commenting on kifus (game records for the board game Go)☆10Jan 22, 2019Updated 7 years ago
- Solving CartPole-v1 environment in Keras with Actor Critic algorithm an Deep Reinforcement Learning algorithm☆12May 19, 2020Updated 5 years ago
- Training an n-gram based Language Model using KenLM toolkit for Deep Speech 2☆116May 20, 2019Updated 6 years ago
- Classification Benchmarks for Under-resourced Bengali Language based on Multichannel Convolutional-LSTM Network☆20Jul 26, 2021Updated 4 years ago
- Resources for "Simple Speech Representation Learning from Perceptual Data".☆11Sep 18, 2023Updated 2 years ago
- Complete set of English dialect transformation rules and evaluation code☆16Jun 7, 2024Updated last year
- ☆12Feb 22, 2021Updated 5 years ago
- Code and Data for the ACL21 paper "Modeling Bilingual Conversational Characteristics for Neural Chat Translation"☆12Dec 17, 2021Updated 4 years ago
- Zero-shot Transfer Learning from English to Arabic☆30Jun 22, 2022Updated 3 years ago
- ☆14Feb 3, 2021Updated 5 years ago
- Python code and data for the post "Word Segmentation, or Makingsenseofthis"☆17Oct 24, 2022Updated 3 years ago
- ner using crf++☆10Mar 24, 2015Updated 10 years ago
- A remote Scala code evaluator☆14May 16, 2023Updated 2 years ago
- Guide for the slp group on how to use the Grnet cluster☆11Apr 16, 2020Updated 5 years ago
- Heuristic Analysis for NLI Systems☆130Jan 27, 2021Updated 5 years ago
- Example agents I've built using the LiveKit Agents (https://github.com/livekit/agents) framework☆20May 11, 2024Updated last year
- Local lightning-fast semantic code search built for agents☆39Updated this week
- Code to reproduce experiments appearing in the academic paper Lost Relatives of the Gumbel Trick☆17Jun 14, 2017Updated 8 years ago
- ☆15Oct 4, 2024Updated last year
- A library of speech gadgets.☆14Oct 15, 2022Updated 3 years ago
- This project collects methods that enhance the comparison between AMR graphs.☆18Jun 15, 2023Updated 2 years ago
- Library for implementing RNNs with Theano☆11Mar 26, 2015Updated 10 years ago
- Recurrent versus Recursive Approaches Towards Compositionality in Semantic Vector Spaces.☆13Sep 22, 2021Updated 4 years ago
- LockManager with deadlock detection for implementing 2PL☆13Mar 13, 2019Updated 7 years ago
- Implement Overcoming the Lack of Parallel Data in Sentence Compression Katja Filippova and Yasemin Altun Google☆14Nov 22, 2016Updated 9 years ago
- Using YouTube to prepare a speech recognition dataset for any language☆10Mar 30, 2021Updated 4 years ago