Dump the text of the Gigaword dataset into a single file, for use with language modeling (and other!) toolkits
☆23Sep 23, 2017Updated 8 years ago
Alternatives and similar repositories for flatten_gigaword
Users that are interested in flatten_gigaword are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This repo contains the code and results for reproducing the results in the paper: A SIMPLE BUT TOUGH-TO-BEAT BASELINE FOR SENTENCE EMBEDD…☆12Jul 13, 2018Updated 7 years ago
- jiant-dev☆29Dec 17, 2020Updated 5 years ago
- Tools for robustness evaluation in interpretability methods☆10Jun 25, 2021Updated 4 years ago
- Boilerplate Electron Application with Handlebars.js/Material Design CSS☆13Dec 11, 2015Updated 10 years ago
- Tinker is a parallel-by-default File/Directory Management System with additional interface to NLP and ML libraries☆10Jul 21, 2017Updated 8 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- 📄🕸️ Generalizing Cross-Document Event Coreference Resolution Across Multiple Corpora☆10May 25, 2022Updated 4 years ago
- Semantic Parser with Execution☆13Dec 8, 2017Updated 8 years ago
- Don't just regulate gradients like in Muon, regulate the weights too☆32Jul 30, 2025Updated 10 months ago
- Slides for my intro to deep reinforcement learning at Imperial College☆17Apr 8, 2018Updated 8 years ago
- Classification Benchmarks for Under-resourced Bengali Language based on Multichannel Convolutional-LSTM Network☆20Jul 26, 2021Updated 4 years ago
- Resources for "Simple Speech Representation Learning from Perceptual Data".☆11Sep 18, 2023Updated 2 years ago
- Complete set of English dialect transformation rules and evaluation code☆17Jun 7, 2024Updated 2 years ago
- Code and Data for the ACL21 paper "Modeling Bilingual Conversational Characteristics for Neural Chat Translation"☆12Dec 17, 2021Updated 4 years ago
- ☆20Nov 27, 2025Updated 6 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ner using crf++☆10Mar 24, 2015Updated 11 years ago
- Python code and data for the post "Word Segmentation, or Makingsenseofthis"☆17Oct 24, 2022Updated 3 years ago
- Guide for the slp group on how to use the Grnet cluster☆11Apr 16, 2020Updated 6 years ago
- ☆13Jun 3, 2019Updated 7 years ago
- Local lightning-fast semantic code search built for agents☆42Mar 16, 2026Updated 2 months ago
- Heuristic Analysis for NLI Systems☆131Jan 27, 2021Updated 5 years ago
- Example agents I've built using the LiveKit Agents (https://github.com/livekit/agents) framework☆20May 11, 2024Updated 2 years ago
- 结合上下文和篇章特征的多标签情绪分类☆28Aug 19, 2016Updated 9 years ago
- Code to reproduce experiments appearing in the academic paper Lost Relatives of the Gumbel Trick☆17Jun 14, 2017Updated 8 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Word-level language identification for Bangla-English code-mixed social media data, using a BiLSTM with subword embeddings.☆10Aug 13, 2023Updated 2 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- ☆17Jun 1, 2026Updated last week
- Library for implementing RNNs with Theano☆11Mar 26, 2015Updated 11 years ago
- Implement Overcoming the Lack of Parallel Data in Sentence Compression Katja Filippova and Yasemin Altun Google☆14Nov 22, 2016Updated 9 years ago
- Using YouTube to prepare a speech recognition dataset for any language☆10Mar 30, 2021Updated 5 years ago
- PyTorch speech2text inference script for the NVidia openseq2seq wav2letter model variant☆10Aug 12, 2019Updated 6 years ago
- Hierarchical Condition Category (HCC) Risk Models from the Centers for Medicare and Medicaid Services (CMS) and the Department of Health …☆17May 7, 2017Updated 9 years ago
- Fixes the rotation of the images based on EXIF data☆15Apr 6, 2026Updated 2 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆20Jun 6, 2018Updated 8 years ago
- Yet another web-based presentation library☆17Jul 5, 2019Updated 6 years ago
- A source plugin for Gatsby to source Github data from its GraphQL API for static builds☆18Aug 17, 2021Updated 4 years ago
- Jupyter extension to visualize dependency structures☆28Apr 19, 2018Updated 8 years ago
- Code for ACL 2018 paper 'Think Visually: Question Answering through Virtual Imagery'☆13Mar 24, 2023Updated 3 years ago
- Implementation for Decision-focused Summarization (EMNLP2021)☆12Mar 14, 2022Updated 4 years ago
- A framework for Lexical Simplification.☆14Mar 27, 2018Updated 8 years ago