This application shuffles the input file lines skipping (optionaly) the header. It's optimized for files bigger than available RAM.
☆25Jan 9, 2017Updated 9 years ago
Alternatives and similar repositories for shuf-t
Users that are interested in shuf-t are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Line shuffler for huge text file which does not fit in memory☆13Dec 1, 2022Updated 3 years ago
- Experimental collections library☆14Mar 27, 2019Updated 7 years ago
- a pythonic dplyr clone☆17Jul 15, 2025Updated 9 months ago
- Jupyter wire protocol implementation enabling D plugins to be jupyter kernels☆13Apr 9, 2021Updated 5 years ago
- A comparison of Nim's performance against the "Faster Command Line Tools in D" blog post found here: http://dlang.org/blog/2017/05/24/fas…☆14Mar 31, 2018Updated 8 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Terminal tool that converts files encoding to UTF-8☆10Oct 5, 2019Updated 6 years ago
- Feature Decay Algorithms☆11Mar 5, 2014Updated 12 years ago
- Data Analytics Library for Python☆17Mar 24, 2026Updated last month
- [WWW 2026] 🕸 GlotWeb: Web Indexing for Minority Languages☆17Apr 14, 2026Updated 2 weeks ago
- Helsinki Neural Machine Translation system☆28Nov 8, 2020Updated 5 years ago
- Randomly sample lines from massive text files efficiently☆17Apr 1, 2015Updated 11 years ago
- ☆15Oct 4, 2024Updated last year
- Hadoop-based tool for extraction of large scale synchronous grammars for paraphrasing and machine translation☆15Dec 2, 2016Updated 9 years ago
- Automagically ignore all notifications related to work when you are on vacations☆21Aug 21, 2020Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Tutorial on running keras model in C++ and python tensorflow☆11Oct 30, 2018Updated 7 years ago
- record power consumption on thinkpads and create a gnuplot graph☆10May 8, 2019Updated 6 years ago
- http://static.googleusercontent.com/external_content/untrusted_dlcp/research.google.com/en//pubs/archive/36266.pdf☆14Apr 25, 2012Updated 14 years ago
- Poor man's simple harvester for arXiv resources☆14Jul 14, 2023Updated 2 years ago
- Code for "On Long-Tailed Phenomena in NMT".☆10Jan 10, 2021Updated 5 years ago
- GC4LM: A Colossal (Biased) language model for German☆13May 2, 2021Updated 4 years ago
- A script for rapidly sampling a proportion of lines from a file☆19Feb 5, 2026Updated 2 months ago
- ACL style for Typst☆21Jan 27, 2026Updated 3 months ago
- Produce a sample of lines from files.☆19Jul 2, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Gale&Church (1993) sentence alignment☆16May 9, 2020Updated 5 years ago
- An utility to randomize and split really huge (100+ GB) text files☆21Dec 18, 2016Updated 9 years ago
- Extension for pie to include taggers with their models and pre/postprocessors☆11May 30, 2024Updated last year
- D port of meta tic-tac-toe game written for the GNU assembler☆24Nov 22, 2018Updated 7 years ago
- Deep learning model of machine translation using attentional and structural biases☆13Jul 21, 2017Updated 8 years ago
- Resources for our IJCAI 2020 paper, TopicKA: Generating Commonsense Knowledge-Aware Dialogue Responses Towards the Recommended Topic Fact☆12Nov 30, 2020Updated 5 years ago
- Fast binary matrix product on CPU☆10Feb 11, 2016Updated 10 years ago
- Online material and code base for the article Coordinates and Intervals in Graph Based Reference Genomes☆11May 2, 2017Updated 8 years ago
- dlang pretty printers for GDB & LLDB for various standard types☆22Dec 24, 2025Updated 4 months ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- ☆11Dec 14, 2016Updated 9 years ago
- UltraFast GPU Grammar eXtractor for Machine Translation (He et al., TACL 2015 & NAACL 2013)☆12Jun 19, 2015Updated 10 years ago
- Resources for the Semeval 2016 Task 3 Community Question Answering. Contains word embeddings and system description results☆10Jan 13, 2017Updated 9 years ago
- Torch implementation of the Collobert's SENNA system for NER.☆13Jun 27, 2016Updated 9 years ago
- Code for the paper Faster Phrase-Based Decoding by Refining Feature State☆14Jan 9, 2023Updated 3 years ago
- Bilingual sentence aligner (Gale & Church, 1993)☆14Jan 8, 2026Updated 3 months ago
- Robust Principal Component Analysis☆10Apr 1, 2014Updated 12 years ago