A BiRNN framework implemented in Python and TensorFlow to extract parallel sentences from aligned comparable corpora.
☆33Sep 4, 2018Updated 7 years ago
Alternatives and similar repositories for parSentExtract
Users that are interested in parSentExtract are comparing it to the libraries listed below
Sorting:
- Code for our paper in ACL 2017☆13Dec 14, 2017Updated 8 years ago
- Code from http://www.ark.cs.cmu.edu/mheilman/questions/☆12Apr 23, 2013Updated 12 years ago
- Recipes for training OpenNMT systems☆14Jul 26, 2017Updated 8 years ago
- Distributed infrastructure for Machine Translation web services (using Moses, Python, JSON-RPC/web interface)☆34Jan 26, 2022Updated 4 years ago
- ☆19Mar 10, 2025Updated 11 months ago
- Three Pass Regression Filter for R☆16Aug 25, 2015Updated 10 years ago
- Hwyluso cyfieithu peirianyddol MosesSMT i'r Gymraeg // Making MosesSMT machine translation easier for Welsh (and other languages)☆16Aug 25, 2021Updated 4 years ago
- ☆21Dec 9, 2016Updated 9 years ago
- Machine-Translation-based sentence alignment tool for parallel text☆315Mar 18, 2021Updated 4 years ago
- A dataset of sentences with ordinal labels for grammaticality☆29Jun 9, 2014Updated 11 years ago
- This is the text partitioner project for Python.☆21Dec 11, 2018Updated 7 years ago
- 📄Neural Sentential Paraphrase Generation to Augment Chatbot Training Dataset☆21Dec 7, 2022Updated 3 years ago
- Text simplification using RNNs☆55Mar 31, 2016Updated 9 years ago
- JFLEG (JHU FLuency-Extended GUG) corpus for Grammatical Error Correction Evaluation☆114Jun 11, 2023Updated 2 years ago
- Financial Analysis and Algorithmic Trading Strategies in Python☆11Feb 16, 2023Updated 3 years ago
- Tools for extracting parallel corpora from article titles across languages in Wikipedia☆74Feb 25, 2015Updated 11 years ago
- End-to-end integration of HuggingFace's models for sequence labeling.☆11Oct 4, 2020Updated 5 years ago
- Experimenting with GANs in Tensorflow/Keras☆10Jan 13, 2022Updated 4 years ago
- ☆12Apr 26, 2020Updated 5 years ago
- Open-source implementation of the BilBOWA (Bilingual Bag-of-Words without Alignments) word embedding model.☆69Jul 28, 2021Updated 4 years ago
- Gale&Church (1993) sentence alignment☆16May 9, 2020Updated 5 years ago
- Syllabification and stress detection for Spanish☆12Oct 6, 2024Updated last year
- Emotion Recognition☆10Oct 22, 2017Updated 8 years ago
- A Python script to delete all comment and submission data from a given Reddit account.☆11Jan 5, 2021Updated 5 years ago
- Indonesian Treebank☆35Jul 5, 2022Updated 3 years ago
- A word alignment tool based on famous GIZA++, extended to support multi-threading, resume training and incremental training.☆166May 12, 2021Updated 4 years ago
- Vi-like editing for Wolfram Notebooks☆11May 15, 2023Updated 2 years ago
- Image batch datasets for training and testing ag-net to recognize crop from Landsat imagery☆10Aug 29, 2019Updated 6 years ago
- Code for Harmonic Exponential Families on Manifolds☆10Jun 2, 2016Updated 9 years ago
- ☆21May 28, 2024Updated last year
- ☆11May 6, 2016Updated 9 years ago
- Lazy python recipes.☆10Apr 17, 2021Updated 4 years ago
- solutions to adventofcode.com 2019 in go, python, rust, ocaml☆11Sep 14, 2023Updated 2 years ago
- HOL Guidebook☆12Oct 11, 2024Updated last year
- A Simple Flask App to interact with your Machine Translation Model☆13Feb 26, 2020Updated 6 years ago
- A simple starting configuration for provisioning vagrant with ansible; uses postgres, rvm, ruby 1.9.2☆30May 16, 2013Updated 12 years ago
- Scala port of the word2vec toolkit.☆11Aug 15, 2016Updated 9 years ago
- Match tokenized words and phrases within the original, untokenized, often messy, text.☆19Apr 11, 2023Updated 2 years ago
- With one whole audio and corresponding text, the audio can be split line by line and saved with exact sentence using comparison with the …☆10Feb 28, 2019Updated 7 years ago