luca-ant / WhatsSeeLinks
A simple and humble image captioning application, based on a neural network built with Keras
☆10Updated 3 years ago
Alternatives and similar repositories for WhatsSee
Users that are interested in WhatsSee are comparing it to the libraries listed below
Sorting:
- Dialect identification using Siamese network☆15Updated 8 years ago
- Language Model Fine-tuning for Moby Dick☆42Updated 6 years ago
- A Cross-Domain Transferable Neural Coherence Model https://arxiv.org/abs/1905.11912☆24Updated 5 years ago
- Natural Language Generation by Hierarchical Decoding with Linguistic Patterns (NAACL-HLT 2018), Investigating Linguistic Pattern Ordering…☆32Updated 7 years ago
- Sound augmentation using Large-scale audio dataset (Audioset)☆45Updated 4 years ago
- Unsupervised specificity-guided optimization of Image Captioning models to encourage meaningful diversity in the generated captions. Code…☆13Updated 6 months ago
- Pun-GAN: Generative Adversarial Network for Pun Generation (EMNLP 2019)☆42Updated 6 years ago
- explores Chinese language models with sub-character level visual information☆16Updated 7 years ago
- Code for NAACL 2018 paper "Parsing Speech: A Neural Approach to Integrating Lexical and Acoustic-Prosodic Information"☆12Updated 8 years ago
- A Tensorflow LSTM spam detector utilizing GloVe word embeddings.☆12Updated 6 years ago
- A Neural Attention Model for Abstractive Sentence Summarization in DyNet☆19Updated 7 years ago
- PyTorch speech2text inference script for the NVidia openseq2seq wav2letter model variant☆10Updated 6 years ago
- Anonymous ICLR Submission☆14Updated 6 years ago
- ASR transcription and SLU annotation web interface for call logs collected at UFAL-DSG.☆11Updated 11 years ago
- NMT based punctuation prediction system using lexical and acoustic features .☆14Updated 5 years ago
- ☆51Updated 3 years ago
- Enable RNNLM lattice rescoring with Pytorch [kaldi]☆12Updated 5 years ago
- Two-stage GANs that generate fingerstyle guitarist images from audio.☆59Updated 7 years ago
- COMIC: This is the code repo of our TMM2019 work titled "COMIC: Towards a Compact Image Captioning Model with Attention".☆15Updated 4 years ago
- Using embedding-based loss functions for phonetics/speech recognition.☆17Updated 11 years ago
- Semi-supervised emotion lexicon expansion with label propagation and specialized word embeddings☆21Updated 8 years ago
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler☆24Updated 4 years ago
- Portal Tutorial☆11Updated 7 years ago
- Online (real-time) decoder to be used with DeepSpeech2 model☆25Updated 5 years ago
- This repo is containing notes and implementations for cherry-picked publications of my particular interest☆12Updated 5 years ago
- A Structured Self-attentive Sentence Embedding Zhouhan Lin, Minwei Feng, Cicero Nogueira dos Santos, Mo Yu, Bing Xiang, Bowen Zhou, Yoshu…☆11Updated 8 years ago
- Survey on machine learning.☆14Updated 5 years ago
- Source code for ACL 2020 paper "Learning Spoken Language Representations with Neural Lattice Language Modeling"☆17Updated 2 years ago
- Universal segmenter based on the Universal Dependency framework, written by Y. Shao, Uppsala University☆34Updated 6 years ago
- Zero-shot Learning for Audio-based Music Classification and Tagging (ISMIR 2019)☆43Updated 6 years ago