smola / language-dataset
Dataset for programming language identification.
☆21Updated last year
Related projects ⓘ
Alternatives and complementary repositories for language-dataset
- Extract statistics from Wikipedia Dump files.☆26Updated 3 years ago
- Launch NMT tasks on the cloud☆13Updated last year
- Vecino is a command line application to discover Git repositories which are similar to the one that the user provides.☆46Updated 5 years ago
- source{d} MLonCode foundation - core algorithms and models.☆14Updated 5 years ago
- My own playground for PLP (Programming Language Processing) using DeepLearning techniques☆19Updated last year
- Burglary prediction for mortals☆10Updated 6 months ago
- ☆31Updated last year
- Assessing Source Code Semantic Similarity with Unsupervised Learning☆41Updated 6 years ago
- Elasticsearch like search engine supporting real time indexing and querying☆14Updated 7 years ago
- Database smell detector☆13Updated 6 years ago
- ☆21Updated 5 years ago
- GitHub repositories and users recommendations by embeddings☆16Updated 2 years ago
- List of online / computer-based annotation tools☆18Updated 7 years ago
- ☆11Updated 5 years ago
- Tools for building a Lucene index for Semantic Vectors☆21Updated 9 years ago
- Deep Semantic Code Search aims to explore a joint embedding space for code and description vectors and then use it for a code search appl…☆65Updated 4 months ago
- Advanced similarity and duplicate source code proof of concept for our research efforts.☆52Updated 2 years ago
- This is an Object Oriented implementation of a Trie in python. The class contains setter and getter methods, and implements several usefu…☆14Updated 6 years ago
- Script for downloading GitHub.☆11Updated 4 years ago
- Code examples for Google Natural Language API.☆13Updated 5 years ago
- Notebooks and data associated to constructing and exploring a map of subreddits.☆55Updated 7 years ago
- ☆21Updated 7 years ago
- Dataset used to analyze user preferences of podcast summaries☆8Updated 2 years ago
- ☆9Updated 5 years ago
- My data is bigger than your data!☆39Updated 5 years ago
- Stuff related to scraping the Code Review StackExchange☆11Updated last year
- Scripts to take hand washing related text in (almost) any language and float it into a hand washing poster.☆9Updated 3 years ago
- Polyglot skipgram embeddings, and their many health benefits☆11Updated 4 years ago
- ☆12Updated 7 years ago
- Implementing the vision of an autonomous bot to eliminate code smells through automatic refactoring.☆61Updated last year