smola / language-dataset
Dataset for programming language identification.
☆21Updated last year
Alternatives and similar repositories for language-dataset:
Users that are interested in language-dataset are comparing it to the libraries listed below
- ☆22Updated 5 years ago
- source{d} MLonCode foundation - core algorithms and models.☆14Updated 5 years ago
- Python library to share machine learning models easily and reliably.☆18Updated 5 years ago
- Advanced similarity and duplicate source code at scale.☆55Updated 5 years ago
- Vecino is a command line application to discover Git repositories which are similar to the one that the user provides.☆49Updated 5 years ago
- Advanced similarity and duplicate source code proof of concept for our research efforts.☆52Updated 2 years ago
- Database smell detector☆13Updated 7 years ago
- ☆12Updated 7 years ago
- Fixes Java syntax errors with LSTM neural networks! [proof-of-concept]☆18Updated 3 years ago
- Detect whether a social media comment is insulting or derogatory☆23Updated 2 years ago
- Virtual patent marking crawler at iproduct.epfl.ch☆14Updated 7 years ago
- Burglary prediction for mortals☆10Updated 8 months ago
- Open Access PDF harvester☆36Updated 9 months ago
- Elasticsearch like search engine supporting real time indexing and querying☆14Updated 7 years ago
- Summaries of academic papers☆20Updated 5 years ago
- A faithful (albeit optimized) port of Terence Parr List of Lists Visualization library, https://github.com/parrt/lolviz from Python to Ja…☆20Updated 4 years ago
- Example how to pre-process news articles with textbox and index on Elastic Search☆13Updated 7 years ago
- Python wrapper for a C++ Double Metaphone☆15Updated 2 years ago
- 🐈 Code Annotation Tool☆28Updated 5 years ago
- My portfolio, for the lulz and interwebz. Not for 4/8 chan though.☆16Updated 3 weeks ago
- Convert git logs to JSON for easy analysis☆74Updated 2 years ago
- learning related projects☆17Updated 10 years ago
- Deep Semantic Code Search aims to explore a joint embedding space for code and description vectors and then use it for a code search appl…☆65Updated 6 months ago
- Python and pandas tools to perform various analyses on different types of word lists☆16Updated 10 years ago
- Generate a summarized description of a body of text☆27Updated last year
- Assessing Source Code Semantic Similarity with Unsupervised Learning☆41Updated 6 years ago
- Lightweight Natural Intelligence Framework☆20Updated 8 years ago
- Extract statistics from Wikipedia Dump files.☆26Updated 3 years ago
- code for Seattle Twitter-Dev Meetup, October 2016☆13Updated 8 years ago
- OSoMe API mashups☆11Updated 6 years ago