smola / language-dataset
Dataset for programming language identification.
☆21 Updated last year
Related projects
Alternatives and complementary repositories for language-dataset
- Python library to share machine learning models easily and reliably. ☆18 Updated 5 years ago
- Online service for analyzing research profiles of scientists and conferences ☆12 Updated 2 years ago
- Fixes Java syntax errors with LSTM neural networks! [proof-of-concept] ☆18 Updated 3 years ago
- Tools for exploring the contents of web archive files. ☆39 Updated 4 years ago
- source{d} extension to match Git signatures to real people. ☆17 Updated 5 years ago
- Advanced similarity and duplicate source code proof of concept for our research efforts. ☆52 Updated 2 years ago
- Dockerfile-s to build the images which power source{d}'s computing infrastructure. ☆22 Updated 4 years ago
- A faithful (albeit optimized) port of Terence Parr's List of Lists Visualization library (https://github.com/parrt/lolviz) from Python to Ja… ☆20 Updated 3 years ago
- Open Quality Model and Tool Support for Quality Modelling and Evaluation ☆11 Updated 6 years ago
- An Excel formula parser ☆12 Updated 5 years ago
- gitbase web client; source{d} CE comes with a new UI, check it at https://docs.sourced.tech/community-edition/ ☆57 Updated 5 years ago
- Babelfish documentation (GitBook) ☆41 Updated 5 years ago
- Tools for bulk indexing of WARC/ARC files on Hadoop, EMR or local file system. ☆42 Updated 6 years ago
- Script to calculate the normalized compression distance of sets of files. It also tries to parallelize the work over the available processo… ☆16 Updated 9 years ago
- Database smell detector ☆13 Updated 6 years ago
- Git scripts and aliases ☆24 Updated this week
- Backend, IA-specific tools for crawling and processing the scholarly web. Content ends up in https://fatcat.wiki ☆25 Updated 3 months ago
- My portfolio, for the lulz and interwebz. Not for 4/8 chan though. ☆15 Updated last month
- Summaries of academic papers ☆17 Updated 5 years ago
- ☆21 Updated 5 years ago
- A Domain-Specific Language (DSL) for designing experiments in psychology ☆14 Updated 2 years ago
- ☆12 Updated 6 years ago
- A flexible data structure for low-rank (≤ 5), sparse tensors supporting slices by any dimension and Einstein summation (einsum). ☆14 Updated 6 months ago
- Web client for Babelfish server ☆23 Updated last year
- Dexter document monitor for MMA ☆17 Updated 6 months ago
- Extract statistics from Wikipedia Dump files. ☆26 Updated 3 years ago
- Babelfish Python client ☆16 Updated 5 years ago
- Recurrent neural network to split code snippets from text. ☆13 Updated 5 years ago