Alphabetical list of free/public domain datasets with text data for use in Natural Language Processing (NLP)
☆35May 10, 2016Updated 9 years ago
Alternatives and similar repositories for nlp-datasets-1
Users that are interested in nlp-datasets-1 are comparing it to the libraries listed below
Sorting:
- Today I Learned☆10Jan 5, 2020Updated 6 years ago
- Kaggle☆14Jan 15, 2019Updated 7 years ago
- Tensorflow Dev Summit Extended Seoul - ML Kit Codelabs☆14Apr 2, 2019Updated 6 years ago
- Regeneration of Google's tpu-resnet tutorial☆12Aug 22, 2018Updated 7 years ago
- ☆22Jan 31, 2018Updated 8 years ago
- Language modeling on the Penn Treebank (PTB) corpus using a trigram model with linear interpolation, a neural probabilistic language mode…☆18Oct 8, 2018Updated 7 years ago
- 패스트캠퍼스, 파이썬을 이용한 머신러닝 입문 실습 코드☆21Sep 25, 2020Updated 5 years ago
- 2018 TF Pattern Design Study in MoT☆19Jul 1, 2018Updated 7 years ago
- ☆25Sep 10, 2019Updated 6 years ago
- etc☆24Jan 4, 2021Updated 5 years ago
- This repo has some proposed agenda for Azure Machine Learning related hands-on workshops.☆11Feb 2, 2021Updated 5 years ago
- 한국투자증권 mcp server☆14Jun 22, 2025Updated 8 months ago
- hmm-filter: Improve classifier predictions for sequential data with Hidden Markov Models (HMMs)☆12Jan 23, 2019Updated 7 years ago
- Configuration system geared towards Python ML projects☆11Apr 30, 2023Updated 2 years ago
- personal repository☆36Sep 17, 2023Updated 2 years ago
- implementation for Mucko: Multi-Layer Cross-Modal Knowledge Reasoning for Fact-based Visual Question Answering☆10Mar 17, 2022Updated 3 years ago
- JsonTuning: Towards Generalizable, Robust, and Controllable Instruction Tuning☆10Nov 3, 2024Updated last year
- Thai word segmentation using deep learning☆14Jul 1, 2019Updated 6 years ago
- This repo is for residual-connected sentence encoder for NLI.☆11Jan 21, 2018Updated 8 years ago
- Vintix: Action Model via In-Context Reinforcement Learning - - —☆22May 23, 2025Updated 9 months ago
- ☆10Aug 22, 2023Updated 2 years ago
- Two-stage text summarization with BERT and BART☆11Jan 5, 2022Updated 4 years ago
- Core classes for BloodContracts runtime data validation and monitoring toolkit☆10Aug 31, 2019Updated 6 years ago
- [NO LONGER WORKS WITH GOOGLE] - Rugalytics is a Ruby API for accessing your Google Analytics Data☆112May 12, 2011Updated 14 years ago
- Code for "AtTGen: Attribute Tree Generation for Real-World Attribute Joint Extraction", ACL 2023☆13May 19, 2023Updated 2 years ago
- A FFI-based RESTful system monitoring service 🛰️ using the lovely Rust and venerated Ruby 📡☆11Feb 3, 2019Updated 7 years ago
- This repository provide script to do OCR using some basic Deep Learning approach☆10Aug 27, 2020Updated 5 years ago
- "유닉스 리눅스 셸 스크립트 예제 사전: Unix & Linux Shell Script Exercise Dictionary" - 한빛미디어☆10Jan 17, 2017Updated 9 years ago
- stoplists for African languages generated from the ASP corpus☆14Jan 16, 2016Updated 10 years ago
- U-Net for BDD100K Dataset☆13Jan 1, 2019Updated 7 years ago
- simple Chat User Interface, using Js (jquery), Html, Css (bootstrap)☆14Dec 8, 2022Updated 3 years ago
- This tool provides a fast and efficient way to convert text into vector embeddings and store them in the Qdrant search engine. Built with…☆15Mar 31, 2023Updated 2 years ago
- Implementation example of Distributed Tensorflow☆10Jul 22, 2017Updated 8 years ago
- Google Apps Script to Archive Google Sheet