Curated list of Publicly available Big Data datasets. Uncompressed size in brackets. No Blockchains.
☆47May 21, 2019Updated 6 years ago
Alternatives and similar repositories for big-data-datasets
Users who are interested in big-data-datasets are comparing it to the libraries listed below.
- ☆19Jun 22, 2022Updated 3 years ago
- ☆13Nov 9, 2025Updated 4 months ago
- ☆17Oct 25, 2018Updated 7 years ago
- Various simulations of random processes☆14Mar 3, 2026Updated 2 weeks ago
- Deep Canonical Correlation Analysis implemented with tensorflow☆14Mar 16, 2018Updated 8 years ago
- Build 2019 Demos for Knowledge Mining Session☆10May 17, 2019Updated 6 years ago
- shttpd - HTTP server code with annotations☆16Sep 12, 2020Updated 5 years ago
- (weighted) dynamic time warping☆19Mar 2, 2018Updated 8 years ago
- Make GPT safe for production☆17Dec 21, 2024Updated last year
- ☆11Oct 28, 2022Updated 3 years ago
- ☆12Nov 1, 2023Updated 2 years ago
- Canonical Time Warping for NIPS 2010, CVPR 2012☆25Sep 16, 2020Updated 5 years ago
- Berserker - BERt chineSE woRd toKenizER☆16Feb 25, 2019Updated 7 years ago
- Create RP training data from a VN, using GPT-4☆18Nov 2, 2023Updated 2 years ago
- Implementation of Aligned Cluster Analysis☆18Sep 29, 2018Updated 7 years ago
- A webhook bridge to send messages on Discord through a webpage☆14Updated this week
- A short guide and example on how to fine-tune OpenAI's gpt-3.5-turbo for better roleplay☆14Aug 26, 2023Updated 2 years ago
- CoquiTTS Framework☆10Mar 21, 2023Updated 3 years ago
- Agent-based implementation of RAG, incorporating AI agents into the RAG pipeline to orchestrate its components and perform additional act…☆20Feb 20, 2025Updated last year
- GPT4 & LangChain Chroma Chatbot for large PDF docs☆12Apr 29, 2023Updated 2 years ago
- It's a script that controls the fan on a nvidia graphics card☆21Nov 6, 2020Updated 5 years ago
- Published by Packt☆22Jan 30, 2023Updated 3 years ago
- SyncPy is a novel open-source analytic library for investigating synchrony in a fast and exhaustive way.☆21Feb 3, 2022Updated 4 years ago
- API for custom GPT Actions to talk to custom GPT Agents.☆15Apr 28, 2024Updated last year
- Tool to check the CloudTrail configuration and the services where trails are sent, to detect potential attacks to CloudTrail logging.☆13May 25, 2024Updated last year
- HDInsight Developer Guide☆14Jun 27, 2018Updated 7 years ago
- A simple implementation of "Personalizing Dialogue Agents: I have a dog, do you have pets too?"☆14Nov 27, 2018Updated 7 years ago
- HAN model. Three versions.☆27Jan 16, 2018Updated 8 years ago
- resources for openhack☆10May 29, 2019Updated 6 years ago
- ☆13Jan 7, 2022Updated 4 years ago
- Review of time series using regression and neural network methods☆33Dec 3, 2018Updated 7 years ago
- ☆18Dec 8, 2023Updated 2 years ago
- Learning PostgreSQL 11, Third Edition, Published by Packt☆24Jan 15, 2021Updated 5 years ago
- ☆25Jun 1, 2016Updated 9 years ago
- Code used in DEVNET Workshops at Cisco Live Events☆14Dec 8, 2022Updated 3 years ago
- Scalable cloud load/stress test for Azure Cognitive Search. Includes a pipelined solution with Apache JMeter and Terraform to dynamically…☆17Jun 10, 2021Updated 4 years ago
- Crawl Google Scholar publications easily.☆21Feb 2, 2022Updated 4 years ago
- The fastai deep learning library, plus lessons and tutorials☆13Jun 2, 2019Updated 6 years ago
- Chrome Extension for YouTube. Acts as an assistant for the YouTube video you are watching☆23Apr 26, 2023Updated 2 years ago