大数据笔记整理
☆57Feb 15, 2022Updated 4 years ago
Alternatives and similar repositories for bigdata
Users that are interested in bigdata are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 大数据相关笔记☆25Mar 21, 2021Updated 5 years ago
- 从0到1构建用户画像☆39Jun 4, 2021Updated 5 years ago
- 一个实时数仓项目,从0到1搭建实时数仓☆64May 27, 2021Updated 5 years ago
- 用Paddle复现论文ChineseBERT: Chinese Pretraining Enhanced by Glyph and Pinyin Information(ACL2021)☆10Nov 15, 2021Updated 4 years ago
- 机器学习:1)离线统计(统计数据即可),离线推荐(基于LFM隐语义模型 采用ALS算法 ,并根据最小方差计算RMSE),2)实时推荐,实时根据用户最近看过的一部电影,找到相似的电影(相似矩阵由上一个需求得出)作为候选电影,再结合最近评分的电影,推出优先级别 3)基于内容(电…☆13Mar 21, 2019Updated 7 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- debezium同步mysql☆13Oct 5, 2022Updated 3 years ago
- Python自动化办公☆13Nov 5, 2021Updated 4 years ago
- 提取每条新闻中的人名,假设在同一条新闻的人物具有联系,建立新闻人物的社交网络,并进一步探索网络的性质。☆10Oct 6, 2019Updated 6 years ago
- The example for using OpenTelemetry Collector in Java☆12May 4, 2023Updated 3 years ago
- 🚀🚀🚀优质的历史文章,大数据高频考点,Java一线大厂知识考点,更有精美简历模板,简历指导手册和上百本技术书籍,最重要的就是被全网下载上千次的我自己花精力去画的大数据生态圈,Kafka,Spark,Scala的思维导图...这是一个你在大数据学习路上不能错过的宝 藏项目…☆881Aug 25, 2021Updated 4 years ago
- This repo demonstrates 3 ways for apps to auto reload from Kubernetes ConfigMap.☆12Jun 14, 2019Updated 7 years ago
- 淘宝模拟登陆/数据采集☆10Sep 28, 2020Updated 5 years ago
- Prompt engineering based on ChatGLM-6B☆10May 12, 2023Updated 3 years ago
- 总结典型的hive面试题、SQL、HQL练习☆21Mar 24, 2022Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- This is a Kaggle data mining contest, link: https://www.kaggle.com/c/avazu-ctr-prediction☆11Mar 12, 2015Updated 11 years ago
- 【数据虫巢】公众号《数据与广告》系列对应机器学习实例。☆28Jun 6, 2020Updated 6 years ago
- Presto dynamic catalog is implemented based on ZooKeeper, and REST API is provided for catalog curd, without the need to restart presto c…☆11Nov 10, 2020Updated 5 years ago
- php生成分布式唯一id扩展,基于Twitter SnowFlake分布式ID生成算法,使用c实现的php Extension。默认生成ID是一个64位long型数字。单机每秒内理论上最多可以生成1024*(2^12),也就是409.6万个ID(1024 X 4096 = …☆11Apr 16, 2025Updated last year
- flink sql☆11Jun 21, 2022Updated 4 years ago
- Amazon.com price check, item description & review, and more☆22Mar 2, 2011Updated 15 years ago
- csdn用户画像的源码☆20Jul 19, 2017Updated 8 years ago
- pornhub.com crawler to crawl and download videos those are publicly present in the website for viewing and downloading☆11Oct 1, 2020Updated 5 years ago
- 币安合约自动交易(node旧版)☆12Jun 30, 2024Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- 微信爬虫☆17Nov 16, 2019Updated 6 years ago
- CTR prediction models in TensorFlow 2.x☆21Nov 3, 2021Updated 4 years ago
- Code for the Click-Through Rate Prediction Kaggle challenge from Avazu☆11Feb 5, 2017Updated 9 years ago
- ☆21Nov 13, 2025Updated 7 months ago
- Alibaba Cloud Graph Database Service (GDB) Tools☆19Mar 6, 2024Updated 2 years ago
- Machine learning and Deep learning notes.