Euclid-Jie / EuclidZhiHu-serach
Full-coverage scraping of Zhihu answers, columns, and comment data
☆17 · Mar 11, 2023 · Updated 2 years ago
Alternatives and similar repositories for EuclidZhiHu-serach
Users that are interested in EuclidZhiHu-serach are comparing it to the libraries listed below
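- Zhihu crawler for scraping questions and their corresponding answers ☆28 · Jan 31, 2023 · Updated 3 years ago
- Cosine similarity calculation for articles, implemented in Python 3 ☆10 · Sep 28, 2017 · Updated 8 years ago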
- Automatic missing value imputation using random forests ☆14 · Aug 19, 2015 · Updated 10 years ago
- Supplementary code for "News Frame Analysis: An Inductive Mixed-method Computational Approach" http://dx.doi.org/10.1080/19312458.2019.16… ☆15 · Nov 13, 2020 · Updated 5 years ago
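- Obtaining Chinese character and word vectors using BERT ☆10 · Jan 18, 2022 · Updated 4 years ago
- Functionality: given a new input text, compare its similarity against existing data and return the top 10 most similar texts. Main methods: jieba Chinese word segmentation, gensim, TF-IDF term weighting, and cosine similarity. ☆11 · Jul 30, 2020 · Updated 5 years ago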
- Demo for the calculation of the Semantic Brand Score (Basic Version) ☆13 · Sep 1, 2020 · Updated 5 years ago
- Small tutorial on how you can use BERT for Topic Modeling ☆18 · Jun 1, 2021 · Updated 4 years ago
- MiniGPT-4 :: Updated to Torch 2.0, simple setup, easier API, cut out training code ☆15 · Jun 12, 2023 · Updated 2 years ago
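- Node.js-based Zhihu crawler (x-zse-96) that supports downloading articles, comments, and images locally ☆16 · Nov 8, 2023 · Updated 2 years ago
- Visual comparison of word vectors trained with word2vec and word vectors generated by BERT ☆15 · Jun 29, 2020 · Updated 5 years ago
- ✨ This repository stores a collection of small tools, such as a Zhihu Q&A crawler, a JD.com review crawler, and a sentence-splitting tool ☆60 · Dec 9, 2023 · Updated 2 years ago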
- Fake News Detection - Feature Extraction using Vectorization such as Count Vectorizer, TFIDF Vectorizer, Hash Vectorizer. Then used an E… ☆20 · Feb 21, 2020 · Updated 5 years ago
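- Uses the open-source Bert-as-Service pretrained model to generate document feature vectors, clusters COVID-19 literature with k-means, visualizes the data with t-SNE, generates topic keywords for each cluster with LDA, and draws Bokeh plots for searching and filtering data by cluster and keyword. ☆19 · Aug 3, 2020 · Updated 5 years ago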
- Nonlinear Granger causality using machine learning techniques ☆21 · Sep 8, 2023 · Updated 2 years ago
- WordBias: Visualizing Intersectional Social biases encoded in Word Embeddings ☆23 · Aug 18, 2025 · Updated 6 months ago
- Implementation of Dynamic Embedding Topic Modeling on arxiv.org articles ☆21 · Apr 24, 2022 · Updated 3 years ago
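- TXT text corpus data cleaning: 1> merge TXT files; 2> filter out noise strings; 3> mask person names, place names, and organization names; 4> convert other encodings uniformly to UTF-8 ☆19 · Oct 14, 2022 · Updated 3 years ago
- Training word vectors ☆22 · Sep 26, 2020 · Updated 5 years ago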
- An implementation of the exponential random graph model ☆27 · May 14, 2014 · Updated 11 years ago
- Visualization of the full depth of the order book along time ☆21 · Dec 17, 2019 · Updated 6 years ago
- This package consists of functionalities for dynamic topic modelling and its visualization ☆26 · May 16, 2020 · Updated 5 years ago
- Generating sentence or word vectors with a pretrained BERT model ☆27 · Oct 29, 2020 · Updated 5 years ago
- OCR based on onnxruntime with PaddleOCR models ☆30 · Dec 28, 2023 · Updated 2 years ago
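- Lightweight Zhihu crawler supporting questions, collections, and this month's hottest content ☆24 · Dec 19, 2018 · Updated 7 years ago
- [Bilibili] Download images from uploaders' feed posts ☆37 · Apr 14, 2025 · Updated 10 months ago
- Computing text similarity based on TF-IDF and cosine similarity ☆36 · Aug 29, 2018 · Updated 7 years ago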
- Granger Causality library in Python ☆38 · Nov 19, 2021 · Updated 4 years ago
- Python data visualization ☆32 · Jun 7, 2019 · Updated 6 years ago
- A Python package for the Linguistic Inquiry and Word Count (LIWC) dictionary. ☆40 · Apr 20, 2021 · Updated 4 years ago
- An R package to estimate Generalized Exponential Random Graph Models ☆40 · May 20, 2023 · Updated 2 years ago
- Dynamic topic modeling ☆42 · Feb 5, 2023 · Updated 3 years ago
- TorchQuantum is a backtesting framework that integrates the structure of PyTorch and WorldQuant's Operator for efficient quantitative fin… ☆51 · Jul 13, 2023 · Updated 2 years ago
- CCXT-based quantitative backtesting framework for the entire crypto market ☆51 · Sep 30, 2025 · Updated 4 months ago
- PyTorch implementation of Axial-LOB from 'Axial-LOB: High-Frequency Trading with Axial Attention' ☆60 · Apr 6, 2023 · Updated 2 years ago
- High frequency factors based on order and trade data. ☆69 · Dec 16, 2023 · Updated 2 years ago
- ☆70 · Mar 26, 2024 · Updated last year
- ☆66 · Jul 6, 2025 · Updated 7 months ago
- People's Daily (1946-2024), a database of Xi Jinping's series of important speeches, and classical Chinese poetry and prose ☆82 · Mar 23, 2025 · Updated 10 months ago