Converts PDF files to TXT, then performs word segmentation and word-frequency counting.
☆34 Nov 14, 2024 Updated last year
Alternatives and similar repositories for PdfSplitter
Users who are interested in PdfSplitter are comparing it to the libraries listed below.
- Nested named entity recognition based on span classification and negative sampling ☆13 Mar 9, 2023 Updated 3 years ago
- GitBook of Note of Applied Statistics with R ☆18 Mar 13, 2021 Updated 5 years ago
- Cordova plugin for the Alibaba Baichuan SDK ☆11 Oct 15, 2017 Updated 8 years ago
- Three word segmenters: one using BiLSTM + Viterbi, one using n-grams, and one using CNN + BiLSTM + CRF ☆17 Jan 24, 2018 Updated 8 years ago
- LR / SVM / XGBoost / RandomForest etc. ☆31 May 25, 2020 Updated 5 years ago
- A question-answering chatbot. ☆31 Jun 26, 2019 Updated 6 years ago
- Hidden monitoring and blocking Android apps ☆10 May 2, 2017 Updated 8 years ago
- Baidu 100G Chassis Switch hardware spec ☆12 Sep 20, 2017 Updated 8 years ago
- Chinese Word Segmentation Using MM/CRF/Bi-LSTM/Bi-LSTM-CRF/BERT-LSTM (maximum matching, CRF via CRF++, Bi-LSTM with optional CRF, BERT-Bi-LSTM) ☆30 Apr 2, 2020 Updated 6 years ago
- Joint model for intent detection and slot filling ☆44 Sep 11, 2022 Updated 3 years ago
- Speech Recognition and Simple AI Summary: local speech-to-text, speaker diarization, and simple AI summarization, with a web interface. ☆11 Jul 22, 2024 Updated last year
- A Behance web-crawler project ☆10 May 16, 2019 Updated 6 years ago
- MachO editor / disassembler. No internet, no OS requirements, just need a browser ;) ☆12 May 13, 2019 Updated 6 years ago
- ☆10 May 22, 2023 Updated 2 years ago
- Text similarity algorithms ☆40 Nov 1, 2019 Updated 6 years ago
- A modern Python library for efficiently scraping LinkedIn. ☆26 Jan 12, 2026 Updated 3 months ago
- GitHub Markdown renderer ☆12 Jul 22, 2022 Updated 3 years ago
- A Python library to split a Chinese Pinyin phrase into possible permutations of Chinese Pinyin words ☆13 Aug 10, 2021 Updated 4 years ago
- 📈DevStats deployment on Kubernetes using Equinix servers and Helm, CoreDNS, containerd, MetalLB, OpenEBS, nginx-ingress, nginx, cert-man… ☆16 Updated this week
- A proxy IP pool using Django 2 as the API backend and Scrapy as the crawler ☆10 Jun 6, 2020 Updated 5 years ago
- A hands-on Scrapy tutorial that shares Scrapy crawling knowledge, scrapes major websites, and explains each case with example code. ☆11 Jan 22, 2026 Updated 2 months ago
- This project demonstrates illegal read and write access to kernel-mode data, both allocated by 3rd party drivers and EPROCESS … ☆13 Mar 6, 2018 Updated 8 years ago
- Web IM projects implemented using timjs in tim ☆17 May 20, 2024 Updated last year
- Mac menubar app to monitor daily keystrokes and trends ☆14 Mar 26, 2021 Updated 5 years ago
- Identify your logic with URI, apply any number of composable asynchronous or synchronous handlers using intuitive syntax and return anyth… ☆12 Mar 21, 2023 Updated 3 years ago
- Here we utilize the OpenCV libraries and apply the Histograms of Oriented Gradients (HOG) algorithm to create a computer vision applicati… ☆18 Jan 3, 2023 Updated 3 years ago
- Learning regular expressions ☆38 Oct 31, 2017 Updated 8 years ago
- ☆15 Apr 29, 2025 Updated 11 months ago
- Detected highway lane lines on a video stream. Used OpenCV image analysis techniques to identify lines, including Hough Transforms and Ca… ☆14 Jun 23, 2017 Updated 8 years ago
- The simplest solution for running Selenium on a Linux server (based on Docker). ☆10 Sep 19, 2022 Updated 3 years ago
- Paper-to-reviewer assignment is a tedious but crucial job for conference organizers. To date, the Toronto Paper Matching System (… ☆10 Nov 30, 2017 Updated 8 years ago
- A pure front-end admin template built with vue3 + vite + element-plus. ☆12 Apr 17, 2024 Updated 2 years ago
- https://github.com/OpenFlutter/amap_base_flutter with a map-move listener added to the map module ☆10 May 10, 2019 Updated 6 years ago
- Legacy Tweak ☆12 Feb 6, 2023 Updated 3 years ago
- Extract Speech/Text from Video ☆12 Mar 6, 2022 Updated 4 years ago
- A gesture recognition system powered by OpenPose, k-nearest neighbours, and local outlier factor. ☆16 Jun 7, 2021 Updated 4 years ago
- Log plugin for Vue.js ☆14 Jul 25, 2019 Updated 6 years ago
- ☆11 Nov 19, 2024 Updated last year
- Android Utils for context and thread. ☆11 Aug 4, 2016 Updated 9 years ago