kwaziidev / textractorLinks
从html中提取正文,用于新闻类网页
☆15Updated 2 years ago
Alternatives and similar repositories for textractor
Users that are interested in textractor are comparing it to the libraries listed below
Sorting:
- Golang RESTful Client for HanLP.中文分词 词性标注 命名实体识别 依存句法分析 语义依存分析 新词发现 关键词短语提取 自动摘要 文本分类聚类 拼音简繁转换 自然语言处理☆36Updated 2 years ago
- Functional Meaning Representation and Semantic Parsing Framework☆78Updated 2 years ago
- A simhasher for Chinese documents implemented by golang, simply translated from yanyiwu/gosimhash☆17Updated 7 years ago
- Web服务:使用腾讯 800 万词向量模型和 spotify annoy 引擎得到相似关键词☆101Updated 5 years ago
- 昆仑:超快的代码搜索引擎☆55Updated 3 years ago
- Go framework for DL model inference and API deployment☆50Updated 8 months ago
- G文⚡️: Concurrency-safe, Free and Unlimited google translate api for Golang. 🔥免费、无限、并发安全的谷歌翻译包☆111Updated 4 years ago
- Go use bleve with gse tokenizer☆29Updated last year
- Golang gzip middleware for Gin and net/http | Golang gzip中间件,支持Gin和net/http,开箱即用同时可定制☆144Updated 2 years ago
- 📒 An Aho-Corasick algorithm based string-searching utility for Go. It supports tokenization, ignoring case, replacing text. So you can u…☆49Updated 2 years ago
- GoJieba Bleve support☆29Updated last week
- Mir is a toolkit for register method handler to http engine router(eg: gin,echo,iris,mux,httprouter) use struct tag info.☆85Updated last week
- Fast implementation of fuzzy matching and Blacklist filtering☆35Updated 4 years ago
- Go 语言汉字转拼音工具☆196Updated 6 months ago
- ^=^ clop是基于struct的命令行解析库,专注是它的灵魂,像AK47一样,简单,强大,专注,让命令行里面的疑难杂症统统走开[从零实现]☆133Updated last year
- GoPool is a high-performance, feature-rich, and easy-to-use worker pool library for Golang.☆212Updated last year
- Calculate string similarity library, integrate multiple algorithms on the back end。计算字符串相似度库,后端集成多种算法[从零实现]☆276Updated 3 years ago
- a simple fsm for Go☆51Updated 3 years ago
- Pure go library for creating Office Open/OOXML/ECMA-376 documents (.docx, .xlsx, .pptx)☆53Updated 7 years ago
- ☆25Updated 2 years ago
- A simple and light excel file reader to read a standard excel as a table faster | 一个轻量级的Excel数据读取库,用一种更`关系数据库`的方式解析Excel。☆195Updated 3 months ago
- pcopy是深度拷贝库,相比上个版本(v0.0.10),性能提升4-10倍☆94Updated 4 months ago
- Go SDK Client for ZincSearch☆35Updated 2 years ago
- 基于文本密度的html2article实现[golang]☆191Updated 6 years ago
- go-xss is a module used to filter input from users to prevent XSS attacks☆48Updated 4 months ago
- weibo spider written by golang colly(使用golang的colly库编写的微博爬虫)☆68Updated last year
- 🗃 Generic cache use and cache manage. Provide a unified usage API by packaging various commonly used drivers. Support File, Memory, Red…☆198Updated this week
- String similarity functions, String distance's, Jaccard, Levenshtein, Hamming, Jaro-Winkler, Q-grams, N-grams, LCS - Longest Common Subse…☆61Updated 7 years ago
- a cli tool for generating code based on gendry☆70Updated 6 years ago
- gorm v2的即插即用、无需修改代码 的旁路缓存。An easy-to-use look-aside cache solution for gorm v2 users.☆118Updated last year