A Go package that implements the JusText boilerplate removal algorithm
☆110Nov 6, 2022Updated 3 years ago
Alternatives and similar repositories for justext
Users that are interested in justext are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- CGo wrapper package around the libtidy, the HTML tidy library☆22Nov 19, 2021Updated 4 years ago
- Converts PDF, DOC, DOCX, XML, HTML, RTF, etc to plain text☆1,781Jul 1, 2024Updated last year
- Package reactLog is reaction middle-ware for standard golang log☆11Oct 10, 2015Updated 10 years ago
- ☆12Aug 22, 2015Updated 10 years ago
- A Go library for performing Unicode Text Segmentation as described in Unicode Standard Annex #29☆89Dec 19, 2022Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Develop macOS apps on Windows with seamless cross-platform tools.☆16Jun 5, 2025Updated last year
- Self-hosting binary instrumentation framework for security research☆12Apr 10, 2023Updated 3 years ago
- ActivityStreams 2.0 encoding/decoding for Go 1.18+☆13Aug 31, 2025Updated 9 months ago
- General set data structure based on github.com/xtgo/set☆13Mar 1, 2016Updated 10 years ago
- Simple interface to libmagic for Go Programming Language☆13Jan 10, 2021Updated 5 years ago
- The trashvisor☆12Oct 25, 2020Updated 5 years ago
- Embeddable filesystem mapped key-string store. Ideal for embedding code like sql, lua, et al.☆15Oct 15, 2019Updated 6 years ago
- ☆13Jun 20, 2022Updated 3 years ago
- Multiclass Naive Bayesian Classification☆77Aug 10, 2018Updated 7 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Output high level Pcode (PcodeAST) in Ghidra☆18Apr 7, 2023Updated 3 years ago
- Utilities for working with discrete probability distributions and other tools useful for doing NLP work☆95Nov 15, 2011Updated 14 years ago
- Ferret is a search engine that unifies search results from Github, Slack, Trello and more☆29Feb 25, 2017Updated 9 years ago
- A multilingual command line sentence tokenizer in Golang☆470Feb 28, 2024Updated 2 years ago
- Hierarchical Dirichlet Process (with Split-Merge Operations), originally by Chong Wang☆18Oct 12, 2013Updated 12 years ago
- Text indexing related functions in Go, including tokenizer, word marking, and snippet selecting, etc.☆26Feb 14, 2016Updated 10 years ago
- grobotstxt is a native Go port of Google's robots.txt parser and matcher library.☆115Mar 16, 2022Updated 4 years ago
- simhash storage and searching☆137Mar 29, 2017Updated 9 years ago
- LiT (Zero-Shot Transfer with Locked-image text Tuning) image and text encoder models, working in the browser☆11May 16, 2022Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Plugin for ida pro that copies RVA under cursor to clipboard.