Train a model, and detect gibberish strings with it.
☆68 · Feb 17, 2022 · Updated 4 years ago
Alternatives and similar repositories for gibberish-detector
Users who are interested in gibberish-detector are comparing it to the libraries listed below.
- Service to scan licenses from source code · ☆12 · Aug 14, 2023 · Updated 2 years ago
- An efficient algorithm for k-bounded (Damerau-)Levenshtein distance · ☆16 · Oct 13, 2018 · Updated 7 years ago
- A pure python rpm reader · ☆20 · Apr 11, 2024 · Updated last year
- Scripts as a service. Builds on systemd (for Linux) · ☆21 · Mar 10, 2026 · Updated last week
- Upload a document image or PDF, or provide a URL, to convert it into a structured format using SmolDocling. · ☆16 · Mar 31, 2025 · Updated 11 months ago
- Script to help maintain a wheelhouse folder on a cloud storage. · ☆33 · Aug 4, 2020 · Updated 5 years ago
- Arabic collocations library and data for Python · ☆10 · Nov 14, 2021 · Updated 4 years ago
- ☆17 · Jul 10, 2024 · Updated last year
- Visualize expert firing frequencies across sentences in the Mixtral MoE model · ☆18 · Dec 22, 2023 · Updated 2 years ago
- Audit python packages for known vulnerabilities · ☆34 · Mar 9, 2022 · Updated 4 years ago
- Research on the usage of Jupyter notebooks · ☆19 · Sep 12, 2019 · Updated 6 years ago
- Word acquisition in neural language models (TACL 2022). · ☆20 · Jan 30, 2025 · Updated last year
- Python package to hash dictionaries using default hash, md5, sha256 and more. · ☆27 · Nov 10, 2025 · Updated 4 months ago
- Etalab's Lab IA Pseudonymization Demo source code · ☆11 · Aug 3, 2023 · Updated 2 years ago
- Lightweight license checker. · ☆31 · Nov 5, 2020 · Updated 5 years ago
- CveXplore · ☆42 · Sep 12, 2025 · Updated 6 months ago
- Debian packaging tools · ☆47 · Mar 9, 2021 · Updated 5 years ago
- Just a nodeJS wrapper for ghostscript · ☆12 · Jul 12, 2023 · Updated 2 years ago
- Official repository for FLAME-MoE: A Transparent End-to-End Research Platform for Mixture-of-Experts Language Models · ☆34 · Sep 19, 2025 · Updated 6 months ago
- Rust library for extracting data from HTML tables. · ☆13 · Mar 4, 2024 · Updated 2 years ago
- Klimatkollen's data pipeline and API for processing company sustainability reports · ☆23 · Updated this week
- A project about benchmarking and evaluating existing PDF extraction tools on their semantic abilities to extract the body texts from PDF … · ☆69 · Nov 7, 2020 · Updated 5 years ago
- How About Machine Learning Enhancing Theses? - a pilot discovery project · ☆14 · May 23, 2023 · Updated 2 years ago
- Contains over 375 samples of Windows Portable Executable (PE) files ranging from the common to the completely esoteric with detailed orig… · ☆46 · Sep 25, 2024 · Updated last year
- Go stemmers generated by the Snowball project · ☆24 · Sep 6, 2020 · Updated 5 years ago
- Recurrent Discounted Attention unit (RDA) for Tensorflow · ☆22 · Mar 12, 2018 · Updated 8 years ago
- Bytecode Analysis Toolkit. · ☆17 · Oct 28, 2022 · Updated 3 years ago
- linear regression analysis of relationships in fantasy football using player and team data; 12,500+ views on medium · ☆17 · Dec 31, 2022 · Updated 3 years ago
- A tiny library for Python text normalisation. Useful for ad-hoc text processing. · ☆156 · Mar 8, 2026 · Updated last week
- A Radix Tree based router for HTTP and other routing needs with support for middlewares and endpoints with a Cython boost · ☆15 · Oct 9, 2018 · Updated 7 years ago
- The official repository for Toxic Commons and Celadon. Toxicity Classification for public domain data. · ☆21 · Nov 10, 2024 · Updated last year
- ☆11 · Mar 13, 2026 · Updated last week
- Lokalise API v2 official Python library · ☆15 · Mar 13, 2026 · Updated last week
- Vendy is a tool for vendoring third-party packages into your project. · ☆18 · Nov 28, 2023 · Updated 2 years ago
- Digital Forensics Windows Registry (dfWinReg) · ☆54 · Dec 22, 2025 · Updated 2 months ago
- CAPE core and community parsers · ☆18 · Feb 9, 2026 · Updated last month
- The main feature flipper library and web admin application. · ☆10 · Aug 18, 2025 · Updated 7 months ago
- Literally exactly like Python's unittest but with colors. · ☆17 · Mar 29, 2021 · Updated 4 years ago
- [ICLR 2024]: Is Self-Repair a Silver Bullet for Code Generation? · ☆15 · May 2, 2024 · Updated last year