Concatenated-word segmentation Python library written in Rust
☆17Aug 11, 2025Updated 10 months ago
Alternatives and similar repositories for PyWordSegment
Users that are interested in PyWordSegment are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Queue server base on RocksDB as a KV-Store backend and gRPC as an interface☆10Oct 9, 2023Updated 2 years ago
- Intsights open-source wrappers library for some AWS resources and high level management objects for distributed backend systems☆17Oct 9, 2023Updated 2 years ago
- Uncompromising and opinionated flake8 plugin which follows Intsights' practices☆14Nov 12, 2025Updated 6 months ago
- ☆25Oct 16, 2025Updated 7 months ago
- Python library for a duplicate lines removal written in C++☆32Aug 11, 2025Updated 10 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Python library for fast substring/pattern search written in C++ leveraging Suffix Array Algorithm☆42May 9, 2026Updated last month
- A blazingly fast domain extraction library written in Rust☆67Aug 11, 2025Updated 10 months ago
- Exploiting CVE-2014-3153, AKA Towelroot.☆13Jan 16, 2021Updated 5 years ago
- Python RQL Parser☆16Nov 8, 2025Updated 7 months ago
- Python module to remove wiki markup text.☆10Jan 15, 2016Updated 10 years ago
- A method for evaluating the high-level coherence of machine-generated texts. Identifies high-level coherence issues in transformer-based …☆12Mar 18, 2023Updated 3 years ago
- Using Siamese LSTM to classify repeated quora questions. Attempted pretrained bert embeddings, Word2Vec and training own embeddings toget…☆10Aug 28, 2020Updated 5 years ago
- AST factorization: transformation AST of Kotlin source code to a vector☆11Oct 17, 2019Updated 6 years ago
- PyTorch library for synthesizing programs from natural language☆18Jul 25, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- An alternative front end for Amazon Mechanical Turk☆12May 13, 2024Updated 2 years ago
- CoronaWhy Common Research and Data Infrastructure for COVID-19☆13Dec 2, 2020Updated 5 years ago
- Supporting code for Learning to Rank (LTR) presentation☆16Oct 11, 2018Updated 7 years ago
- NLP2API: Query Reformulation for Code Search using Crowdsourced Knowledge and Extra-Large Data Analytics.☆12Dec 31, 2020Updated 5 years ago
- Non-Metric Space Library (NMSLIB): An efficient similarity search library and a toolkit for evaluation of k-NN methods for generic non-me…☆13Dec 25, 2025Updated 5 months ago
- This repo hosts semantic-poetry: a composite GitHub Action to release simple Python packages on GitHub and PyPI following semantic versio…☆15Aug 13, 2023Updated 2 years ago
- Implementation query expansion in semantic meta-search engine. The resulting expansion system is called Wiki-MetaSemantik.☆11Feb 10, 2019Updated 7 years ago
- Notebook comparing scikit-learn and Spark ML for building Machine Learning Pipelines☆13Oct 8, 2015Updated 10 years ago
- SuggestBot is an article recommender for Wikipedia☆21Dec 29, 2024Updated last year
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Code and data for the EMNLP 2019 paper "In Plain Sight: Media Bias Through the Lens of Factual Reporting"☆10Feb 15, 2022Updated 4 years ago
- Israel ID Generator and validator☆33Dec 3, 2014Updated 11 years ago
- boxOS builds and curates open source projects for Linux containers☆13Feb 4, 2019Updated 7 years ago
- Minimal implementation of multiple PEFT methods for LLaMA fine-tuning☆13May 7, 2023Updated 3 years ago
- ☆11Oct 12, 2023Updated 2 years ago
- Multi-armed bandits for dynamic movie recommendations☆14Nov 20, 2019Updated 6 years ago
- Recipe for Spanish POS tagging using the CESS corpus with NLTK☆18Sep 28, 2016Updated 9 years ago
- [ICLR26] Beyond Real: Imaginary Extension of Rotary Position Embeddings for Long-Context LLMs☆32Dec 9, 2025Updated 6 months ago
- MMR for information retrieval☆18Sep 22, 2017Updated 8 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Calculate Bleu, METEOR and ROUGE score☆13May 15, 2018Updated 8 years ago
- ☆16Mar 27, 2023Updated 3 years ago
- Security framework for LLM-generated SQL queries 🛡️☆33Nov 16, 2024Updated last year
- ☆44Feb 3, 2024Updated 2 years ago
- Artifacts and other data for "Code Vectors: Understanding Programs Through Embedded Abstraced Symbolic Traces"☆22Jun 5, 2020Updated 6 years ago
- NATS as backend for Queue Package☆13May 9, 2026Updated last month
- A benchmark for evaluating embeddings of identifiers in source code.☆22Aug 23, 2021Updated 4 years ago