Mythos-Rudy/mnbvc-fasttext-classification

MNBVC text quality classification using fastText.

☆16

Alternatives and similar repositories for mnbvc-fasttext-classification

Users who are interested in mnbvc-fasttext-classification are comparing it to the libraries listed below.

aplmikex / deduplication_mnbvc
Text deduplication
☆77 · May 23, 2024 · Updated 2 years ago
BarryZM / dataProcessor
Cleaning of Chinese and English corpus data, plus distributed sentence splitting and word segmentation preprocessing
☆12 · Mar 28, 2020 · Updated 6 years ago
15810856129 / Simhash
Deduplicating massive amounts of text with Simhash
☆12 · Jun 2, 2018 · Updated 8 years ago
hiyoung123 / DuplicateRemove
A simhash-based text deduplication algorithm
☆20 · Jun 18, 2021 · Updated 5 years ago
luojie1024 / MossQA-mnbvc
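This project organizes the open-source MOSS SFT data and converts it into the mnbvc multi-turn dialogue format. MOSS-003 covers three aspects (helpfulness, faithfulness, and harmlessness) with 3.53 million samples in total; MOSS-003 includes finer-grained helpfulness category labels, broader harmlessness data, and longer dialogues, with 6.3 million samples in total.
☆13 · Dec 3, 2023 · Updated 2 years ago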
ido-web / jianshu_spider
Scrapy + selenium/webdriver + random User-Agent + IP proxy + twisted ConnectionPool + mysql: a full-site crawler for a certain "shu" site (某书)
☆15 · Dec 8, 2022 · Updated 3 years ago
CASIA-LM / ChineseWebText
☆186 · Nov 13, 2023 · Updated 2 years ago
Dynesshely / Prouter
A library to visualize algorithm by tracing your code.
☆11 · May 31, 2026 · Updated last month
zhiweio / porter
Porter is a data cleaning tool designed to assist with full data extraction from MySQL, MongoDB, and text files (CSV/TSV/JSON) and push t…
☆16 · Sep 16, 2024 · Updated last year
yangjingo / IE-Datasets-Collections
A curated collection of Chinese and English information extraction datasets
☆20 · May 15, 2022 · Updated 4 years ago
wlt233 / pmmtool
Change FeliCa PMm of HCE-F for Android NFC
☆11 · Apr 30, 2023 · Updated 3 years ago
ysnows / wx_hook
WeChat hook
☆11 · Jan 7, 2020 · Updated 6 years ago
muellermartin / frida-iOS-syscall-tracer
alternative strace for iOS device(64bit)
☆13 · May 22, 2020 · Updated 6 years ago
axhlzy / IOSHookScripts
Help us reverse ios more easily
☆20 · May 30, 2025 · Updated last year
Rprop / AnsolePlus
Another terminal emulator for Android.
☆11 · Jun 9, 2026 · Updated last month
VeloDC / oshot_detection
One-Shot Unsupervised Cross Domain Detection
☆13 · Nov 22, 2022 · Updated 3 years ago
Skielex / slgbuilder
A Python package for building and cutting sparse layered s-t graphs.
☆13 · Nov 6, 2023 · Updated 2 years ago
MaheepChaudhary / SAE-Ravel
Providing the answer to "How to do patching on all available SAEs on GPT-2?". It is an official repository of the implementation of the p…
☆13 · Jan 26, 2025 · Updated last year
frankaging / Interchange-Intervention-Training
The codebase for Inducing Causal Structure for Interpretable Neural Networks
☆11 · Dec 3, 2021 · Updated 4 years ago
hax0r31337 / frida-native-dump
Frida script to dump native libraries from running process on Android, inspired by frida_dump
☆14 · Jul 5, 2026 · Updated 2 weeks ago
Liuhong99 / implicitbiasmlmcode
☆13 · Mar 22, 2023 · Updated 3 years ago
seanzhang-zhichen / Qwen-WisdomVast
Qwen-WisdomVast is a large model trained on 1 million high-quality Chinese multi-turn SFT data, 200,000 English multi-turn SFT data, and …
☆17 · Apr 12, 2024 · Updated 2 years ago
lm-pub-quiz / lm-pub-quiz
Evaluate language models using multiple choice items
☆13 · Mar 6, 2026 · Updated 4 months ago
jeffasante / grpo-maze-solver
A reinforcement learning agent that learns to solve mazes using Group Relative Policy Optimization (GRPO).
☆12 · Feb 9, 2025 · Updated last year
jaeheungs / rdf_depth_from_focus
Official implementation of depth from focus using the ring difference filter (RDF)
☆16 · Jan 21, 2020 · Updated 6 years ago
iosaso / KNNGASO
Optimizing and learning ASO; many thanks to https://github.com/houshuai0816/ASO for the materials provided @houshuai0816
☆12 · Jun 8, 2018 · Updated 8 years ago
cisco-open / app-simulator
A tool to build custom application simulators through declarative configuration
☆11 · Dec 15, 2025 · Updated 7 months ago
pystruct / pyqpbo
QPBO interface and alpha expansion for Python
☆24 · Nov 3, 2022 · Updated 3 years ago
wanicca / WikiHowQAExtractor-mnbvc
Extract Chinese/English QA Data from WikiHow pages.
☆17 · May 21, 2023 · Updated 3 years ago
FourTwooo / HookIntent
Hooking Intent, based on Frida
☆15 · Feb 28, 2025 · Updated last year
chrishayuk / mcp-code-sandbox
☆16 · Mar 14, 2025 · Updated last year
JuliaSmoothOptimizers / QPSReader.jl
A reader for MPS and QPS files
☆20 · Aug 17, 2025 · Updated 11 months ago
Breathleas / iTweak
Awesome tweak about WeChat, Weibo, Aweme, and so on. No more introduce, you know.
☆10 · Feb 27, 2021 · Updated 5 years ago
MM-Thinking / Metis-RISE
Metis-RISE: RL Incentivizes and SFT Enhances Multimodal Reasoning Model Learning
☆22 · Jun 26, 2025 · Updated last year
thomasmong / llm-power-scheduling
☆29 · Apr 24, 2026 · Updated 2 months ago
MetaMask / metamask-eth-abis
Collection of smart contracts ABIs
☆11 · Apr 15, 2026 · Updated 3 months ago
KKallidromitis / r2o
PyTorch implementation of Refine and Represent: Region-to-Object Representation Learning.
☆21 · Jun 19, 2025 · Updated last year
Arenaa / Accelerated-Generation-Techniques
This repository contains papers for a comprehensive survey on accelerated generation techniques in Large Language Models (LLMs).
☆11 · May 24, 2024 · Updated 2 years ago
Ahmeth4n / r2renef
Renef IO Plugin for Radare2 - Dynamic Android Instrumentation
☆15 · Dec 17, 2025 · Updated 7 months ago