nursery42/ChineseModernBert

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/nursery42/ChineseModernBert)

nursery42 / ChineseModernBert

中文预训练ModernBert

☆100

Alternatives and similar repositories for ChineseModernBert

Users that are interested in ChineseModernBert are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

AnswerDotAI / ModernBERT-Instruct-mini-cookbook
View on GitHub
☆53Feb 10, 2025Updated last year
EasonWong0327 / Hybrid-FQA-System
View on GitHub
A hybrid retrieval-based question answering system built on BERT, Faiss, and ElasticSearch.
☆25May 29, 2025Updated last year
HITsz-TMG / KaLM-Embedding
View on GitHub
Code for KaLM-Embedding models
☆118Jun 30, 2025Updated last year
illuin-tech / modernvbert
View on GitHub
ModernVBERT is a 250M-parameter vision–language encoder that aligns a text-encoder (Ettin-150M) with a vision-encoder (SigLIP2-B) through…
☆16Oct 16, 2025Updated 9 months ago
JHU-CLSP / mmBERT
View on GitHub
A massively multilingual modern encoder language model
☆145Jan 20, 2026Updated 6 months ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
YangLock / EventPapersArchive
View on GitHub
Papers about event extraction and event relation extraction
☆13May 17, 2023Updated 3 years ago
nursery42 / ChineseliteratureDataset
View on GitHub
中华经典文献数据集
☆22Jun 29, 2023Updated 3 years ago
lansinuote / Simple_TRL
View on GitHub
☆18Aug 9, 2024Updated last year
EasonWong0327 / QA-Systems-Hub
View on GitHub
It includes various question-answering technology sub-projects
☆25Aug 23, 2025Updated 11 months ago
yzhangcs / master-thesis
View on GitHub
基于树形条件随机场的高阶句法分析
☆16Apr 28, 2022Updated 4 years ago
RhapsodyAILab / MiniCPM-V-Embedding
View on GitHub
☆30Aug 19, 2024Updated last year
taishan1994 / pytorch_knowledge_distillation
View on GitHub
基于Pytorch的知识蒸馏（中文文本分类）
☆23Jan 12, 2023Updated 3 years ago
DunZhang / Stella
View on GitHub
☆63Jul 21, 2024Updated 2 years ago
hkust-nlp / WebExplorer
View on GitHub
The official repo of "WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents"
☆120Sep 29, 2025Updated 10 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
thunlp / BlockFFN
View on GitHub
Source codes for paper "BlockFFN: Towards End-Side Acceleration-Friendly Mixture-of-Experts with Chunk-Level Activation Sparsity".
☆19Jan 10, 2026Updated 6 months ago
Zengwh02 / GlimpRouter
View on GitHub
GlimpRouter: Efficient Collaborative Inference by Glimpsing One Token of Thoughts
☆16Apr 24, 2026Updated 3 months ago
asahi417 / lm-vocab-trimmer
View on GitHub
Vocabulary Trimming (VT) is a model compression technique, which reduces a multilingual LM vocabulary to a target language by deleting ir…
☆67Oct 25, 2024Updated last year
OpenLMLab / ParallelTokenizer
View on GitHub
Use the tokenizer in parallel to achieve superior acceleration
☆20Mar 21, 2024Updated 2 years ago
biostat0903 / PGS-Server
View on GitHub
☆10Apr 17, 2023Updated 3 years ago
WangWenhao0716 / PDF-Embedding
View on GitHub
[NeurIPS 2024] The official implementation of "Image Copy Detection for Diffusion Models"
☆18Oct 1, 2024Updated last year
HaloForest / UKB_PWAS
View on GitHub
Code utilized in the paper "A phenome-wide association and Mendelian randomization study for Alzheimer's disease: a prospective cohort st…
☆14Jun 10, 2022Updated 4 years ago
TaykhoomDalal / pype
View on GitHub
A Python Package for PheWAS Execution, Visualization, and Analysis
☆14Mar 13, 2026Updated 4 months ago
chandar-lab / NeoBERT
View on GitHub
☆109Jun 2, 2025Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
jonathanklmak / Metabolomics_frailty
View on GitHub
Unraveling the metabolic underpinnings of frailty using multicohort observational and Mendelian randomization analyses
☆13May 17, 2023Updated 3 years ago
kyegomez / FastFF
View on GitHub
Zeta implementation of a reusable and plug in and play feedforward from the paper "Exponentially Faster Language Modeling"
☆16Nov 11, 2024Updated last year
zhliu0106 / probing-lm-data
View on GitHub
Official Implementation of "Probing Language Models for Pre-training Data Detection"
☆20Dec 4, 2024Updated last year
stefan-it / modern-bert-ner
View on GitHub
My NER Experiments with ModernBERT and Ettin
☆29Jul 17, 2025Updated last year
ScienceOne-AI / S1-DeepResearch
View on GitHub
☆26Jul 2, 2026Updated 3 weeks ago
morning-hao / domain-self-instruct
View on GitHub
受到self-instruct启发,除了通用LLM还能做垂直领域的小LLM实现定制效果，通过GPT获得question和answer来作为训练数据
☆18May 12, 2023Updated 3 years ago
ShanghaitechGeekPie / coursebench-backend
View on GitHub
☆13Mar 26, 2026Updated 4 months ago
cytan17726 / KBQA_QueryGraphGeneration
View on GitHub
一种面向中文复杂问句的查询图生成方法，以及一份含有多种复杂句的中文知识图谱问答数据集
☆18Mar 16, 2023Updated 3 years ago
RulinShao / massive-serve
View on GitHub
Python package for serving a local search engine. One command to download and serve a datastore---that's it 😎.
☆26Jun 6, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
oceanumeric / EnteRAG
View on GitHub
A RAG that can scale 🧑🏻‍💻
☆11May 28, 2024Updated 2 years ago
Lauorie / DFT
View on GitHub
Reproduced the DFT method without using Verl. https://arxiv.org/abs/2508.05629
☆24Oct 14, 2025Updated 9 months ago
deeplearningplus / tGPT
View on GitHub
Generative Pretraining from Transcriptomes
☆17Feb 6, 2023Updated 3 years ago
grammarly / GMEG
View on GitHub
GMEG
☆33Nov 21, 2024Updated last year
kh4nh12 / self_study_ds
View on GitHub
Top Picks for Data Science Self-Study: From Newbies to Pros!
☆11Apr 2, 2024Updated 2 years ago
iLearn-Lab / MM24-DiFF
View on GitHub
Diffusion-generated Facial Forgery Dataset
☆58Apr 7, 2026Updated 3 months ago
worldbank / GISTEmbed
View on GitHub
GISTEmbed: Guided In-sample Selection of Training Negatives for Text Embeddings
☆45Mar 6, 2024Updated 2 years ago