Chinese pre-trained ModernBert
☆100 · Apr 11, 2025 · Updated last year
Alternatives and similar repositories for ChineseModernBert
Users that are interested in ChineseModernBert are comparing it to the libraries listed below.
- This repository contains the training and evaluation code for llm-jp-modernbert-base. ☆17 · Jun 17, 2025 · Updated 11 months ago
- ☆17 · Jan 31, 2025 · Updated last year
- Code for KaLM-Embedding models ☆118 · Jun 30, 2025 · Updated 10 months ago
- Use the tokenizer in parallel to achieve superior acceleration ☆20 · Mar 21, 2024 · Updated 2 years ago
- A repository for pre-training from scratch and then SFT of a small-parameter Chinese LLaMa2; a single 24G GPU is enough to obtain a chat-llama2 with basic Chinese question-answering ability. ☆17 · Feb 29, 2024 · Updated 2 years ago
- Bringing BERT into modernity via both architecture changes and scaling ☆1,677 · Mar 1, 2026 · Updated 2 months ago
- High-order syntactic parsing based on tree-structured conditional random fields ☆16 · Apr 28, 2022 · Updated 4 years ago
- ☆29 · Aug 19, 2024 · Updated last year
- ☆63 · Jul 21, 2024 · Updated last year
- My NER Experiments with ModernBERT and Ettin ☆27 · Jul 17, 2025 · Updated 10 months ago
- ☆10 · Oct 1, 2020 · Updated 5 years ago
- ☆14 · Oct 21, 2024 · Updated last year
- ☆22 · Jan 3, 2026 · Updated 4 months ago
- MapReduce scripts written in Python for Hadoop Streaming ☆10 · Jun 10, 2014 · Updated 11 years ago
- Zeta implementation of a reusable, plug-and-play feedforward from the paper "Exponentially Faster Language Modeling" ☆16 · Nov 11, 2024 · Updated last year
- Source code of EMNLP 2022 Findings paper "SparseAdapter: An Easy Approach for Improving the Parameter-Efficiency of Adapters" ☆22 · Feb 28, 2026 · Updated 2 months ago
- Official Implementation of "Probing Language Models for Pre-training Data Detection" ☆20 · Dec 4, 2024 · Updated last year
- Code and data for "Timo: Towards Better Temporal Reasoning for Language Models" (COLM 2024) ☆26 · Oct 23, 2024 · Updated last year
- ☆13 · Mar 26, 2026 · Updated 2 months ago
- [AAAI 2025] Assessing the Creativity of LLMs in Proposing Novel Solutions to Mathematical Problems ☆13 · May 5, 2025 · Updated last year
- A query graph generation method for complex Chinese questions, plus a Chinese knowledge graph question-answering dataset containing multiple types of complex sentences ☆18 · Mar 16, 2023 · Updated 3 years ago
- Python package for serving a local search engine. One command to download and serve a datastore---that's it 😎. ☆26 · Jun 6, 2025 · Updated 11 months ago
- Fast LLM training codebase with dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler] ☆41 · Jan 4, 2024 · Updated 2 years ago
- Convert MathML to Latex for OneNote to Markdown ☆13 · Mar 17, 2026 · Updated 2 months ago
- ☆20 · Feb 25, 2026 · Updated 3 months ago
- Linux /proc data in a consistent, parsed format. ☆10 · Mar 28, 2016 · Updated 10 years ago
- GMEG ☆31 · Nov 21, 2024 · Updated last year
- Official implementation of paper "BBOPlace-Bench: Benchmarking Black-Box Optimization for Chip Placement". ☆30 · Apr 18, 2026 · Updated last month
- This repository provides the code for applying Contrastive Learning Penalty Loss (CLPL) and Mixture of Experts (MoE) to the BGE-M3 text e… ☆11 · Dec 27, 2024 · Updated last year
- General-purpose methods and templates for competitions ☆17 · Sep 8, 2020 · Updated 5 years ago
- LaTeX Beamer template crafted for University of Illinois Chicago ☆12 · Dec 7, 2024 · Updated last year
- AAAI'22-"CODE: Contrastive Pre-training with Adversarial Fine-tuning for Zero-shot Expert Linking." ☆12 · Apr 12, 2021 · Updated 5 years ago
- Runner-up code solution for the 3rd Apache Flink Geek Challenge and AAIG CUP: detecting "抱大腿" (coattail-riding) attacks in e-commerce recommendation ☆29 · Mar 25, 2022 · Updated 4 years ago
- everyone_can_pretrain_language_model ☆25 · Jan 13, 2021 · Updated 5 years ago
- ☆109 · Jun 2, 2025 · Updated 11 months ago
- Jupyter notebooks to fine-tune Whisper models on Vietnamese using Colab and/or Kaggle and/or AWS EC2 ☆19 · Aug 15, 2025 · Updated 9 months ago
- A concise guide for new students of BLCU Lab 246 ☆10 · May 30, 2022 · Updated 3 years ago
- Top Picks for Data Science Self-Study: From Newbies to Pros! ☆11 · Apr 2, 2024 · Updated 2 years ago
- Codes for our paper "AgentMonitor: A Plug-and-Play Framework for Predictive and Secure Multi-Agent Systems" ☆13 · Dec 13, 2024 · Updated last year