BERT & RoBERTa pretraining code, implemented in both TensorFlow and PyTorch
☆13Feb 8, 2023Updated 3 years ago
Alternatives and similar repositories for bert-pretraining
Users that are interested in bert-pretraining are comparing it to the libraries listed below.
- This repository provides the code for applying Contrastive Learning Penalty Loss (CLPL) and Mixture of Experts (MoE) to the BGE-M3 text e…☆11Dec 27, 2024Updated last year
- ☆16Jun 10, 2025Updated 9 months ago
- Llama-Mimi is a speech language model that uses a unified tokenizer (Mimi) and a single Transformer decoder (Llama) to jointly model sequ…☆30Sep 20, 2025Updated 6 months ago
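- An implementation of BERT-based pretrained language models in two steps: pretraining and fine-tuning. It currently covers three models, BERT, RoBERTa, and ALBERT, all of which support Whole Word Mask mode.☆17Feb 1, 2020Updated 6 years ago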
- A simple package about multi-task recommendation☆17May 12, 2021Updated 4 years ago
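- Reading notes on the book "Mastering Python Design Patterns"☆11Jun 18, 2017Updated 8 years ago
- A small sticky-notes app. The latest version, 2.0, uses responsive design! You can click the link below to view it online☆10Jun 28, 2017Updated 8 years ago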
- LLM implementation one matrix multiplication at a time☆13Aug 8, 2024Updated last year
- This is the code for GA-DRL-Aubo paper☆14Apr 8, 2022Updated 3 years ago
- Export VMamba to ONNX. VMamba: Visual State Space Models. Code is based on VMamba: https://github.com/MzeroMiko/VMamba☆22May 13, 2025Updated 10 months ago
- Repository for "Enhanced Super-Resolution Training via Mimicked Alignment for Real-World Scenes", ACCV 2024☆15Dec 2, 2024Updated last year
- Code for the paper "JELLY: Joint Emotion Recognition and Context Reasoning with LLMs for Conversational Speech Synthesis"☆14Nov 5, 2024Updated last year
- A lightweight, low-level embedded key-value database library☆32Jul 8, 2013Updated 12 years ago
- Few-Shot Emotion Recognition in Conversation with Sequential Prototypical Networks☆13Sep 24, 2021Updated 4 years ago
- Hands-on project code based on the PyTorch framework.☆25Jan 5, 2023Updated 3 years ago
- AFlow & MathAI☆19Feb 24, 2025Updated last year
- Update BasicSR to support PyTorch2.0 DDP training☆17Mar 23, 2023Updated 3 years ago
- PyTorch implementation of ECCV 2024 paper "Confidence-Based Iterative Generation for Real-World Image Super-Resolution"☆16Nov 17, 2024Updated last year
- ☆10Aug 14, 2019Updated 6 years ago
- A generative recommendation project based on the movielens-25m dataset☆36Aug 6, 2025Updated 7 months ago
- PHP with FPM Dockerfile for trusted automated Docker builds.☆12Mar 2, 2016Updated 10 years ago
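- The "Daguan Cup" (达观杯) long-text intelligent processing challenge. Daguan Data (达观数据) provided a set of long-text data with classification labels and asked participants to combine their own ingenuity with state-of-the-art NLP and AI techniques to analyze the internal structure and semantics of the text in depth and build text classification models that classify accurately.☆10Jul 20, 2018Updated 7 years ago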
- TensorFlow: learn and practice☆11Aug 30, 2018Updated 7 years ago
- Scraped reviews from OpenRice for sentiment analysis. Formatted to use with BERT.☆11Apr 9, 2020Updated 5 years ago
- My research paper notes, focusing on data mining/recommender/reinforcement learning.☆23Dec 4, 2021Updated 4 years ago
- ☆20Sep 1, 2021Updated 4 years ago
- N-BEATS: Neural basis expansion analysis for interpretable time series forecasting.☆23Jun 28, 2019Updated 6 years ago
- This repo contains the KERAS implementation of "AGSDNet: Attention and Gradient based SAR Denoising Network"☆15Jun 16, 2023Updated 2 years ago
- ☆18Nov 16, 2022Updated 3 years ago
- Unofficial implementation of generative recommender (GR) with Hierarchical Sequential Transduction Unit (HSTU) from Actions Speak Louder …☆27Jun 5, 2024Updated last year
- Official repository for the paper "ChartInsights: Evaluating Multimodal Large Language Models for Low-Level Chart Question Answering" (EMN…☆22Nov 16, 2024Updated last year
- Zero-shot voice cloning text-to-speech (TTS) with explicit emotion class conditioning built on F5-TTS☆32Mar 3, 2026Updated 3 weeks ago
- A minimalist theme☆17Jan 18, 2025Updated last year
- ☆11Nov 21, 2024Updated last year
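- Early computers used 7-bit ASCII encoding. To handle Chinese characters, programmers designed GB2312 for Simplified Chinese and Big5 for Traditional Chinese. GB2312 (1980) contains 7,445 characters in total, including 6,763 Chinese characters and 682 other symbols. In the Chinese character area, the internal code ranges from B0 to F7 for the high byte and from A1 to FE for the low byte, occupying code…☆10Sep 10, 2017Updated 8 years ago
- A low-cost, beginner-friendly project for learning multimodal large models. Built on Qwen3-0.6B and CLIP, it uses the LLaVA architecture and LoRA fine-tuning, and training completes in a few hours on a consumer-grade 16 GB GPU☆43Sep 15, 2025Updated 6 months ago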
- A curated list of resources dedicated to word segmentation☆12Jan 9, 2019Updated 7 years ago
- Show summary of a large number of URLs in a Jupyter Notebook☆19Feb 10, 2026Updated last month
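- Uses the BERT pretrained language model to obtain vector representations of review text, learns their semantic features with a Bi-GRU network, and assigns weights to the review vectors using sentiment weights and an attention mechanism respectively, dynamically adjusting their influence on user features and product features. The user and product features are obtained by weighted sum, and finally the DeepFM algorithm is applied to the user and product features for deep…☆16Mar 28, 2023Updated 2 years ago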