长文本相似度模型
☆21Nov 24, 2023Updated 2 years ago
Alternatives and similar repositories for longBert
Users that are interested in longBert are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ppo算法实现☆41Jun 5, 2024Updated 2 years ago
- dpo算法实现☆53Jun 12, 2024Updated last year
- BBPE 底层实现☆38Apr 29, 2024Updated 2 years ago
- 爬取快手热门视频,每个视频评论,用户详情数据,用户所有个人作品☆13Jun 24, 2019Updated 6 years ago
- Easy-to-use math animation rendering engine☆21Jan 14, 2024Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆14Aug 26, 2024Updated last year
- 此开源hub基于Tensorflow2.x实现文本相似度匹配、知识蒸馏、交互模型、双塔sbert☆11Feb 22, 2023Updated 3 years ago
- 🐳 PyLoader: An asynchronous Python dataloader for loading big datasets, supporting PyTorch and TensorFlow 2.x.☆11Aug 29, 2021Updated 4 years ago
- 中文文本预处理,Word2Vec训练计算文本相似度。☆43Mar 6, 2019Updated 7 years ago
- 《应用时间序列分析》易丹辉、王燕著; 案例Python实现☆16Nov 13, 2019Updated 6 years ago
- 基于SG2300X的视频检索【使用自然语言搜索视频内容,定位到符合描述的具体时间段】☆13Feb 29, 2024Updated 2 years ago
- Springboot + ElasticSearch 构建博客检索系统☆12Mar 5, 2020Updated 6 years ago
- Source code for our AAAI 2020 paper P-SIF: Document Embeddings using Partition Averaging☆35May 2, 2020Updated 6 years ago
- 数据挖掘(实战代码/欢迎讨论/大量注释/机器学习). 你将习得,如:数据的处理、LightGBM、GridSearchCV寻找最优参、StratifiedKFold分层5折切分、画AUC图、输出预测名单等。☆19Feb 16, 2020Updated 6 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- 中文文本句对相似度匹配-ATEC数据集☆23Jun 11, 2020Updated 6 years ago
- 实现功能:新输入一段文本,与已有数据进行相似度进行比较,返回TOP10的文本。主要实现方法:jieba中文分词、gensim、TF-IDF词汇重要性、cosine余弦相似度。☆11Jul 30, 2020Updated 5 years ago
- ☆12May 3, 2024Updated 2 years ago
- 本项目包含几种常用 NLP算法的实现:关键词(keyword)、命名实体(named entity)、自动摘要(abstract)、文本相似度比较(text similarity)等☆16Jan 16, 2022Updated 4 years ago
- Performing Latent Semantic Analysis with Python on large datasets.☆13Jun 21, 2022Updated 3 years ago
- pytorch版损失函数,改写自科学空间文章,【通过互信息思想来缓解类别不平衡问题】、【将“softmax+交叉熵”推广到多标签分类问题】☆12Aug 22, 2021Updated 4 years ago
- fasttext with hierarchical softmax, implemented by tensorflow☆19Jul 15, 2017Updated 8 years ago
- 一个基于elasticsearch开发的搜索引擎网站☆14Nov 22, 2022Updated 3 years ago
- 【Demo】对新闻标题使用TF-IDF向量化和cosine相似度计算完成相似标题推荐☆14Mar 2, 2020Updated 6 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- 基于ELMo, tensorflow的中文命名实体标注 Chinese Named Entity Recognition Based on ELMo☆20Oct 26, 2019Updated 6 years ago
- 基于simhash的文本去重算法☆20Jun 18, 2021Updated 4 years ago
- 利用爬虫获取58同城的二手房信息,选取特征并对数据进行预处理,利用机器学习算法给出不同地段的租房推荐。☆14Jul 17, 2019Updated 6 years ago
- 利用bert预训练模型生成句向量或词向量☆26Oct 29, 2020Updated 5 years ago
- 用文本编辑器剪视频☆37Jun 1, 2023Updated 3 years ago
- Code for our ACL-2023 paper AMPERE: AMR-Aware Prefix for Generation-Based Event Argument Extraction Model☆23Dec 14, 2023Updated 2 years ago
- NLP/ML面试各类资料链接 汇总(主要Github收集)☆11Mar 3, 2020Updated 6 years ago
- 微调预训练语言模型(BERT、Roberta、XLBert等),用于计算两个文本之间的相似度(通过句子对分类任务转换),适用于中文文本☆90Jul 30, 2020Updated 5 years ago
- “达观杯”长文本智能处理挑战赛。达观数据提供了一批长文本数据和分类信息,希望选手动用自己的智慧,结合当下最先进的NLP和人工智能技术,深入分析文本内在结构和语义信息,构建文本分类模型,实现精准分类。☆10Jul 20, 2018Updated 7 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- 批量下载抖音用户视频☆21Jan 19, 2024Updated 2 years ago
- 基于 LoRA 和 P-Tuning v2 的 ChatGLM-6B 高效参数微调☆55May 17, 2023Updated 3 years ago
- ☆11Jun 21, 2022Updated 3 years ago
- Hybrid RT DETR: Hybrid encoder-decoder network for end-to-end object detection in UAV imagery☆17May 22, 2024Updated 2 years ago
- Embed Python in Unreal Engine 4☆11Aug 13, 2021Updated 4 years ago
- 针对NER领域提供从线下训练到线上部署的一整套闭环流程☆14Jun 16, 2021Updated 4 years ago
- 超长文本分类(大于1000字);文档级/篇章级文本分类;主要是解决长距离依赖问题☆131Oct 9, 2021Updated 4 years ago