fondoger/scholar_dataset

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/fondoger/scholar_dataset)

fondoger / scholar_dataset

百度百科学者词条、知网学者和中文论文元数据开源数据集

☆20

Alternatives and similar repositories for scholar_dataset

Users that are interested in scholar_dataset are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

xnliang98 / CKE-ZH
View on GitHub
基于中心度的中文关键短语抽取工具
☆11Sep 2, 2022Updated 3 years ago
yongzhuo / pytorch-loss
View on GitHub
pytorch版损失函数，改写自科学空间文章，【通过互信息思想来缓解类别不平衡问题】、【将“softmax+交叉熵”推广到多标签分类问题】
☆12Aug 22, 2021Updated 4 years ago
MySong7NLPer / HyperMatch
View on GitHub
Code for NAACL 2022 paper "Hyperbolic Relevance Matching for Neural Keyphrase Extraction".
☆22Mar 28, 2023Updated 3 years ago
AnzorGozalishvili / unsupervised_keyword_extraction
View on GitHub
Unsupervised approach to keyword extraction
☆24May 22, 2023Updated 3 years ago
baoy-nlp / DSS-VAE-pytorch
View on GitHub
Generating Sentences from Disentangled Syntactic and Semantic Spaces
☆11Jun 24, 2019Updated 7 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
bloomberg / kbir_keybart
View on GitHub
Experimental code used in pre-training the KBIR and KeyBART models
☆27Jul 8, 2022Updated 4 years ago
CodeLLM-Research / CodeJudge-Eval
View on GitHub
[COLING25] CodeJudge Eval: Can Large Language Models be Good Judges in Code Understanding?
☆12Dec 3, 2024Updated last year
ruiqi-zhong / SemanticScaffold
View on GitHub
Semantic Scaffolds for Pseudocode-to-Code Generation (accepted by ACL 2020)
☆14Jun 7, 2021Updated 5 years ago
CodedotAl / reading-group
View on GitHub
Information about the CodedotAI reading group sessions.
☆13Aug 16, 2021Updated 4 years ago
kinesiatricssxilm14 / CodeRepoQA
View on GitHub
CodeRepoQA dataset
☆15Feb 19, 2025Updated last year
mtianyan / Mtianyan-Play-with-Machine-Learning-Algorithms
View on GitHub
Python3入门机器学习经典算法与应用学习
☆11Nov 9, 2018Updated 7 years ago
jszheng21 / RACE
View on GitHub
RACE is a multi-dimensional benchmark for code generation that focuses on Readability, mAintainability, Correctness, and Efficiency.
☆14Oct 12, 2024Updated last year
xydaytoy / BMI-NMT
View on GitHub
☆11Jul 28, 2021Updated 5 years ago
huangd1999 / EffiLearner
View on GitHub
[NeurIPS 2024] Self-Optimization Improves the Efficiency of Code Generation
☆15May 10, 2025Updated last year
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
joeyism / py-image-comparer
View on GitHub
Compares two images using Siamese Network (machine learning) trained from a Pytorch Implementation
☆10Jul 27, 2021Updated 5 years ago
NingAnMe / Label-Smoothing-for-CrossEntropyLoss-PyTorch
View on GitHub
add a Arg: label_smoothing for torch.nn.CrossEntropyLoss()
☆14Jan 13, 2021Updated 5 years ago
shengqiangzhang / transformer-pointer-generator-for-english-dataset
View on GitHub
An Abstractive Summarization(for Datasets in English format) Implementation with Transformer and Pointer-generator
☆12Dec 31, 2020Updated 5 years ago
lulia0228 / Document_IE
View on GitHub
GCN use for semi-construct document information extraction.
☆21Aug 5, 2023Updated 2 years ago
Ahmedfir / mBERTa
View on GitHub
CodeBERT based mutation testing tool.
☆13Nov 10, 2025Updated 8 months ago
robinsongh381 / UNILM_Pytorch_Korean
View on GitHub
☆11Jul 5, 2020Updated 6 years ago
xguo7 / Automatic-Controllable-Product-Copywriting-for-E-Commerce
View on GitHub
☆16Nov 3, 2022Updated 3 years ago
voidful / pretrain_bart
View on GitHub
training BART from scratch
☆12Dec 31, 2021Updated 4 years ago
HiroakiMikami / mlprogram
View on GitHub
PyTorch library for synthesizing programs from natural language
☆18Jul 25, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
920232796 / pytorch_chatbot
View on GitHub
pytorch实现聊天机器人，seq2seq模型
☆10Feb 9, 2020Updated 6 years ago
yagol2020 / SimADFuzz
View on GitHub
SimADFuzz: Simulation-Feedback Fuzz Testing for Autonomous Driving Systems
☆11Apr 11, 2025Updated last year
nuaa-nlp / Evaluation-of-ChatGPT
View on GitHub
☆14Apr 15, 2023Updated 3 years ago
zhanlaoban / Text_Classification
View on GitHub
Summary of Text Classification in deep learning techniques implemented by PyTorch and TensorFlow. 深度学习文本分类技术总结，以PyTorch实现。
☆14Dec 18, 2019Updated 6 years ago
GATECH-EIC / LLM4HWDesign_Starting_Toolkit
View on GitHub
LLM4HWDesign Starting Toolkit
☆20Oct 4, 2024Updated last year
NTDXYG / DualSC
View on GitHub
code and data for paper "Automatic Generation and Summarization of Shellcode via Transformer and Dual Learning", which accepted in SANER …
☆12May 8, 2022Updated 4 years ago
marcusm117 / IdentityChain
View on GitHub
[ICLR 2024] Beyond Accuracy: Evaluating Self-Consistency of Code Large Language Models with IdentityChain
☆11Nov 24, 2025Updated 8 months ago
haotangxjtu / MSCL
View on GitHub
code for Multisample-based Contrastive Loss for Top-k Recommendation (IEEE TMM)
☆10Nov 23, 2022Updated 3 years ago
zysszy / CAT
View on GitHub
Improving Machine Translation Systems via Isotopic Replacement
☆12Apr 14, 2023Updated 3 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
LIANGQINGYUAN / Lyra
View on GitHub
Lyra: A Benchmark for Turducken-Style Code Generation
☆15Apr 22, 2022Updated 4 years ago
mdrafiqulrabin / JavaTransformer
View on GitHub
Program Transformation Tool for Java Methods
☆10Sep 16, 2022Updated 3 years ago
nuaa-nlp / Multimodality
View on GitHub
☆15Dec 10, 2021Updated 4 years ago
lipiji / uChecker
View on GitHub
Code of the COLING22 paper "uChecker: Masked Pretrained Language Models as Unsupervised Chinese Spelling Checkers"
☆19Aug 17, 2022Updated 3 years ago
KoreaMGLEE / Concept-based-curriculum-masking
View on GitHub
Efficient Pre-training of Masked Language Model via Concept-based Curriculum Masking
☆13Feb 5, 2023Updated 3 years ago
NTDXYG / COTTON
View on GitHub
Data and code for "Chain-of-Thought in Neural Code Generation: From and For Lightweight Language Models", which accepted in TSE.
☆15Jul 3, 2024Updated 2 years ago
wssun / BADCODE
View on GitHub
Backdooring Neural Code Search
☆14Sep 8, 2023Updated 2 years ago