JerryYin777 / NanoGPT-Pytorch2.0-ImplementationLinks

This is a repo for my NanoGPT Pytorch2.0 Implementation when torch2.0 released soon, faster and simpler, a good tutorial learning GPT.

☆52

Alternatives and similar repositories for NanoGPT-Pytorch2.0-Implementation

Users that are interested in NanoGPT-Pytorch2.0-Implementation are comparing it to the libraries listed below

Sorting:

MrYxJ / enhance_long
This tool(enhance_long) aims to enhance the LlaMa2 long context extrapolation capability in the lowest-cost approach, preferably without …
☆45Updated last year
Qcompiler / MixQ_Tensorrt_LLM
Mixed precision inference by Tensorrt-LLM
☆81Updated 9 months ago
muyu42 / DataS
本项目旨在结合以往研究人员的代表性工作，从多个维度评估sft数据，并自动化过滤sft数据。
☆43Updated last year
yileijin / PayAttn
Official Implementation of "Pay Attention to What You Need"
☆43Updated 5 months ago
Qcompiler / vllm-mixed-precision
Support mixed-precsion inference with vllm
☆85Updated 3 weeks ago
wei-potato / Train-llm-from-scratch
使用deepspeed从头开始训练一个LLM,经过pretrain和sft阶段,验证llm学习知识、理解语言、回答问题的能力
☆155Updated last week
Haijian06 / EartAgent
Ein multimodaler, multi-intelligenter Entwicklungsrahmen
☆45Updated 2 months ago
tapilot-crossing / tapilot_code
☆45Updated last year
heng840 / AMIG
Code of Journey to the Center of the Knowledge Neurons: Discoveries of Language-Independent Knowledge Neurons and Degenerate Knowledge Ne…
☆26Updated last year
4real3000 / EasyJudge
[COLING Demos 2025] an Easy-to-use Tool for Comprehensive Response Evaluation of LLMs
☆36Updated 5 months ago
Ablustrund / MPLSandbox
MPLSandbox is an out-of-the-box multi-programming language sandbox designed to provide unified and comprehensive feedback from compiler a…
☆178Updated 3 months ago
alienet1109 / RolePersonality
Collecting personality-indicative data for role-playing agents.
☆23Updated 5 months ago
JunityZhan / CharaCraft-AI
This repo can create an character in one url.
☆64Updated last year
bird-bench / livesqlbench
☆100Updated 3 weeks ago
Gunale0926 / SORSA
SORSA: Singular Values and Orthonormal Regularized Singular Vectors Adaptation of Large Language Models
☆40Updated 5 months ago
duguodong7 / Awesome-Knowledge-Fusion
A collection of papers related to knowledge fusion
☆57Updated 9 months ago
BigWhiteFox / EssayAssistant
☆46Updated last year
BladeDancer957 / CPFD
☆71Updated last year
niuchenglei / rankextor
High performance rank executor for advertisement and recommendation system, implemented in C/C++ and support ensembled into Java/Scala ho…
☆78Updated last year
gao-xiao-bai / StrategyLLM
StrategyLLM: Large Language Models as Strategy Generators, Executors, Optimizers, and Evaluators for Problem Solving
☆22Updated 7 months ago
krystalan / SGSum
CCKS‘2021:《SGSum：一个面向体育赛事摘要的人工标注数据集》
☆21Updated 3 years ago
S1s-Z / SANTA
[ACL'23] Code for "SANTA: Separate Strategies for Inaccurate and Incomplete Annotation Noise in Distantly-Supervised Named Entity Recogni…
☆40Updated 3 months ago
bird-bench / BIRD-Interact
[BIRD-INTERACT] Re-imagines Text-to-SQL evaluation via lens of dynamic interactions.
☆111Updated 3 weeks ago
llmadd / code_using_GPT
Analyzing code using GPT .(通过GPT分析代码，增加注释，生成文档)
☆38Updated last year
Chen-X666 / DDmkTCCorpus
📕 DDmkTCCorpus: Diachronic Danmaku Text Comments Corpus （历时弹幕语料库）
☆15Updated last year
yewentao256 / Sicpy
Typeless Programming Language `sicpy` and Compiler;
☆33Updated last year
AtomEcho / WebTable
A python package that takes tables from a web page and processes them to get high quality tables
☆45Updated 2 years ago
OfficeAIWork / WordGPT
WordGPT是一款可以结合个人知识库或联网查询资料快速生成高质量论文、简历、博客、新闻稿、产品描述、故事、邮件、剧本、诗歌、工作汇报，及思维导图、文章配图等内容，同时可以进行各种语言的翻译，还能根据文本生成PPT的的工具。
☆51Updated 11 months ago
yaoching0 / GaC
☆48Updated 9 months ago
Rcrossmeister / RLQG
[ACL2024 Findings] Towards Better Question Generation in QA-based Event Extraction
☆47Updated 5 months ago