JerryYin777 / NanoGPT-Pytorch2.0-Implementation
This repo holds my NanoGPT implementation for PyTorch 2.0, written around the torch 2.0 release. It is faster and simpler, and a good tutorial for learning GPT.
☆59Updated 7 months ago
Related projects:
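- Train an LLM from scratch with deepspeed, going through the pretrain and sft stages, to verify the LLM's ability to learn knowledge, understand language, and answer questions☆145Updated 2 months ago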
- This tool (enhance_long) aims to enhance the long-context extrapolation capability of LlaMa2 at the lowest cost, preferably without …☆47Updated 9 months ago
- This project builds on representative prior research to evaluate sft data along multiple dimensions and to filter sft data automatically.☆55Updated 6 months ago
- EffiBench: Benchmarking the Efficiency of Automatically Generated Code☆50Updated last month
- Empower Your Model with Longer and Better Context Comprehension☆50Updated last year
- ☆55Updated 2 months ago
- [grps with trtllm] A high-performance, pure C++ LLM service built by integrating TensorRT-LLM and Tokenizers.cpp; compatible with the OpenAI API protocol and supports chat and function call modes.☆40Updated 2 weeks ago
- This repo contains my custom-styled, Python-based plots for NLP papers, including my reproductions of plots from my favourite papers☆38Updated 6 months ago
- A Comprehensive Benchmark for Code Information Retrieval.☆61Updated last week
- We leverage 14 datasets as OOD test data and conduct evaluations on 8 NLU tasks over 21 popularly used models. Our findings confirm that …☆115Updated last year
- Knowledge Graph Fine-Tuning with Open-World Knowledge☆93Updated 2 months ago
- Down-to-earth large-model engineering, aiming to become a practical encyclopedia of large models☆17Updated 11 months ago
- CCKS'2021: "SGSum: A Human-Annotated Dataset for Sports Event Summarization"☆24Updated 2 years ago
- ☆30Updated last month
- An Extensible Framework for Retrieval-Augmented LLM Applications: Learning Relevance Beyond Simple Similarity.☆42Updated last month
- Code for ACL 2024 long paper: Are AI-Generated Text Detectors Robust to Adversarial Perturbations?☆15Updated 2 months ago
- A multimodal, multi-intelligent development framework☆56Updated last week
- Aiming to build the most comprehensive machine learning blog.☆149Updated this week
- [EMNLP 2023] CodeTransOcean: A Comprehensive Multilingual Benchmark for Code Translation☆41Updated 10 months ago
- [ACL 2024] CodeScope: An Execution-based Multilingual Multitask Multidimensional Benchmark for Evaluating LLMs on Code Understanding and …☆107Updated last month
- A Chinese llama2, from pretraining to reinforcement learning☆93Updated 11 months ago
- Code and data for crosstalk text generation tasks, exploring whether large models and pre-trained language models can understand humor. …☆164Updated 2 years ago
- A Unified Intermediate Representation for Graph Query Languages☆72Updated last year
- ☆43Updated this week
- Grimoire is All You Need for Enhancing Large Language Models☆115Updated 6 months ago
- A deployment, monitoring and autoscaling service towards serverless LLM serving.☆152Updated last week
- ☆87Updated 7 months ago
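- A neo4j knowledge graph built from data on universities in China, with simple question answering, to help users learn about universities and fill out their college entrance exam (gaokao) applications☆57Updated 2 years ago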
- A Graph Query Language Transpiler☆40Updated 3 months ago
- ☆18Updated 9 months ago