luxuantao / advanced_LLM_interview_notes
大模型进阶面经
☆11Updated 3 months ago
Related projects ⓘ
Alternatives and complementary repositories for advanced_LLM_interview_notes
- CMIVQA☆18Updated 5 months ago
- 擂台赛3-大规模预训练调优比赛的示例代码与baseline实现☆38Updated 2 years ago
- [ICML'2024] Can AI Assistants Know What They Don't Know?☆70Updated 9 months ago
- Code for a New Loss for Mitigating the Bias of Learning Difficulties in Generative Language Models☆53Updated 5 months ago
- The official repository for paper "LLMaAA: Making Large Language Models as Active Annotators"☆34Updated 7 months ago
- Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation☆67Updated last week
- This repository implements a prompt tuning model for hierarchical text classification. This work has been accepted as the long paper "HPT…☆62Updated last year
- ☆32Updated 3 years ago
- Implementation of "The Power of Scale for Parameter-Efficient Prompt Tuning"☆57Updated 2 years ago
- A paper list of pre-trained language models (PLMs).☆79Updated 2 years ago
- Source code for our EMNLP'21 paper 《Raise a Child in Large Language Model: Towards Effective and Generalizable Fine-tuning》☆56Updated 3 years ago
- ☆36Updated last year
- ☆71Updated 10 months ago
- Released code for our ICLR23 paper.☆63Updated last year
- 基于Gated Attention Unit的Transformer模型(尝鲜版)☆97Updated last year
- [ICLR 2024]EMO: Earth Mover Distance Optimization for Auto-Regressive Language Modeling(https://arxiv.org/abs/2310.04691)☆114Updated 8 months ago
- [Findings of ACL'2023] Improving Contrastive Learning of Sentence Embeddings from AI Feedback☆38Updated last year
- ☆53Updated 4 months ago
- 基于DPO算法微调语言大模型,简单好上手。☆28Updated 4 months ago
- NTK scaled version of ALiBi position encoding in Transformer.☆67Updated last year
- 揣摩研习社关注自然语言和信息检索前沿技术,解读热门科技论文,分享实用科研工具,挖掘人工智能冰山之下的学术和应用价值!☆37Updated 2 years ago
- This is the code repo for the paper <UTC-IE: A Unified Token-pair Classification Architecture for Information Extraction>☆14Updated last year
- machine translation data process tools☆10Updated 6 months ago
- 🐋 An unofficial implementation of Self-Alignment with Instruction Backtranslation.☆132Updated 4 months ago
- 🧬 RegMix: Data Mixture as Regression for Language Model Pre-training☆88Updated last month
- Code for "FollowBench: A Multi-level Fine-grained Constraints Following Benchmark for Large Language Models (ACL 2024)"☆89Updated 3 weeks ago
- 😎 A simple and easy-to-use toolkit for GPU scheduling.☆42Updated 3 years ago
- A collection for math word problem (MWP) works, including datasets, algorithms and so on.☆36Updated 5 months ago
- Paper list and datasets for the paper: A Survey on Data Selection for LLM Instruction Tuning☆33Updated 9 months ago
- [ACL 23] CodeIE: Large Code Generation Models are Better Few-Shot Information Extractors☆35Updated 6 months ago