OpenLLMAI / OpenLLMDE
OpenLLMDE: An open source data engineering framework for LLMs
☆16Updated last year
Related projects: ⓘ
- SuperCLUE-Math6:新一代中文原生多轮多步数学推理数据集的探索之旅☆38Updated 7 months ago
- This is a personal reimplementation of Google's Infini-transformer, utilizing a small 2b model. The project includes both model and train…☆51Updated 5 months ago
- 1.4B sLLM for Chinese and English - HammerLLM🔨☆43Updated 5 months ago
- 百川Dynamic NTK-ALiBi的代码实现:无需微调即可推理更长文本☆45Updated last year
- An Experiment on Dynamic NTK Scaling RoPE☆59Updated 9 months ago
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];☆32Updated 8 months ago
- Code and data for COLING2024 paper "Characteristic AI Agents via Large Language Models".☆23Updated 6 months ago
- A Bilingual Role Evaluation Benchmark for Large Language Models☆33Updated 8 months ago
- Source code for ACL 2023 paper Decoder Tuning: Efficient Language Understanding as Decoding☆47Updated last year
- ☆34Updated 2 weeks ago
- Implementation of "Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation"☆22Updated last year
- code for paper 《RankingGPT: Empowering Large Language Models in Text Ranking with Progressive Enhancement》☆28Updated 8 months ago
- code for Scaling Laws of RoPE-based Extrapolation☆68Updated 11 months ago
- Tool for converting LLMs from uni-directional to bi-directional by removing causal mask for tasks like classification and sentence embedd…☆39Updated 2 months ago
- ☆32Updated 3 months ago
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling☆33Updated 6 months ago
- The source code and dataset mentioned in the paper Seal-Tools: Self-Instruct Tool Learning Dataset for Agent Tuning and Detailed Benchmar…☆31Updated last month
- ☆24Updated last year
- ☆87Updated 4 months ago
- NTK scaled version of ALiBi position encoding in Transformer.☆64Updated last year
- Official implementation of the paper "From Complex to Simple: Enhancing Multi-Constraint Complex Instruction Following Ability of Large L…☆34Updated 2 months ago
- Dataset for Findings of ACL 23 "VCSum: A Versatile Chinese Meeting Summarization Dataset"☆29Updated last year
- PPTC Benchmark: Evaluating Large Language Models for PowerPoint Task Completion☆45Updated 6 months ago
- ☆24Updated last year
- Contextual Position Encoding but with some custom CUDA Kernels https://arxiv.org/abs/2405.18719☆18Updated 3 months ago
- Leveraging passage embeddings for efficient listwise reranking with large language models.☆27Updated 2 months ago
- Fantastic Data Engineering for Large Language Models☆38Updated last month
- The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.☆58Updated last year
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆19Updated last year
- realize the reinforcement learning training for gpt2 llama bloom and so on llm model☆26Updated last year