alipay / private_llm
☆28Updated 8 months ago
Related projects: ⓘ
- ☆82Updated 5 months ago
- Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models☆121Updated 3 months ago
- A MoE impl for PyTorch, [ATC'23] SmartMoE☆56Updated last year
- A simple GPT-based evaluation tool for multi-aspect, interpretable assessment of LLMs.☆73Updated 7 months ago
- Implementation for PrE-Text: Training Language Models on Private Federated Data in the Age of LLMs☆14Updated 3 months ago
- [ICML'24] The official implementation of “Rethinking Optimization and Architecture for Tiny Language Models”☆114Updated 2 months ago
- ☆34Updated 2 weeks ago
- FuseAI Project☆75Updated 3 weeks ago
- Inference Code for Paper "Harder Tasks Need More Experts: Dynamic Routing in MoE Models"☆17Updated last month
- R-Judge: Benchmarking Safety Risk Awareness for LLM Agents☆57Updated last month
- ☆49Updated 6 months ago
- Official github repo for AutoDetect, an automated weakness detection framework for LLMs.☆36Updated 2 months ago
- Official implementation of "DS-Agent: Automated Data Science by Empowering Large Language Models with Case-Based Reasoning" in ICML'24☆97Updated last week
- Light local website for displaying performances from different chat models.☆85Updated 10 months ago
- [ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".☆81Updated this week
- A reading list on LLM based Synthetic Data Generation 🔥☆105Updated last month
- The source code and dataset mentioned in the paper Seal-Tools: Self-Instruct Tool Learning Dataset for Agent Tuning and Detailed Benchmar…☆31Updated last month
- code for Scaling Laws of RoPE-based Extrapolation☆68Updated 11 months ago
- Reformatted Alignment☆111Updated 4 months ago
- ☆71Updated 11 months ago
- ☆57Updated 3 weeks ago
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];☆32Updated 8 months ago
- ☆105Updated last week
- Awesome list for LLM quantization☆84Updated 2 weeks ago
- ☆29Updated 3 weeks ago
- A Comprehensive Benchmark for Software Development.☆84Updated 3 months ago
- ☆111Updated 3 months ago
- ☆97Updated last month
- Token level visualization tools for large language models☆46Updated last month
- Codebase for decoding compressed trust.☆20Updated 4 months ago