infly-ai / INF-LLM
The official repo of INF-34B models trained by INF Technology.
☆35 · Updated 9 months ago
Alternatives and similar repositories for INF-LLM
Users who are interested in INF-LLM are comparing it to the repositories listed below.
- ☆140 · Updated last year
- ☆98 · Updated 7 months ago
- InsTag: A Tool for Data Analysis in LLM Supervised Fine-tuning · ☆258 · Updated last year
- Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation · ☆77 · Updated 6 months ago
- [ICML 2024] Selecting High-Quality Data for Training Language Models · ☆171 · Updated 10 months ago
- ☆47 · Updated 11 months ago
- ☆63 · Updated 5 months ago
- [ICLR 2025] 🧬 RegMix: Data Mixture as Regression for Language Model Pre-training (Spotlight) · ☆135 · Updated 3 months ago
- ☆143 · Updated 10 months ago
- CLongEval: A Chinese Benchmark for Evaluating Long-Context Large Language Models · ☆40 · Updated last year
- [EMNLP 2024] LongAlign: A Recipe for Long Context Alignment of LLMs · ☆249 · Updated 5 months ago
- Official completion of “Training on the Benchmark Is Not All You Need”. · ☆32 · Updated 4 months ago
- ☆168 · Updated last year
- [ACL 2024] Long-Context Language Modeling with Parallel Encodings · ☆153 · Updated 11 months ago
- ☆97 · Updated last year
- ☆81 · Updated last year
- [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning · ☆151 · Updated 8 months ago
- [ACL 2024] FollowBench: A Multi-level Fine-grained Constraints Following Benchmark for Large Language Models · ☆100 · Updated 3 weeks ago
- A prototype repo for hybrid training of pipeline parallel and distributed data parallel with comments on core code snippets. Feel free to… · ☆55 · Updated last year
- Dataset and evaluation script for "Evaluating Hallucinations in Chinese Large Language Models" · ☆127 · Updated 11 months ago
- Code for Scaling Laws of RoPE-based Extrapolation · ☆73 · Updated last year
- Finetuning LLaMA with RLHF (Reinforcement Learning with Human Feedback) based on DeepSpeed Chat · ☆115 · Updated last year
- Llama-3-SynE: A Significantly Enhanced Version of Llama-3 with Advanced Scientific Reasoning and Chinese Language Capabilities | Continued pre-training to improve … · ☆32 · Updated 5 months ago
- Code for a New Loss for Mitigating the Bias of Learning Difficulties in Generative Language Models · ☆62 · Updated 3 months ago
- Repository of LV-Eval Benchmark · ☆65 · Updated 8 months ago
- Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning · ☆176 · Updated 2 months ago
- [ICML'2024] Can AI Assistants Know What They Don't Know? · ☆80 · Updated last year
- [ICLR 2024] CLEX: Continuous Length Extrapolation for Large Language Models · ☆77 · Updated last year
- ☆42 · Updated 3 months ago
- ☆106 · Updated last year