infly-ai / INF-LLMLinks
The official repo of INF-34B models trained by INF Technology.
β35Updated 11 months ago
Alternatives and similar repositories for INF-LLM
Users that are interested in INF-LLM are comparing it to the libraries listed below
Sorting:
- β101Updated 8 months ago
- [ICLR 2025] 𧬠RegMix: Data Mixture as Regression for Language Model Pre-training (Spotlight)β149Updated 4 months ago
- [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuningβ158Updated 9 months ago
- β141Updated last year
- [ICML 2024] Selecting High-Quality Data for Training Language Modelsβ176Updated last year
- Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimationβ81Updated 7 months ago
- TencentLLMEval is a comprehensive and extensive benchmark for artificial evaluation of large models that includes task trees, standards, β¦β38Updated 3 months ago
- [ACL 2024] FollowBench: A Multi-level Fine-grained Constraints Following Benchmark for Large Language Modelsβ101Updated 2 weeks ago
- [ACL 2024] Long-Context Language Modeling with Parallel Encodingsβ154Updated last year
- [ICLR 2024] CLEX: Continuous Length Extrapolation for Large Language Modelsβ78Updated last year
- β63Updated 6 months ago
- β48Updated last year
- InsTag: A Tool for Data Analysis in LLM Supervised Fine-tuningβ261Updated last year
- a-m-team's exploration in large language modelingβ160Updated 3 weeks ago
- [EMNLP 2024] LongAlign: A Recipe for Long Context Alignment of LLMsβ250Updated 6 months ago
- [ACL 2025, Main Conference, Oral] Intuitive Fine-Tuning: Towards Simplifying Alignment into a Single Processβ28Updated 10 months ago
- A unified suite for generating elite reasoning problems and training high-performance LLMs, including pioneering attention-free architectβ¦β58Updated 3 weeks ago
- Benchmarking Complex Instruction-Following with Multiple Constraints Composition (NeurIPS 2024 Datasets and Benchmarks Track)β87Updated 4 months ago
- Finetuning LLaMA with RLHF (Reinforcement Learning with Human Feedback) based on DeepSpeed Chatβ114Updated 2 years ago
- Research without Re-search: Maximal Update Parametrization Yields Accurate Loss Prediction across Scalesβ32Updated last year
- CLongEval: A Chinese Benchmark for Evaluating Long-Context Large Language Modelsβ40Updated last year
- Official implementation of the paper "From Complex to Simple: Enhancing Multi-Constraint Complex Instruction Following Ability of Large Lβ¦β50Updated last year
- β59Updated last year
- [ACL-25] We introduce ScaleQuest, a scalable, novel and cost-effective data synthesis method to unleash the reasoning capability of LLMs.β63Updated 7 months ago
- Fantastic Data Engineering for Large Language Modelsβ89Updated 5 months ago
- [ACL 2024 (Oral)] A Prospector of Long-Dependency Data for Large Language Modelsβ55Updated 11 months ago
- β33Updated 9 months ago
- Dataset and evaluation script for "Evaluating Hallucinations in Chinese Large Language Models"β131Updated last year
- β30Updated 4 months ago
- Naive Bayes-based Context Extensionβ326Updated 6 months ago