multimodal-art-projection / MAP-NEO
☆878Updated 5 months ago
Related projects ⓘ
Alternatives and complementary repositories for MAP-NEO
- DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models☆1,008Updated 10 months ago
- ⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training (EMNLP 2024)☆883Updated 4 months ago
- Large Reasoning Models☆580Updated this week
- [NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward☆714Updated 2 weeks ago
- ☆819Updated last month
- 中文Mixtral-8x7B(Chinese-Mixtral-8x7B)☆641Updated 3 months ago
- Official repository for "Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing". Your efficient and high-quality s…☆491Updated 2 weeks ago
- O1 Replication Journey: A Strategic Progress Report – Part I☆1,318Updated 3 weeks ago
- Train a 1B LLM with 1T tokens from scratch by personal☆319Updated this week
- A streamlined and customizable framework for efficient large model evaluation and performance benchmarking☆257Updated this week
- Unify Efficient Fine-tuning of RAG Retrieval, including Embedding, ColBERT, ReRanker.☆511Updated last week
- Awesome-LLM-Eval: a curated list of tools, datasets/benchmark, demos, leaderboard, papers, docs and models, mainly for Evaluation on LLMs…☆434Updated 3 weeks ago
- Parsing-free RAG supported by VLMs☆388Updated this week
- [ACL 2024] LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding☆671Updated 2 months ago
- ☆213Updated 6 months ago
- LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA☆412Updated last month
- Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.☆647Updated last month
- Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]☆498Updated 6 months ago
- OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models☆1,045Updated this week
- A generalized information-seeking agent system with Large Language Models (LLMs).☆1,104Updated 5 months ago
- 📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥☆1,006Updated this week
- 大模型多维度中文对齐评测基准 (ACL 2024)☆332Updated 3 months ago
- A repository sharing the literatures about long-context large language models, including the methodologies and the evaluation benchmarks☆252Updated 3 months ago
- Collecting awesome papers of RAG for AIGC. We propose a taxonomy of RAG foundations, enhancements, and applications in paper "Retrieval-…☆238Updated 5 months ago
- [NeurIPS'24 Spotlight] To speed up Long-context LLMs' inference, approximate and dynamic sparse calculate the attention, which reduces in…☆791Updated this week
- Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.☆1,120Updated 3 months ago
- DeepSeek LLM: Let there be answers☆1,451Updated 9 months ago
- This is a repository used by individuals to experiment and reproduce the pre-training process of LLM.☆353Updated 6 months ago
- Phi2-Chinese-0.2B 从0开始训练自己的Phi2中文小模型,支持接入langchain加载本地知识库做检索增强生成RAG。Training your own Phi2 small chat model from scratch.☆488Updated 4 months ago
- ⚡FlashRAG: A Python Toolkit for Efficient RAG Research☆1,335Updated this week