multimodal-art-projection / MAP-NEO
☆873Updated 4 months ago
Related projects ⓘ
Alternatives and complementary repositories for MAP-NEO
- DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models☆1,000Updated 9 months ago
- ☆778Updated 3 weeks ago
- [NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward☆701Updated this week
- O1 Replication Journey: A Strategic Progress Report – Part I☆1,235Updated last week
- 中文Mixtral-8x7B(Chinese-Mixtral-8x7B)☆641Updated 2 months ago
- ⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training (EMNLP 2024)☆880Updated 4 months ago
- Official repository for "Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing". Your efficient and high-quality s…☆476Updated this week
- [ACL 2024] LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding☆657Updated last month
- Large Reasoning Models☆371Updated this week
- Unify Efficient Fine-tuning of RAG Retrieval, including Embedding, ColBERT,Cross Encoder☆495Updated last week
- Parsing-free RAG supported by VLMs☆329Updated this week
- A streamlined and customizable framework for efficient large model evaluation and performance benchmarking☆231Updated this week
- OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models☆967Updated this week
- This is a repository used by individuals to experiment and reproduce the pre-training process of LLM.☆353Updated 6 months ago
- Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.☆1,109Updated 3 months ago
- 大模型多维度中文对齐评测基准 (ACL 2024)☆329Updated 2 months ago
- ☆484Updated 3 weeks ago
- A repository sharing the literatures about long-context large language models, including the methodologies and the evaluation benchmarks☆253Updated 3 months ago
- A generalized information-seeking agent system with Large Language Models (LLMs).☆1,095Updated 4 months ago
- [NeurIPS'24 Spotlight] To speed up Long-context LLMs' inference, approximate and dynamic sparse calculate the attention, which reduces in…☆776Updated this week
- Awesome-LLM-Eval: a curated list of tools, datasets/benchmark, demos, leaderboard, papers, docs and models, mainly for Evaluation on LLMs…☆429Updated 2 weeks ago
- Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.☆642Updated last month
- Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]☆494Updated 5 months ago
- ☆213Updated 5 months ago
- CMMLU: Measuring massive multitask language understanding in Chinese☆694Updated this week
- Phi2-Chinese-0.2B 从0开始训练自己的Phi2中文小模型,支持接入langchain加载本地知识库做检索增强生成RAG。Training your own Phi2 small chat model from scratch.☆482Updated 3 months ago
- LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA☆402Updated 3 weeks ago
- ⚡FlashRAG: A Python Toolkit for Efficient RAG Research☆1,287Updated this week
- AgentTuning: Enabling Generalized Agent Abilities for LLMs☆1,362Updated last year
- An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)☆3,931Updated 2 weeks ago