ccibeekeoc42 / Meta_LlamaLinks
This is the Placeholder for Llama. Starting with Llama 3
☆11Updated last year
Alternatives and similar repositories for Meta_Llama
Users that are interested in Meta_Llama are comparing it to the libraries listed below
Sorting:
- Code for "Democratizing Reasoning Ability: Tailored Learning from Large Language Model", EMNLP 2023☆36Updated last year
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆75Updated last year
- ☆209Updated last month
- ☆78Updated last year
- This is the repo for the paper "PANGEA: A FULLY OPEN MULTILINGUAL MULTIMODAL LLM FOR 39 LANGUAGES"☆115Updated 5 months ago
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples'☆81Updated last year
- Official repository for RAGViz: Diagnose and Visualize Retrieval-Augmented Generation [EMNLP 2024]☆88Updated 10 months ago
- Code and training scripts for FlexOlmo☆113Updated this week
- Implementation of the Mamba SSM with hf_integration.☆56Updated last year
- ☆49Updated 2 years ago
- Small and Efficient Mathematical Reasoning LLMs☆72Updated last year
- ☆67Updated 8 months ago
- Data preparation code for Amber 7B LLM☆93Updated last year
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he…☆31Updated 2 years ago
- Truly flash T5 realization!☆71Updated last year
- ☆70Updated last year
- Multi-Granularity LLM Debugger [ICSE2026]☆93Updated 4 months ago
- Unofficial Implementation of Evolutionary Model Merging☆41Updated last year
- Verifiers for LLM Reinforcement Learning☆79Updated 7 months ago
- Official repository for paper "ReasonIR Training Retrievers for Reasoning Tasks".☆205Updated 5 months ago
- ☆100Updated last year
- GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection☆22Updated last year
- (WACV 2025 - Oral) Vision-language conversation in 10 languages including English, Chinese, French, Spanish, Russian, Japanese, Arabic, H…☆84Updated 3 months ago
- Sakura-SOLAR-DPO: Merge, SFT, and DPO☆116Updated last year
- Efficient Infinite Context Transformers with Infini-attention Pytorch Implementation + QwenMoE Implementation + Training Script + 1M cont…☆83Updated last year
- Flacuna was developed by fine-tuning Vicuna on Flan-mini, a comprehensive instruction collection encompassing various tasks. Vicuna is al…☆111Updated 2 years ago
- Implementation of the LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens Paper☆152Updated last year
- My implementation of Kosmos2.5 from the paper: "KOSMOS-2.5: A Multimodal Literate Model"☆74Updated last month
- Pretraining Efficiently on S2ORC!☆173Updated last year
- This is the official repository for Inheritune.☆115Updated 9 months ago