jcolano / llama3_single_gpu

☆13

Alternatives and similar repositories for llama3_single_gpu

Users who are interested in llama3_single_gpu are comparing it to the repositories listed below.

dinobby / MAgICoRE
☆24 Updated 9 months ago
GaiZhenbiao / Phi3V-Finetuning
Parameter-efficient finetuning script for Phi-3-vision, the strong multimodal language model by Microsoft.
☆58 Updated last year
padas-lab-de / ir-rag-sigir24-persona-rag
☆46 Updated 9 months ago
vis-nlp / ChartGemma
☆62 Updated 11 months ago
kyegomez / MC-ViT
Implementation of the MC-ViT model from the paper "Memory Consolidation Enables Long-Context Video Understanding"
☆20 Updated 2 months ago
cyzus / thoughtsculpt
☆13 Updated 6 months ago
ContextualAI / CLAIR_and_APO
Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment
☆57 Updated 10 months ago
mbzuai-oryx / PALO
(WACV 2025 - Oral) Vision-language conversation in 10 languages including English, Chinese, French, Spanish, Russian, Japanese, Arabic, H…
☆84 Updated 4 months ago
wjbmattingly / qwen2-vl-finetune-huggingface
This project is a collection of fine-tuning scripts to help researchers fine-tune Qwen 2 VL on HuggingFace datasets.
☆73 Updated 9 months ago
deshwalmahesh / PHUDGE
Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…
☆49 Updated 11 months ago
gangiswag / infogent
☆20 Updated 3 months ago
TIGER-AI-Lab / StructLM
Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)
☆75 Updated 8 months ago
penfever / wildchat-50m
Code, results and other artifacts from the paper introducing the WildChat-50m dataset and the Re-Wild model family.
☆29 Updated 2 months ago
TergelMunkhbat / concise-reasoning
Code for the paper "Self-Training Elicits Concise Reasoning in Large Language Models"
☆34 Updated 2 months ago
bespokelabsai / verifiers
Verifiers for LLM Reinforcement Learning
☆61 Updated 2 months ago
Tomorrowdawn / top_nsigma
The official code repo and data hub of top_nsigma sampling strategy for LLMs.
☆26 Updated 4 months ago
ytyz1307zzh / RefAug
Code for EMNLP 2024 paper "Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning"
☆55 Updated 8 months ago
linkedin / ControlLLM
Control LLM
☆16 Updated 2 months ago
PKU-YuanGroup / LLaVA-o1
☆56 Updated 7 months ago
Qichuzyy / POA
Official implementation of ECCV24 paper: POA
☆24 Updated 10 months ago
UCSC-VLAA / ReasoningEval
Official repo of the paper "Knowledge or Reasoning? A Close Look at How LLMs Think Across Domains".
☆32 Updated 3 weeks ago
ByungKwanLee / Phantom
[Under Review] Official PyTorch implementation code for realizing the technical part of Phantom of Latent representing equipped with enla…
☆60 Updated 8 months ago
AnswerDotAI / ModernBERT-Instruct-mini-cookbook
☆47 Updated 4 months ago
mmhamdy / open-language-models
A list of language models with permissive licenses such as MIT or Apache 2.0
☆24 Updated 4 months ago
yongchao98 / PROMST
Automatic prompt optimization framework for multi-step agent tasks.
☆31 Updated 7 months ago
SHI-Labs / OLA-VLM
OLA-VLM: Elevating Visual Perception in Multimodal LLMs with Auxiliary Embedding Distillation, arXiv 2024
☆60 Updated 4 months ago
kyegomez / LM-Infinite
Implementation of "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models"
☆41 Updated 7 months ago
opendatalab / OHR-Bench
OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generation
☆77 Updated 3 months ago
metal-chart-generation / metal
☆35 Updated last month
ElleLeonne / Lightning-ReLoRA
A public implementation of the ReLoRA pretraining method, built on Lightning-AI's PyTorch Lightning suite.
☆33 Updated last year