elsatch / daily_hf_papers_abstracts

This repository includes the code to download the curated HuggingFace papers into a single Markdown-formatted file.

☆14

Alternatives and similar repositories for daily_hf_papers_abstracts

Users who are interested in daily_hf_papers_abstracts are comparing it to the repositories listed below.

Qichuzyy / POA
Official implementation of ECCV24 paper: POA
☆24 Updated 11 months ago
huggingface / pixparse
Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data
☆21 Updated last year
zaydzuhri / flame
Fork of Flame repo for training of some new stuff in development
☆14 Updated 3 weeks ago
shangshang-wang / Resa
Resa: Transparent Reasoning Models via SAEs
☆41 Updated last month
XiaoduoAILab / XmodelVLM
☆69 Updated last year
camenduru / bria-rmbg-jupyter
☆16 Updated last year
DCDmllm / HyperLLaVA
Pytorch implementation of HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models
☆28 Updated last year
GenRobo / MatMamba
Code and pretrained models for the paper: "MatMamba: A Matryoshka State Space Model"
☆60 Updated 8 months ago
camenduru / DiffSketcher-colab
☆16 Updated last year
camenduru / PIA-colab
☆24 Updated last year
sayakpaul / simple-image-recaptioning
Recaption large (Web)Datasets with vllm and save the artifacts.
☆52 Updated 8 months ago
SHI-Labs / OLA-VLM
OLA-VLM: Elevating Visual Perception in Multimodal LLMs with Auxiliary Embedding Distillation, arXiv 2024
☆60 Updated 5 months ago
mmhamdy / open-language-models
A list of language models with permissive licenses such as MIT or Apache 2.0
☆24 Updated 5 months ago
kiddyboots216 / lottery-ticket-adaptation
Lottery Ticket Adaptation
☆39 Updated 8 months ago
cloneofsimo / repa-rf
☆32 Updated 9 months ago
poloclub / ClickDiffusion
ClickDiffusion: Harnessing LLMs for Interactive Precise Image Editing
☆69 Updated last year
facebookresearch / mexma
MEXMA: Token-level objectives improve sentence representations
☆41 Updated 7 months ago
ethansmith2000 / AutoLoRADiscovery
☆28 Updated last year
penfever / wildchat-50m
Code, results and other artifacts from the paper introducing the WildChat-50m dataset and the Re-Wild model family.
☆29 Updated 4 months ago
top-yun / SPARK
A benchmark dataset and simple code examples for measuring the perception and reasoning of multi-sensor Vision Language models.
☆18 Updated 7 months ago
mbzuai-oryx / PALO
(WACV 2025 - Oral) Vision-language conversation in 10 languages including English, Chinese, French, Spanish, Russian, Japanese, Arabic, H…
☆84 Updated 5 months ago
The-Inscrutable-X / TACQ
Official Repository for Task-Circuit Quantization
☆21 Updated 2 months ago
IntelLabs / multimodal_cognitive_ai
research work on multimodal cognitive ai
☆64 Updated last month
matthewrenze / jhu-concise-cot
The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models
☆22 Updated 8 months ago
SonicCodes / subcloning
implementation of https://arxiv.org/pdf/2312.09299
☆21 Updated last year
g-luo / vlm_cross_modal_reps
Official PyTorch Implementation for Vision-Language Models Create Cross-Modal Task Representations, ICML 2025
☆29 Updated 3 months ago
shulin16 / MMInA
[ACL2025 Findings] Benchmarking Multihop Multimodal Internet Agents
☆46 Updated 5 months ago
Adamdad / neumeta
NeuMeta transforms neural networks by allowing a single model to adapt on the fly to different sizes, generating the right weights when n…
☆43 Updated 8 months ago
vis-nlp / ChartGemma
☆67 Updated last year
AnonymousAlethiometer / SGD_SaI
Official PyTorch Implementation for Paper "No More Adam: Learning Rate Scaling at Initialization is All You Need"
☆52 Updated 6 months ago