KnightofDawn / books-1

IT技术书籍文字版mobi epub格式

☆9

Related projects ⓘ

Alternatives and complementary repositories for books-1

Alsace08 / OOD-Math-Reasoning
Code and Data Repo for NeurIPS 2024 Paper "Embedding Trajectory for Out-of-Distribution Detection in Mathematical Reasoning"
☆13Updated 5 months ago
yuezih / less-is-more
Less is More: Mitigating Multimodal Hallucination from an EOS Decision Perspective (ACL 2024)
☆33Updated 3 weeks ago
HanNight / soft_self_consistency
Code for ACL 2024 paper "Soft Self-Consistency Improves Language Model Agents"
☆16Updated 2 months ago
Dicer-Zz / EPI
Code for the paper: Rehearsal-free Continual Language Learning via Efficient Parameter Isolation
☆13Updated last year
OpenSparseLLMs / CLIP-MoE
CLIP-MoE: Mixture of Experts for CLIP
☆17Updated last month
wbbeyourself / arxiv_paper_downloader
Arxiv daily paper downloader and manage papers with markdown preview.
☆29Updated 4 months ago
SinclairCoder / do-research-in-AI
A repository of useful research/skill-upgrading talks or acticles in NLP/CV/AI Area (in Chinese).
☆69Updated 3 months ago
Maxlinn / CHAIR-metric-standalone
CHAIR metric is a rule-based metric for evaluating object hallucination in caption generation.
☆23Updated last year
ml-researcher / VAE
☆10Updated 2 years ago
njucckevin / MM-Self-Improve
A Self-Training Framework for Vision-Language Reasoning
☆16Updated last week
MikaStars39 / StableMask
PyTorch implementation of StableMask (ICML'24)
☆12Updated 4 months ago
yfzhang114 / LLaVA-Align
This is the official repo for Debiasing Large Visual Language Models, including a Post-Hoc debias method and Visual Debias Decoding strat…
☆72Updated 7 months ago
JackYFL / 2020_gkd_PatternRecognition
2020年秋国科大模式识别（刘成林、向世明、张煦尧）课后作业
☆9Updated 3 years ago
gzcch / Bingo
☆53Updated 7 months ago
liuxuannan / MMFakeBench
MMFakeBench: A Mixed-Source Multimodal Misinformation Detection Benchmark for LVLMs
☆14Updated 3 months ago
sail-sg / sdft
[ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".
☆100Updated 2 weeks ago
luka-group / mDPO
[EMNLP 2024] mDPO: Conditional Preference Optimization for Multimodal Large Language Models.
☆32Updated last week
findalexli / mllm-dpo
[ACL 2024] Multi-modal preference alignment remedies regression of visual instruction tuning on language model
☆25Updated last week
kugwzk / DiDE
Code for EMNLP 2022 paper “Distilled Dual-Encoder Model for Vision-Language Understanding”
☆29Updated last year
edchengg / infoseek_eval
EMNLP2023 - InfoSeek: A New VQA Benchmark focus on Visual Info-Seeking Questions
☆16Updated 5 months ago
jiaangli / VLCA
Do Vision and Language Models Share Concepts? A Vector Space Alignment Study
☆12Updated last month
Skytliang / COT-Reading-List
☆22Updated last year
ml-researcher / diffusion
☆22Updated 2 years ago
LightChen233 / M3CoT
☆38Updated 5 months ago
albertwy / GPT-4V-Evaluation
Data for evaluating GPT-4V
☆11Updated last year
IMNearth / CoAT
Official implementation for "Android in the Zoo: Chain-of-Action-Thought for GUI Agents" (Findings of EMNLP 2024)
☆48Updated last month
choosewhatulike / cluster-clip
Multi-GPU supported kmeans clustering for cluser-clip
☆9Updated 5 months ago
Jihuai-wpy / InferAligner
☆25Updated last month
ECNU-ICALK / Foundation-LMs-based-Continual-Learning
☆13Updated 2 months ago
codezakh / SelTDA
[CVPR 23] Q: How to Specialize Large Vision-Language Models to Data-Scarce VQA Tasks? A: Self-Train on Unlabeled Images!
☆14Updated 6 months ago