Xuchen-Li/llm-arxiv-daily

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Xuchen-Li/llm-arxiv-daily)

Xuchen-Li / llm-arxiv-daily

Automatically update arXiv papers about LLM Reasoning, LLM Evaluation, LLM & MLLM and Video Understanding using Github Actions.

☆143

Alternatives and similar repositories for llm-arxiv-daily

Users that are interested in llm-arxiv-daily are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

sarahmart / HARDMath
View on GitHub
A new dataset of difficult graduate-level applied mathematics problems; evaluations demonstrate that leading LLMs currently exhibit low a…
☆29Feb 14, 2025Updated last year
KejiaZhang-Robust / Academic-paper-writing
View on GitHub
From Scratch to Submission: A Complete Guide to Academic Conference Paper Writing
☆32Sep 26, 2025Updated 9 months ago
WailordHe / cv-arxiv-daily-wailord
View on GitHub
🎓Automatically Update CV Papers Daily using Github Actions (Update Every 12th hours)
☆12May 17, 2026Updated last month
safety-research / finetuning-auditor
View on GitHub
Auditing agents for fine-tuning safety
☆21Oct 21, 2025Updated 8 months ago
OrigamiSL / OTETrack
View on GitHub
Source code of the paper: Overlapped Trajectory-Enhanced Visual Tracking
☆11Sep 3, 2024Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
open-compass / GPassK
View on GitHub
[ACL 2025] Are Your LLMs Capable of Stable Reasoning?
☆33Aug 5, 2025Updated 11 months ago
SaFo-Lab / DoxBench
View on GitHub
[ICLR 2026] The official code for "Doxing via the Lens: Revealing Location-related Privacy Leakage on Multi-modal Large Reasoning Models"
☆28Feb 7, 2026Updated 4 months ago
Babelscape / ALERT
View on GitHub
Official repository for the paper "ALERT: A Comprehensive Benchmark for Assessing Large Language Models’ Safety through Red Teaming"
☆60Sep 20, 2024Updated last year
jonathan-roberts1 / SciFIBench
View on GitHub
NeurIPS 2024: SciFIBench: Benchmarking Large Multimodal Models for Scientific Figure Interpretation
☆13May 24, 2025Updated last year
XiaokunFeng / CTVLT
View on GitHub
[ICASSP'25] Enhancing Vision-Language Tracking by Effectively Converting Textual Cues into Visual Cues
☆19Dec 31, 2024Updated last year
fscdc / RewardMap
View on GitHub
[ICLR 2026] RewardMap: Tackling Sparse Rewards in Fine-grained Visual Reasoning via Multi-Stage Reinforcement Learning
☆45Feb 22, 2026Updated 4 months ago
thoppe / The-Pile-FreeLaw
View on GitHub
Download, parse, and filter data from Court Listener, part of the FreeLaw projects. Data-ready for The-Pile.
☆16Jun 3, 2023Updated 3 years ago
alipay / POA
View on GitHub
☆22Aug 8, 2024Updated last year
earth-insights / Advanced-Earth-Observation
View on GitHub
Paper List on Earth Observation in the Foundation Model Era
☆31Jun 15, 2026Updated 3 weeks ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
HengLan / Awesome-Visual-Tracking
View on GitHub
Awesome Visual Tracking
☆24Oct 3, 2025Updated 9 months ago
rohinmanvi / Capability-Aware-and-Mid-Generation-Self-Evaluations
View on GitHub
☆21Jul 25, 2025Updated 11 months ago
chenshihfang / GOT
View on GitHub
Can we make visual tracking systems align more closely with human visual perception?
☆41Updated this week
jmyoon1 / adp
View on GitHub
Implementation of "Adversarial purification with Score-based generative models", ICML 2021
☆30Oct 24, 2021Updated 4 years ago
gxy-gxy / DeepRAG
View on GitHub
DeepRAG: Thinking to Retrieve Step by Step for Large Language Models
☆39Feb 17, 2026Updated 4 months ago
cma1114 / activation_steering
View on GitHub
An exploration of LLM steering
☆28Jun 15, 2024Updated 2 years ago
Dereck0602 / Awesome_Test_Time_LLMs
View on GitHub
☆155Mar 12, 2025Updated last year
BasitAlawode / Best_of_N_Trackers
View on GitHub
☆25Dec 23, 2024Updated last year
Harvard-Ophthalmology-AI-Lab / FairVision
View on GitHub
[arXiv 2024] FairVision: Equitable Deep Learning for Eye Disease Screening via Fair Identity Scaling
☆18Apr 15, 2026Updated 2 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
hccngu / DialCoT
View on GitHub
DialCoT Meets PPO: Decomposing and Exploring Reasoning Paths in Smaller Language Models
☆13Nov 2, 2023Updated 2 years ago
suilin0432 / SoS-WSOD
View on GitHub
Salvage of Supervision in Weakly Supervised Object Detection, CVPR 2022
☆22Oct 25, 2022Updated 3 years ago
davestewart / bluesky-follower-info
View on GitHub
Display followers' profile descriptions in the Bluesky notifications feed
☆14Dec 12, 2024Updated last year
sastpg / CoVo
View on GitHub
Consistent Paths Lead to Truth: Self-Rewarding Reinforcement Learning for LLM Reasoning
☆25Jun 25, 2025Updated last year
MIND-Lab / SemEval2022-Task-5-Multimedia-Automatic-Misogyny-Identification-MAMI-
View on GitHub
SemEval 2022 Task 5: Multimedia Automatic Misogyny Identification - baseline models and dataset
☆15Nov 22, 2022Updated 3 years ago
atosystem / SSL_Interface
View on GitHub
Interface Design for Self-Supervised Speech Models, Accepted to Interspeech2024
☆16Nov 19, 2024Updated last year
RickySkywalker / TheoremLlama
View on GitHub
This is the official repository for all the code of TheoremLlama
☆47Aug 4, 2025Updated 11 months ago
KbsdJames / omni-math-rule
View on GitHub
The rule-based evaluation subset and code implementation of Omni-MATH
☆28Dec 23, 2024Updated last year
ucas-vg / P2Seg-Public
View on GitHub
☆10Jan 24, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
zjunlp / LookAheadTuning
View on GitHub
[WSDM 2026] LookAhead Tuning: Safer Language Models via Partial Answer Previews
☆18Dec 14, 2025Updated 6 months ago
RUCAIBox / SimpleDeepSearcher
View on GitHub
SimpleDeepSearcher: Deep Information Seeking via Web-Powered Reasoning Trajectory Synthesis
☆120Jun 3, 2025Updated last year
BytedTsinghua-SIA / Enigmata
View on GitHub
Resources for the Enigmata Project.
☆82Aug 13, 2025Updated 10 months ago
kilian-group / LMLM
View on GitHub
☆34Oct 31, 2025Updated 8 months ago
WooooDyy / MathCritique
View on GitHub
Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision".
☆55Nov 29, 2024Updated last year
D2I-ai / struxgpt
View on GitHub
[NeurIPS 2024] Official implementation of the paper "Enhancing LLM’s Cognition via Structurization"
☆24Aug 5, 2025Updated 11 months ago
d3n7 / riffusionDJ
View on GitHub
Multichannel Looper/Feedback System for Riffusion
☆14May 6, 2023Updated 3 years ago