gabrielchua / daily-ai-papersLinks
All credits go to HuggingFace's Daily AI papers (https://huggingface.co/papers) and the research community. πAudio summaries here (https://t.me/daily_ai_papers).
β198Updated this week
Alternatives and similar repositories for daily-ai-papers
Users that are interested in daily-ai-papers are comparing it to the libraries listed below
Sorting:
- β86Updated last year
- β177Updated 2 months ago
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.β172Updated 8 months ago
- minimal GRPO implementation from scratchβ98Updated 6 months ago
- GRadient-INformed MoEβ264Updated last year
- An automated tool for discovering insights from research papaer corporaβ139Updated last year
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.β343Updated 3 months ago
- Collection of scripts and notebooks for OpenAI's latest GPT OSS modelsβ456Updated last month
- Code for ExploreTomβ86Updated 3 months ago
- Easy to use, High Performant Knowledge Distillation for LLMsβ93Updated 5 months ago
- Tina: Tiny Reasoning Models via LoRAβ290Updated 2 weeks ago
- Solving data for LLMs - Create quality synthetic datasets!β151Updated 8 months ago
- From scratch implementation of a vision language model in pure PyTorchβ243Updated last year
- β119Updated last year
- Simple examples using Argilla tools to build AIβ56Updated 10 months ago
- Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, sparsβ¦β342Updated 9 months ago
- Banishing LLM Hallucinations Requires Rethinking Generalizationβ275Updated last year
- β170Updated 7 months ago
- β93Updated 3 months ago
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?β77Updated 6 months ago
- β102Updated last year
- β218Updated 7 months ago
- Exploring Applications of GRPOβ248Updated last month
- Train your own SOTA deductive reasoning modelβ107Updated 7 months ago
- A compact LLM pretrained in 9 days by using high quality dataβ328Updated 6 months ago
- β78Updated last week
- [EMNLP 2025] The official implementation for paper "Agentic-R1: Distilled Dual-Strategy Reasoning"β100Updated last month
- Arxflix turns your boring Arxiv research paper into a captivating video.β55Updated 3 weeks ago
- ScreenSuite - The most comprehensive benchmarking suite for GUI Agents!β123Updated last week
- Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT)β119Updated 8 months ago