gabrielchua / daily-ai-papers
All credits go to HuggingFace's Daily AI papers (https://huggingface.co/papers) and the research community. Audio summaries here (https://t.me/daily_ai_papers).
★211 · Updated 2 months ago
Alternatives and similar repositories for daily-ai-papers
Users who are interested in daily-ai-papers are comparing it to the libraries listed below.
- ★87 · Updated last year
- ★182 · Updated 2 months ago
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding. ★175 · Updated last year
- Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars… ★370 · Updated last year
- minimal GRPO implementation from scratch ★102 · Updated 10 months ago
- ★120 · Updated last year
- Banishing LLM Hallucinations Requires Rethinking Generalization ★276 · Updated last year
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling. ★358 · Updated 7 months ago
- GRadient-INformed MoE ★264 · Updated last year
- Easy to use, High Performant Knowledge Distillation for LLMs ★96 · Updated 8 months ago
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning? ★88 · Updated 10 months ago
- [ACL'25] Official Code for LlamaDuo: LLMOps Pipeline for Seamless Migration from Service LLMs to Small-Scale Local LLMs ★314 · Updated 6 months ago
- Tina: Tiny Reasoning Models via LoRA ★313 · Updated 4 months ago
- An automated tool for discovering insights from research paper corpora ★137 · Updated last year
- Exploring Applications of GRPO ★251 · Updated 5 months ago
- ★365 · Updated 5 months ago
- Awesome LLM Judges ★148 · Updated 8 months ago
- Build your own visual reasoning model ★417 · Updated 2 weeks ago
- Code for ExploreTom ★90 · Updated 7 months ago
- From scratch implementation of a vision language model in pure PyTorch ★253 · Updated last year
- ★176 · Updated 10 months ago
- Official repo for "Make Your LLM Fully Utilize the Context" ★261 · Updated last year
- Simple examples using Argilla tools to build AI ★57 · Updated last year
- ★101 · Updated last year
- Solving data for LLMs - Create quality synthetic datasets! ★151 · Updated last year
- Collection of scripts and notebooks for OpenAI's latest GPT OSS models ★495 · Updated 5 months ago
- A compact LLM pretrained in 9 days by using high quality data ★340 · Updated 9 months ago
- ★137 · Updated last year
- Train your own SOTA deductive reasoning model ★107 · Updated 10 months ago
- Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya ★125 · Updated 5 months ago