gabrielchua / daily-ai-papers
All credits go to HuggingFace's Daily AI papers (https://huggingface.co/papers) and the research community. Audio summaries here (https://t.me/daily_ai_papers).
☆188 · Updated this week
Alternatives and similar repositories for daily-ai-papers
Users that are interested in daily-ai-papers are comparing it to the libraries listed below
- ☆86 · Updated 9 months ago
- ☆162 · Updated 2 months ago
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding. ☆173 · Updated 6 months ago
- Banishing LLM Hallucinations Requires Rethinking Generalization ☆276 · Updated last year
- Solving data for LLMs - Create quality synthetic datasets! ☆150 · Updated 5 months ago
- GRadient-INformed MoE ☆263 · Updated 9 months ago
- minimal GRPO implementation from scratch ☆92 · Updated 4 months ago
- Awesome LLM Judges ☆107 · Updated 2 months ago
- A compact LLM pretrained in 9 days by using high quality data ☆318 · Updated 3 months ago
- Tina: Tiny Reasoning Models via LoRA ☆268 · Updated last month
- Benchmark Large Language Models Reliably On Your Data ☆359 · Updated last week
- ☆162 · Updated 4 months ago
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling. ☆309 · Updated 3 weeks ago
- ☆118 · Updated 10 months ago
- Easy to use, High Performant Knowledge Distillation for LLMs ☆88 · Updated 2 months ago
- [ACL'25] Official Code for LlamaDuo: LLMOps Pipeline for Seamless Migration from Service LLMs to Small-Scale Local LLMs ☆311 · Updated this week
- Finetune Llama-3-8b on the MathInstruct dataset ☆110 · Updated 9 months ago
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning? ☆71 · Updated 4 months ago
- Simple examples using Argilla tools to build AI ☆53 · Updated 8 months ago
- Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars… ☆341 · Updated 7 months ago
- Official repo for "Make Your LLM Fully Utilize the Context" ☆252 · Updated last year
- ☆210 · Updated 4 months ago
- Code for "LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding", ACL 2024 ☆318 · Updated 2 months ago
- Code for ExploreTom ☆84 · Updated 3 weeks ago
- Prompt-to-Leaderboard ☆241 · Updated 2 months ago
- Build your own visual reasoning model ☆395 · Updated last week
- An automated tool for discovering insights from research paper corpora ☆138 · Updated last year
- Code for Husky, an open-source language agent that solves complex, multi-step reasoning tasks. Husky v1 addresses numerical, tabular and … ☆345 · Updated last year
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M… ☆229 · Updated 8 months ago
- Exploring Applications of GRPO ☆243 · Updated last week