gabrielchua / daily-ai-papers
All credits go to HuggingFace's Daily AI papers (https://huggingface.co/papers) and the research community. Audio summaries here (https://t.me/daily_ai_papers).
☆210 · Updated 2 months ago
Alternatives and similar repositories for daily-ai-papers
Users who are interested in daily-ai-papers are comparing it to the repositories listed below.
- ☆185 · Updated last month
- ☆86 · Updated last year
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding. ☆174 · Updated 11 months ago
- Banishing LLM Hallucinations Requires Rethinking Generalization ☆276 · Updated last year
- An automated tool for discovering insights from research paper corpora ☆138 · Updated last year
- Tina: Tiny Reasoning Models via LoRA ☆310 · Updated 3 months ago
- GRadient-INformed MoE ☆265 · Updated last year
- Easy to use, High Performant Knowledge Distillation for LLMs ☆97 · Updated 7 months ago
- Finetune Llama-3-8b on the MathInstruct dataset ☆116 · Updated last year
- ☆173 · Updated 9 months ago
- Exploring Applications of GRPO ☆251 · Updated 4 months ago
- minimal GRPO implementation from scratch ☆101 · Updated 9 months ago
- Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars… ☆365 · Updated last year
- Solving data for LLMs - Create quality synthetic datasets! ☆150 · Updated 11 months ago
- Collection of scripts and notebooks for OpenAI's latest GPT OSS models ☆486 · Updated 4 months ago
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling. ☆355 · Updated 6 months ago
- ⚖️ Awesome LLM Judges ⚖️ ☆146 · Updated 8 months ago
- [ACL'25] Official Code for LlamaDuo: LLMOps Pipeline for Seamless Migration from Service LLMs to Small-Scale Local LLMs ☆314 · Updated 5 months ago
- ☆120 · Updated last year
- Code for ExploreTom ☆89 · Updated 6 months ago
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M… ☆245 · Updated last year
- Code for Husky, an open-source language agent that solves complex, multi-step reasoning tasks. Husky v1 addresses numerical, tabular and … ☆347 · Updated last year
- ☆98 · Updated 2 weeks ago
- The official evaluation suite and dynamic data release for MixEval. ☆253 · Updated last year
- awesome synthetic (text) datasets ☆315 · Updated last month
- WebLINX is a benchmark for building web navigation agents with conversational capabilities ☆156 · Updated 10 months ago
- Code and data for the Chain-of-Draft (CoD) paper ☆337 · Updated 9 months ago
- Video+code lecture on building nanoGPT from scratch ☆68 · Updated last year
- ☆137 · Updated last year
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere) ☆109 · Updated 9 months ago