gabrielchua / daily-ai-papersLinks
All credits go to HuggingFace's Daily AI papers (https://huggingface.co/papers) and the research community. πAudio summaries here (https://t.me/daily_ai_papers).
β194Updated this week
Alternatives and similar repositories for daily-ai-papers
Users that are interested in daily-ai-papers are comparing it to the libraries listed below
Sorting:
- β86Updated 11 months ago
- β174Updated 3 weeks ago
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.β173Updated 7 months ago
- Banishing LLM Hallucinations Requires Rethinking Generalizationβ276Updated last year
- Collection of scripts and notebooks for OpenAI's latest GPT OSS modelsβ398Updated 2 weeks ago
- minimal GRPO implementation from scratchβ96Updated 5 months ago
- β118Updated last year
- Easy to use, High Performant Knowledge Distillation for LLMsβ92Updated 3 months ago
- An automated tool for discovering insights from research papaer corporaβ138Updated last year
- From scratch implementation of a vision language model in pure PyTorchβ235Updated last year
- GRadient-INformed MoEβ264Updated 11 months ago
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?β74Updated 5 months ago
- Arxflix turns your boring Arxiv research paper into a captivating video.β52Updated 2 weeks ago
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.β335Updated 2 months ago
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language Mβ¦β238Updated 10 months ago
- A compact LLM pretrained in 9 days by using high quality dataβ322Updated 4 months ago
- LoRA and DoRA from Scratch Implementationsβ210Updated last year
- Solving data for LLMs - Create quality synthetic datasets!β151Updated 7 months ago
- Tina: Tiny Reasoning Models via LoRAβ278Updated 2 weeks ago
- Simple examples using Argilla tools to build AIβ54Updated 9 months ago
- β102Updated last year
- Maya: An Instruction Finetuned Multilingual Multimodal Model using Ayaβ117Updated 3 weeks ago
- [ACL'25] Official Code for LlamaDuo: LLMOps Pipeline for Seamless Migration from Service LLMs to Small-Scale Local LLMsβ314Updated last month
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorchβ102Updated 8 months ago
- π€ Benchmark Large Language Models Reliably On Your Dataβ387Updated this week
- Exploring Applications of GRPOβ246Updated this week
- This is an open-source version of OpenAI's O1 Model Series by Siraj Raval & O1-Previewβ96Updated 10 months ago
- Simple & Scalable Pretraining for Neural Architecture Researchβ289Updated last week
- Build your own visual reasoning modelβ407Updated last week
- Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, sparsβ¦β346Updated 8 months ago