gabrielchua / daily-ai-papers
All credits go to HuggingFace's Daily AI papers (https://huggingface.co/papers) and the research community. Audio summaries here (https://t.me/daily_ai_papers).
★184 · Updated this week
Alternatives and similar repositories for daily-ai-papers
Users who are interested in daily-ai-papers are comparing it to the libraries listed below.
- ★86 · Updated 9 months ago
- ★158 · Updated last month
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding. ★173 · Updated 5 months ago
- An automated tool for discovering insights from research paper corpora ★138 · Updated last year
- Finetune Llama-3-8b on the MathInstruct dataset ★110 · Updated 8 months ago
- Awesome LLM Judges ★105 · Updated last month
- Easy to use, highly performant knowledge distillation for LLMs ★86 · Updated last month
- Code and data for the paper "Why think step by step? Reasoning emerges from the locality of experience" ★60 · Updated 2 months ago
- LoRA and DoRA from Scratch Implementations ★204 · Updated last year
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning? ★68 · Updated 3 months ago
- Arxflix turns your boring Arxiv research paper into a captivating video. ★51 · Updated 3 weeks ago
- Simple examples using Argilla tools to build AI ★53 · Updated 7 months ago
- CodeScientist: An automated scientific discovery system for code-based experiments ★273 · Updated this week
- Solving data for LLMs - Create quality synthetic datasets! ★149 · Updated 5 months ago
- Code for "LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding", ACL 2024 ★311 · Updated last month
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M… ★225 · Updated 7 months ago
- Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. 👨🏻‍🍳 ★314 · Updated 3 weeks ago
- Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars… ★339 · Updated 6 months ago
- ★156 · Updated 3 months ago
- ★127 · Updated 3 months ago
- AWM: Agent Workflow Memory ★279 · Updated 4 months ago
- The first dense retrieval model that can be prompted like an LM ★73 · Updated last month
- Lightweight toolkit package to train and fine-tune 1.58bit Language models ★80 · Updated last month
- Documentation, notes, links, etc for streams. ★80 · Updated last year
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling. ★122 · Updated this week
- Training an LLM to use a calculator with multi-turn reinforcement learning, achieving a **62% absolute increase in evaluation accuracy**. ★41 · Updated last month
- minimal GRPO implementation from scratch ★90 · Updated 3 months ago
- ⏰ AI conference deadline countdowns ★265 · Updated 2 weeks ago
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere) ★101 · Updated 3 months ago
- Tina: Tiny Reasoning Models via LoRA ★260 · Updated 3 weeks ago