gabrielchua / daily-ai-papers
All credits go to HuggingFace's Daily AI papers (https://huggingface.co/papers) and the research community. πAudio summaries here (https://t.me/daily_ai_papers).
β168Updated this week
Alternatives and similar repositories for daily-ai-papers:
Users that are interested in daily-ai-papers are comparing it to the libraries listed below
- β85Updated 7 months ago
- β144Updated last month
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.β171Updated 3 months ago
- βοΈ Awesome LLM Judges βοΈβ93Updated 2 months ago
- Solving data for LLMs - Create quality synthetic datasets!β146Updated 3 months ago
- π€ Benchmark Large Language Models Reliably On Your Dataβ240Updated last week
- The official evaluation suite and dynamic data release for MixEval.β235Updated 5 months ago
- From scratch implementation of a vision language model in pure PyTorchβ213Updated 11 months ago
- awesome synthetic (text) datasetsβ272Updated 5 months ago
- A user interface for DSPyβ143Updated 6 months ago
- Exploring Applications of GRPOβ185Updated last week
- Let's build better datasets, together!β259Updated 4 months ago
- Single File, Single GPU, From Scratch, Efficient, Full Parameter Tuning library for "RL for LLMs"β419Updated 2 weeks ago
- An automated tool for discovering insights from research papaer corporaβ138Updated 10 months ago
- Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, sparsβ¦β320Updated 4 months ago
- β122Updated last month
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)β96Updated last month
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language Mβ¦β213Updated 5 months ago
- Code release for "LLMs can see and hear without any training"β239Updated 2 months ago
- Code for "LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding", ACL 2024β289Updated last week
- Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. π¨π»βπ³β275Updated this week
- LoRA and DoRA from Scratch Implementationsβ202Updated last year
- minimal GRPO implementation from scratchβ85Updated last month
- Finetune Llama-3-8b on the MathInstruct datasetβ110Updated 6 months ago
- [ACL'24] Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuningβ354Updated 7 months ago
- Build your own visual reasoning modelβ341Updated this week
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokensβ139Updated 2 months ago
- β117Updated 7 months ago
- Maya: An Instruction Finetuned Multilingual Multimodal Model using Ayaβ108Updated 2 months ago
- An extension of the nanoGPT repository for training small MOE models.β131Updated last month