multimodal-art-projection / AutoKaggle
☆96Updated 3 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for AutoKaggle
- Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT)☆53Updated 2 months ago
- Source code of the paper: RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question Answering [F…☆58Updated 5 months ago
- Official implementation of paper "On the Diagram of Thought" (https://arxiv.org/abs/2409.10038)☆170Updated 2 months ago
- Block Transformer: Global-to-Local Language Modeling for Fast Inference (Official Code)☆135Updated last month
- ☆45Updated 2 months ago
- PyTorch Implementation of Jamba: "Jamba: A Hybrid Transformer-Mamba Language Model"☆137Updated last week
- LoRA and DoRA from Scratch Implementations☆188Updated 8 months ago
- ☆83Updated 2 months ago
- Expert Specialized Fine-Tuning☆145Updated last month
- ☆50Updated 4 months ago
- Official Implementation of "Multi-Head RAG: Solving Multi-Aspect Problems with LLMs"☆175Updated 2 weeks ago
- From scratch implementation of a vision language model in pure PyTorch☆162Updated 6 months ago
- A minimal implementation of LLaVA-style VLM with interleaved image & text & video processing ability.☆84Updated 2 months ago
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M…☆180Updated 3 weeks ago
- Code for "LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding", ACL 2024☆229Updated 3 weeks ago
- ☆106Updated 2 months ago
- a curated list of the role of small models in the LLM era☆77Updated last month
- [NeurIPS 24 Spotlight] MaskLLM: Learnable Semi-structured Sparsity for Large Language Models☆118Updated 3 weeks ago
- Survey of Small Language Models from Penn State, ...☆70Updated this week
- Codebase accompanying the Summary of a Haystack paper.☆72Updated 2 months ago
- The first dense retrieval model that can be prompted like an LM☆63Updated 2 months ago
- ☆131Updated 4 months ago
- ☆59Updated last month
- ☆69Updated 3 weeks ago
- ☆59Updated 5 months ago
- AWM: Agent Workflow Memory☆208Updated last month
- Repository for “PlanRAG: A Plan-then-Retrieval Augmented Generation for Generative Large Language Models as Decision Makers”, NAACL24☆126Updated 5 months ago
- Co-LLM: Learning to Decode Collaboratively with Multiple Language Models☆103Updated 6 months ago
- What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective☆40Updated 3 weeks ago
- DSBench: How Far are Data Science Agents from Becoming Data Science Experts?☆35Updated last month