Llama2D / llama2d
2D Positional Embeddings for Webpage Structural Understanding
⭐93 Updated 2 weeks ago
Related projects:
- Enterprise RAG Challenge to test accuracy of different LLM-driven assistants ⭐24 Updated 2 weeks ago
- Finetune Llama-3-8b on the MathInstruct dataset ⭐91 Updated 3 weeks ago
- Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper. ⭐101 Updated last week
- Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first app… ⭐161 Updated 8 months ago
- Official implementation of the paper "Linear Transformers with Learnable Kernel Functions are Better In-Context Models" ⭐152 Updated 7 months ago
- The implementation of "Leeroo Orchestrator: Elevating LLMs Performance Through Model Integration" ⭐50 Updated 4 months ago
- ⭐26 Updated last month
- MERA (Multimodal Evaluation for Russian-language Architectures) is a new open benchmark for the Russian language for evaluating fundament… ⭐57 Updated this week
- ⭐89 Updated 11 months ago
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto… ⭐192 Updated 4 months ago
- ⭐53 Updated this week
- An implementation of Self-Extend, which expands the context window via grouped attention ⭐117 Updated 8 months ago
- Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI ⭐223 Updated 4 months ago
- ⭐51 Updated last week
- Source code of the paper: RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question Answering [F… ⭐57 Updated 3 months ago
- ⭐75 Updated 3 weeks ago
- σ-GPT: A New Approach to Autoregressive Models ⭐53 Updated last month
- Set of scripts to finetune LLMs ⭐36 Updated 5 months ago
- Comprehensive analysis of the performance differences between QLoRA, LoRA, and full fine-tunes. ⭐81 Updated last year
- An automated tool for discovering insights from research paper corpora ⭐131 Updated 3 months ago
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free ⭐217 Updated 6 months ago
- ⭐59 Updated last week
- GRDN.AI app for garden optimization ⭐68 Updated 7 months ago
- ⭐48 Updated 11 months ago
- MiniHF is an inference, human preference data collection, and fine-tuning tool for local language models. It is intended to help the user… ⭐143 Updated this week
- Full finetuning of large language models without large memory requirements ⭐94 Updated 8 months ago
- Just a bunch of benchmark logs for different LLMs ⭐112 Updated last month
- ⭐101 Updated 6 months ago
- OmniFusion: a multimodal model to communicate using text and images ⭐229 Updated 4 months ago
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. ⭐48 Updated 3 weeks ago