katzurik / Knowledge_Navigator
☆12Updated 3 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for Knowledge_Navigator
- Learning to Retrieve by Trying - Source code for Grounding by Trying: LLMs with Reinforcement Learning-Enhanced Retrieval☆24Updated last week
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"☆23Updated last week
- Training hybrid models for dummies.☆15Updated 2 weeks ago
- A Data Source for Reasoning Embodied Agents☆19Updated last year
- DPO, but faster 🚀☆21Updated 2 weeks ago
- SCREWS: A Modular Framework for Reasoning with Revisions☆26Updated last year
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆33Updated 8 months ago
- A testbed for agents and environments that can automatically improve models through data generation.☆12Updated this week
- ☆26Updated 4 months ago
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer☆36Updated 7 months ago
- Tools for merging pretrained large language models.☆19Updated 5 months ago
- ☆13Updated last year
- ☆40Updated this week
- Visual RAG using less than 300 lines of code.☆23Updated 8 months ago
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆20Updated 9 months ago
- ☆13Updated last year
- implementation of https://arxiv.org/pdf/2312.09299☆19Updated 4 months ago
- Efficient Dictionary Learning with Switch Sparse Autoencoders (SAEs)☆13Updated last month
- ☆12Updated last week
- ☆41Updated last month
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset☆13Updated 8 months ago
- Using multiple LLMs for ensemble Forecasting☆16Updated 9 months ago
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts☆22Updated 8 months ago
- Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" from Pytorch and Zeta☆13Updated this week
- ☆11Updated 3 weeks ago
- ☆57Updated last month
- LLM reads a paper and produce a working prototype☆34Updated this week
- BH hackathon☆14Updated 7 months ago
- QLoRA for Masked Language Modeling☆20Updated last year
- A list of language models with permissive licenses such as MIT or Apache 2.0☆22Updated last week