johanndiep / language-models-trajectory-generators
This repository is a fork of "language-models-trajectory-generators". The goal is to test the same functionality with Mistral's LLM.
☆21Updated 6 months ago
Alternatives and similar repositories for language-models-trajectory-generators:
Users who are interested in language-models-trajectory-generators are comparing it to the libraries listed below:
- ☆21Updated 2 months ago
- Using multiple LLMs for ensemble Forecasting☆16Updated last year
- ☆20Updated last year
- ☆17Updated 2 months ago
- An intelligent code optimization system leveraging AI analysis, automated refactoring, and test generation. Built with DSPy and Gradio, i…☆18Updated 2 months ago
- Tools for merging pretrained large language models.☆19Updated 10 months ago
- ☆18Updated 6 months ago
- LLM reads a paper and produces a working prototype☆52Updated 2 weeks ago
- ☆19Updated 8 months ago
- An example implementation of RLHF (or, more accurately, RLAIF) built on MLX and HuggingFace.☆25Updated 10 months ago
- ☆38Updated 9 months ago
- ☆29Updated last year
- ☆48Updated 5 months ago
- ☆12Updated last month
- Simple GRPO scripts and configurations.☆58Updated 2 months ago
- ☆28Updated 5 months ago
- A list of language models with permissive licenses such as MIT or Apache 2.0☆24Updated last month
- BH hackathon☆14Updated last year
- ☆18Updated last month
- QLoRA for Masked Language Modeling☆22Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMs☆29Updated last year
- An alternative way of calculating self-attention☆18Updated 11 months ago
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo)☆77Updated last month
- Very minimal (and stateless) agent framework☆42Updated 3 months ago
- Build Agentic workflows with function calling using open LLMs☆26Updated 2 weeks ago
- ☆16Updated 11 months ago
- tickr-agent is an enterprise-ready, scalable Python library for building swarms of financial agents that conduct comprehensive stock anal…☆43Updated this week
- Testing paligemma2 finetuning on reasoning dataset☆18Updated 3 months ago
- Fast approximate inference on a single GPU with sparsity aware offloading☆38Updated last year
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆33Updated last year