johanndiep / language-models-trajectory-generatorsLinks
This repository contains a fork from "language-models-trajectory-generators", the goal is to test the same functionality with Mistrals LLM.
☆21Updated last year
Alternatives and similar repositories for language-models-trajectory-generators
Users that are interested in language-models-trajectory-generators are comparing it to the libraries listed below
Sorting:
- ☆17Updated 8 months ago
- An automated tool for discovering insights from research papaer corpora☆138Updated last year
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆91Updated 8 months ago
- ☆21Updated 8 months ago
- Simple GRPO scripts and configurations.☆59Updated 8 months ago
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆84Updated last year
- ☆102Updated last year
- Testing paligemma2 finetuning on reasoning dataset☆18Updated 9 months ago
- ☆19Updated last year
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆33Updated 5 months ago
- An example implementation of RLHF (or, more accurately, RLAIF) built on MLX and HuggingFace.☆32Updated last year
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models☆111Updated 6 months ago
- Machine Learning Serving focused on GenAI with simplicity as the top priority.☆58Updated last week
- Using the moondream VLM with optical flow for promptable object tracking☆72Updated 7 months ago
- The original BabyAGI, updated with LiteLLM and no vector database reliance (csv instead)☆21Updated last year
- Unofficial implementation and experiments related to Set-of-Mark (SoM) 👁️☆87Updated last year
- A framework for orchestrating AI agents using a mermaid graph☆77Updated last year
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo)☆88Updated last month
- An intelligent code optimization system leveraging AI analysis, automated refactoring, and test generation. Built with DSPy and Gradio, i…☆20Updated 8 months ago
- Using multiple LLMs for ensemble Forecasting☆16Updated last year
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆96Updated last week
- Scripts to create your own moe models using mlx☆90Updated last year
- ☆20Updated last year
- ☆68Updated 4 months ago
- ☆116Updated 10 months ago
- Code for our paper PAPILLON: PrivAcy Preservation from Internet-based and Local Language MOdel ENsembles☆58Updated 5 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆55Updated 8 months ago
- ☆49Updated 8 months ago
- The next evolution of Agents☆47Updated this week
- Fast approximate inference on a single GPU with sparsity aware offloading☆38Updated last year