rom1504 / generic-mcp-client-chatLinks
Generic MCP Client to use any MCP tool in a chat
☆44Updated 4 months ago
Alternatives and similar repositories for generic-mcp-client-chat
Users that are interested in generic-mcp-client-chat are comparing it to the libraries listed below
Sorting:
- open source alpha evolve☆67Updated 4 months ago
- Implementation of Mind Evolution, Evolving Deeper LLM Thinking, from Deepmind☆56Updated 3 months ago
- ☆63Updated last week
- minimal GRPO implementation from scratch☆97Updated 6 months ago
- Official implementation of the paper: "ZClip: Adaptive Spike Mitigation for LLM Pre-Training".☆133Updated 2 weeks ago
- ☆158Updated 2 months ago
- A list of language models with permissive licenses such as MIT or Apache 2.0☆24Updated 6 months ago
- A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIM☆59Updated last year
- Official repository for the paper "NeuZip: Memory-Efficient Training and Inference with Dynamic Compression of Neural Networks". This rep…☆60Updated 10 months ago
- working implimention of deepseek MLA☆44Updated 8 months ago
- ICLR 2025 - official implementation for "I-Con: A Unifying Framework for Representation Learning"☆111Updated 3 months ago
- ☆56Updated 10 months ago
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data☆21Updated last year
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆84Updated last year
- Train, tune, and infer Bamba model☆132Updated 3 months ago
- Implementation of the Llama architecture with RLHF + Q-learning☆166Updated 7 months ago
- A minimal implementation of LLaVA-style VLM with interleaved image & text & video processing ability.☆96Updated 9 months ago
- ☆33Updated 8 months ago
- Training framework for Large Behavioral Models☆24Updated last week
- Collection of autoregressive model implementation☆86Updated 5 months ago
- Code, results and other artifacts from the paper introducing the WildChat-50m dataset and the Re-Wild model family.☆30Updated 5 months ago
- Pretraining Code for METAGENE-1☆68Updated 8 months ago
- An open source replication of the stawberry method that leverages Monte Carlo Search with PPO and or DPO☆29Updated this week
- Explorations into improving ViTArc with Slot Attention☆42Updated 11 months ago
- Implementation of the dynamic chunking mechanism in H-net by Hwang et al. of Carnegie Mellon☆64Updated last month
- A easy, reliable, fluid template for python packages complete with docs, testing suites, readme's, github workflows, linting and much muc…☆188Updated 2 weeks ago
- Source code for the collaborative reasoner research project at Meta FAIR.☆103Updated 5 months ago
- Implementation of the proposed DeepCrossAttention by Heddes et al at Google research, in Pytorch☆93Updated 7 months ago
- Flash Attention Triton kernel with support for second-order derivatives☆88Updated this week
- [NeurIPS 2025] Elevating Visual Perception in Multimodal LLMs with Auxiliary Embedding Distillation, arXiv 2024☆62Updated this week