☆52Feb 5, 2025Updated last year
Alternatives and similar repositories for transformers_zamba2
Users who are interested in transformers_zamba2 are comparing it to the libraries listed below.
- PyTorch implementation of models from the Zamba2 series.☆193Jan 23, 2025Updated last year
- ☆47Jun 10, 2025Updated 10 months ago
- ☆12Mar 31, 2026Updated 2 weeks ago
- REST API for Large Language Models using FastAPI, Redis and LiteLLM☆14Nov 30, 2023Updated 2 years ago
- An LLM-powered Slack bot that uses an OpenAI API backend (LlamaCPP, Ollama, etc.)☆13Updated this week
- Stream live plots to a matplotlib figure☆81Apr 18, 2025Updated 11 months ago
- Learning records for building a large language model from scratch☆59Jan 1, 2025Updated last year
- ☆13Nov 29, 2024Updated last year
- Pin files for contextual, codebase-level AI assistance.☆16Jul 11, 2024Updated last year
- Generate workflows (for flowcharts or low code) via LLM. Also describe workflow given in DOT.☆18Nov 2, 2023Updated 2 years ago
- Abacus (珠算) code large language model (Abacus Code LLM)☆58Sep 26, 2024Updated last year
- ☆16Jul 8, 2024Updated last year
- A simple Python tool to measure the performance of ONNX models.☆27Sep 15, 2024Updated last year
- ☆185Oct 13, 2023Updated 2 years ago
- Yet another random morning idea to be quickly tried and architecture shared if it works; to allow the transformer to pause for any amount…☆53Oct 22, 2023Updated 2 years ago
- Large Language Model in Action☆343Jan 28, 2025Updated last year
- Enable tool-use ability for any LLM model (DeepSeek V3/R1, etc.)☆58May 27, 2025Updated 10 months ago
- Exploring how ChatGPT can be used to accelerate research in cosmology.☆13Dec 12, 2022Updated 3 years ago
- Quadra: Effortless and reproducible deep learning workflows with configuration files.☆50Feb 23, 2026Updated last month
- ☆21Nov 23, 2021Updated 4 years ago
- These are my lecture notes and code for Coursera online course Functional Programming Principles in Scala by Prof. Martin Odersky from Éc…☆21Jan 7, 2024Updated 2 years ago
- a flying dog eating bones☆19Jun 22, 2024Updated last year
- Train, tune, and infer Bamba model☆138Jun 4, 2025Updated 10 months ago
- Collection of radicle binaries.☆20Oct 5, 2021Updated 4 years ago
- ☆21Oct 22, 2021Updated 4 years ago
- Julia package for robust Pade approximation.☆12Jul 28, 2022Updated 3 years ago
- Applying "Load What You Need: Smaller Versions of Multilingual BERT" to LaBSE☆19Sep 22, 2021Updated 4 years ago
- ☆21Jun 15, 2024Updated last year
- An open-source NLP library: fast text cleaning and preprocessing☆23Nov 9, 2021Updated 4 years ago
- ForOpenAI - A Fortran library for OpenAI API.☆20Jan 10, 2024Updated 2 years ago
- BFloat16 Fused Adam Operator for PyTorch☆19Nov 16, 2024Updated last year
- This repository contains some of the code used in the paper "Training Language Models with Language Feedback at Scale"☆27Mar 30, 2023Updated 3 years ago
- Semantic alignment of astronomical data with natural language using multi-modal models. (Jax) Code associated with https://arxiv.org/abs/…☆17Oct 18, 2024Updated last year
- Small Python package to measure OCR quality and other related metrics.☆27Feb 19, 2024Updated 2 years ago
- C++ component of the QML.jl package☆10Updated this week
- ☆13Mar 1, 2022Updated 4 years ago
- SemEval 2017 Financial Sentiment Task 5 code☆11Sep 30, 2020Updated 5 years ago
- ☆12Dec 22, 2024Updated last year
- Simple implementation of Gwern's AUNN proposal☆15Oct 5, 2025Updated 6 months ago