run chatglm3-6b in BM1684X
☆39 · Mar 1, 2024 · Updated 2 years ago
Alternatives and similar repositories for ChatGLM3-TPU
Users that are interested in ChatGLM3-TPU are comparing it to the libraries listed below.
- run ChatGLM2-6B in BM1684X ☆49 · Mar 1, 2024 · Updated 2 years ago
- ☆44 · Jul 5, 2024 · Updated last year
- A whisper repo for TPU ☆11 · Jun 4, 2024 · Updated last year
- Text-to-speech and tone color conversion demo running on SG2300x; TTS plus instant voice cloning that combines OpenVoice and EmotiVoice ☆22 · Oct 30, 2024 · Updated last year
- This kernel adds support for running Docker on Sony Xperia 5 II (pdx206). ☆10 · Mar 14, 2023 · Updated 3 years ago
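- Local knowledge-base question answering for Sophon BM1684X, based on Langchain and language models such as ChatGLM ☆14 · Jun 5, 2024 · Updated last year
- Latest iPad protocol, micro-commerce (weishang) tools, private-domain management, group management, Taobao affiliate bot, WeChat Channels ☆12 · Mar 31, 2026 · Updated 2 weeks ago
- A tool for generating a Qwen2 MoE model from a Qwen2 (Qwen1.5) model ☆15 · Mar 29, 2024 · Updated 2 years ago
- My personal notes on the P4wnP1 ☆18 · Feb 18, 2024 · Updated 2 years ago
- ☆10 · Mar 11, 2024 · Updated 2 years ago
- Langchain-Chatchat for Sophon BM1684X: local knowledge-base question answering based on Langchain and language models such as ChatGLM ☆18 · May 23, 2024 · Updated last year
- ☆12 · Dec 5, 2023 · Updated 2 years ago
- Cvitek AI compiler based on MLIR ☆23 · Mar 14, 2022 · Updated 4 years ago
- This project mainly studies how large models perform on a standalone legal dataset. It currently supports training, prediction, validation, and online deployment of BELLE- and ChatGLM-related models, and also adds crawler code, Langchain, and database-backed prediction. ☆12 · Jul 16, 2023 · Updated 2 years ago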
- An agent-based writing tool that can be used for literature reviews, lesson plans, articles, novels, and more ☆24 · Jan 23, 2026 · Updated 2 months ago
- Examples for SophonSDK ☆108 · Aug 11, 2022 · Updated 3 years ago
- Sophgo AI chips driver and runtime library. ☆24 · Apr 3, 2026 · Updated 2 weeks ago
- C++ implementation of Qwen-LM ☆623 · Dec 6, 2024 · Updated last year
- C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V) ☆2,960 · Jul 31, 2024 · Updated last year
- pp_ocr_v4's ONNX version ☆25 · Jun 26, 2024 · Updated last year
- Local pty, USB UART and Telnet terminal client for Android. ☆13 · Jul 19, 2021 · Updated 4 years ago
- Lightning Fast: Faiss CPU + Onnx Quantized Multilingual Embedding Model ☆23 · Sep 13, 2024 · Updated last year
- Using OpenVINO to speed up MeloTTS inference ☆15 · Nov 1, 2024 · Updated last year
- [CVPR '23 Highlight] Official repository for the paper "Quantum Multi-Model Fitting". ☆11 · Mar 7, 2025 · Updated last year
- Taichi-based Differentiable DVR Renderer ☆12 · Jul 20, 2022 · Updated 3 years ago
- MoE-Visualizer is a tool designed to visualize the selection of experts in Mixture-of-Experts (MoE) models. ☆16 · Apr 8, 2025 · Updated last year
- An ESP32 BLE scanner with iotWebConf and MQTT ☆13 · Mar 30, 2019 · Updated 7 years ago
- Integration test of Verilog AXI modules (https://github.com/alexforencich/verilog-axi) with LiteX. ☆17 · Dec 19, 2022 · Updated 3 years ago
- KAF : Kolmogorov-Arnold Fourier Networks ☆21 · Feb 19, 2025 · Updated last year
- ☆12 · Nov 30, 2023 · Updated 2 years ago
- A long and thin development board, designed to keep the number and length of jumper wires to a minimum. Runs Circuit Python on Microchip … ☆14 · Dec 5, 2023 · Updated 2 years ago
- ☆12 · Dec 21, 2022 · Updated 3 years ago
- Chameleon: A Multiplier-Free Temporal Convolutional Network Accelerator for End-to-End Few-Shot and Continual Learning from Sequential Da… ☆27 · Mar 5, 2026 · Updated last month
- Langchain coding exercises using Jupyter ☆19 · Feb 18, 2024 · Updated 2 years ago
- Short Python script that attempts to neuter USB Rubber Duckies. ☆13 · Jun 25, 2019 · Updated 6 years ago
- The code of SpikingSSMs: Learning Long Sequences with Sparse and Parallel Spiking State Space Models ☆22 · Mar 25, 2026 · Updated 3 weeks ago
- ☆485 · Apr 1, 2026 · Updated 2 weeks ago
- [EMNLP 2024] Quantize LLM to extremely low-bit, and finetune the quantized LLMs ☆15 · Jul 18, 2024 · Updated last year
- [ACL2026 Findings] "Towards Hierarchical Multi-Step Reward Models for Enhanced Reasoning in Large Language Models" ☆20 · Mar 25, 2025 · Updated last year