wangyifan2018 / ChatDoc-TPU
适用于sophon bm1684x,基于 Langchain 与 ChatGLM 等语言模型的本地知识库问答
☆11Updated 8 months ago
Alternatives and similar repositories for ChatDoc-TPU:
Users that are interested in ChatDoc-TPU are comparing it to the libraries listed below
- A whisper repo for TPU☆10Updated 9 months ago
- ChatTTS is a generative speech model for daily dialogue.☆14Updated 4 months ago
- Text2speech & tone color conversion demo running on SG2300x 结合openvoice和emotivoice的TTS+即时克隆☆22Updated 4 months ago
- run ChatGLM2-6B in BM1684X☆49Updated last year
- Stable Diffusion+LCM在SG2300X上,纵享丝滑一秒出图☆17Updated 3 months ago
- run chatglm3-6b in BM1684X☆38Updated last year
- 使用SG2300X实现无瑕疵换脸☆27Updated 6 months ago
- 适用于sophon bm1684x的Langchain-Chatchat,基于 Langchain 与 ChatGLM 等语言模型的本地知识库问答☆13Updated 9 months ago
- 百度QA100万数据集☆47Updated last year
- Run generative AI models in sophgo BM1684X☆175Updated this week
- 如需体验textin文档解析,请点击https://cc.co/16YSIy☆22Updated 7 months ago
- Finetune Llama 3, Mistral & Gemma LLMs 2-5x faster with 80% less memory☆28Updated 9 months ago
- Evaluation for AI apps and agent☆36Updated last year
- GLM Series Edge Models☆129Updated last week
- 演示 vllm 对中文大语言模型的神奇效果☆31Updated last year
- simplify >2GB large onnx model☆53Updated 3 months ago
- GRAIN: Gradient-based Intra-attention Pruning on Pre-trained Language Models☆19Updated last year
- An open-source chat text to control actions agentic workflow framework/showcase powered by Agently AI application development framework.☆26Updated 5 months ago
- Large-scale exact string matching tool☆15Updated 3 months ago
- 首个llama2 13b 中文版模型 (Base + 中文对话SFT,实现流畅多轮人机自然语言交互)☆89Updated last year
- 纯c++的全平台llm加速库,支持python调用,支持baichuan, glm, llama, moss基座,手机端流畅运行chatglm-6B级模型单卡可达10000+token / s,☆45Updated last year
- 万物检测(零样本检测+识别) demo for SG2300X 【Recognize Anything + GroundingDINO】☆17Updated 9 months ago
- accelerate generating vector by using onnx model☆14Updated last year
- SenseVoice-python: A enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime☆83Updated 5 months ago
- Bambo is a new proxy framework. Compared with mainstream frameworks, it is more lightweight and flexible and can handle various load task…☆35Updated 3 weeks ago
- the newest version of llama3,source code explained line by line using Chinese☆22Updated 10 months ago
- LLM RAG 应用,支持 API 调用,语音交互。☆11Updated 8 months ago
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];☆36Updated last year
- ☆12Updated last year
- 02. Enabling various applications to be AI-enabled or used by AI.☆27Updated 6 months ago