Retriever-0.1B
☆96Jun 6, 2024Updated last year
Alternatives and similar repositories for Retriever
Users that are interested in Retriever are comparing it to the libraries listed below.
- An attempt to write an LLM from scratch, with reference to llama and nanogpt☆69Apr 27, 2024Updated 2 years ago
- ☆13May 11, 2023Updated 3 years ago
- The objective of this project is to demonstrate how to fine-tune deepseek-r1-distill-llama-8b.☆17Feb 19, 2025Updated last year
- The JSON file for the ICD-9-CM and ICD-10-CM hierarchy, including diagnosis codes and procedure codes☆13Jan 26, 2023Updated 3 years ago
- A direct convolution library targeting ARM multi-core CPUs.☆12Nov 27, 2024Updated last year
- ☆25Apr 16, 2024Updated 2 years ago
- Performance of the C++ interface of flash attention and flash attention v2 in large language model (LLM) inference scenarios.☆16Aug 31, 2023Updated 2 years ago
- ☆11Apr 10, 2024Updated 2 years ago
- Accelerating GOT-OCRv2 with VLLM☆10Nov 15, 2024Updated last year
- The official implementation of our work SQLFixAgent: Towards Semantic-Accurate Text-to-SQL Parsing via Consistency-Enhanced Multi-Agent C…☆24May 2, 2025Updated last year
- ☆11Dec 2, 2025Updated 5 months ago
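- NUST-API collection☆10Oct 29, 2018Updated 7 years ago
- Chinese "Complete Poetry Collection": the most comprehensive and systematic Chinese poetry dataset to date, with unified data modeling.☆41Jan 6, 2026Updated 4 months ago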
- ☆13Jul 12, 2022Updated 3 years ago
- Fast and low-memory attention layer written in CUDA☆20Jul 14, 2023Updated 2 years ago
- Multiplayer online Gomoku (five-in-a-row) WeChat mini game☆14Dec 26, 2018Updated 7 years ago
- [EMNLP2020] End-to-End Emotion-Cause Pair Extraction based on Sliding Window Multi-Label Learning☆20Oct 13, 2020Updated 5 years ago
- Chinese dataset distilled from the full-strength DeepSeek-R1☆63Feb 21, 2025Updated last year
- ☆33Mar 11, 2023Updated 3 years ago
- Xcbwin - a simple C++ class for graphical outputs using XCB☆12May 12, 2015Updated 11 years ago
- [NIPS 2025] Seg2Any: Open-set Segmentation-Mask-to-Image Generation with Precise Shape and Semantic Control☆47Apr 1, 2026Updated last month
- Code for paper 'Data-Efficient FineTuning'☆28May 24, 2023Updated 3 years ago
- Closed-loop simulator of complex behavior and learning based on reinforcement learning and deep neural networks☆14Mar 20, 2026Updated 2 months ago
- Multi-DNN Inference Engine for Heterogeneous Mobile Processors☆39Jul 24, 2024Updated last year
- Code for the paper "Unbiased Supervised Contrastive Learning" | ICLR 2023 https://openreview.net/forum?id=Ph5cJSfD2XN☆12Sep 22, 2023Updated 2 years ago
- Learned User Representations in Online Social Networks (Twitter) using Temporal Dynamics of Information Diffusion.☆10Oct 15, 2018Updated 7 years ago
- ☆16Jul 29, 2025Updated 9 months ago
- Multi-objective reinforcement learning for covid-19 control☆12Aug 12, 2021Updated 4 years ago
- How fastapi async IO and the threadpool work☆18Feb 12, 2024Updated 2 years ago
- Unofficial implementation of the Transformer Index for GEnerative Recommenders (TIGER) framework.☆13Jun 6, 2023Updated 2 years ago
- Conversational AI based on Rasa☆40Feb 11, 2022Updated 4 years ago
- A document search tool built on sentence transformers and chatglm☆158Apr 17, 2023Updated 3 years ago
- ☆13Oct 24, 2022Updated 3 years ago
- This repository contains an experimental PyTorch implementation exploring the NoProp algorithm, presented in the paper "NOPROP: TRAINING …☆16Updated this week
- Self-reproduction code for the paper "Reducing Transformer Key-Value Cache Size with Cross-Layer Attention" (MIT CSAIL)☆17May 24, 2024Updated 2 years ago
- ☆32Mar 12, 2024Updated 2 years ago
- ☆17Apr 17, 2024Updated 2 years ago
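- A small 0.2B Chinese dialogue model (ChatLM-Chinese-0.2B), with all code open-sourced for the full pipeline: dataset sources, data cleaning, tokenizer training, model pretraining, SFT instruction fine-tuning, and RLHF optimization. Supports SFT fine-tuning for downstream tasks and includes a fine-tuning example for triple information extraction.☆1,713Apr 20, 2024Updated 2 years ago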
- ☆22Jul 16, 2024Updated last year