U-C4N / Deepseek-CoTLinks

Deepseek-CoT

☆10

Alternatives and similar repositories for Deepseek-CoT

Users that are interested in Deepseek-CoT are comparing it to the libraries listed below

Sorting:

not-lain / pxia
minimalistic AI library that resembles HF's transformers
☆14Updated 6 months ago
zer0int / CLIP-SAE-finetune
Sparse Autoencoders (SAE) vs CLIP fine-tuning fun.
☆15Updated 7 months ago
Agora-Lab-AI / OmegaViT
OmegaViT (ΩViT) is a cutting-edge vision transformer architecture that combines multi-query attention, rotary embeddings, state space mod…
☆14Updated 3 weeks ago
slashml / awesome-finetuning
☆28Updated 10 months ago
kyegomez / OpenStrawberry
An open source replication of the stawberry method that leverages Monte Carlo Search with PPO and or DPO
☆30Updated last week
serp-ai / Parameter-Efficient-MoE
Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks
☆31Updated last year
mmhamdy / open-language-models
A list of language models with permissive licenses such as MIT or Apache 2.0
☆24Updated 4 months ago
attashe / ModifiedBeamSampler
Modified Beam Search with periodical restart
☆12Updated 10 months ago
lechmazur / divergent
LLM Divergent Thinking Creativity Benchmark. LLMs generate 25 unique words that start with a given letter with no connections to each oth…
☆31Updated 4 months ago
hegdeadithyak / PaperReplica
We Replicate Research Papers in the field of AI & ML.
☆21Updated 11 months ago
nexusflowai / NexusBench
Nexusflow function call, tool use, and agent benchmarks.
☆25Updated 7 months ago
facebookresearch / dual-system-for-visual-language-reasoning
Github repo for Peifeng's internship project
☆13Updated last year
kyegomez / Kosmos-X
The Next Generation Multi-Modality Superintelligence
☆70Updated 10 months ago
RWKV / ZeroCoT
https://x.com/BlinkDL_AI/status/1884768989743882276
☆28Updated 2 months ago
brendanhogan / completion_tree_view
☆10Updated 2 months ago
deep-diver / Vid2Persona
This project breathes life into video characters by using AI to describe their personality and then chat with you as them.
☆47Updated last year
NolanoOrg / SpectraSuite
☆49Updated last year
tiiuae / onebitllms
Lightweight toolkit package to train and fine-tune 1.58bit Language models
☆80Updated 2 months ago
lechmazur / deception
Benchmark evaluating LLMs on their ability to create and resist disinformation. Includes comprehensive testing across major models (Claud…
☆28Updated 4 months ago
severian42 / Computational-Model-for-Symbolic-Representations
Glyphs, acting as collaboratively defined symbols linking related concepts, add a layer of multidimensional semantic richness to user-AI …
☆49Updated 5 months ago
Aloriosa / srmt
The original Shared Recurrent Memory Transformer implementation
☆27Updated last week
OSU-NLP-Group / SeeActChromeExtension
☆16Updated 6 months ago
sammcj / ollama-artefacts
Build HTML artefacts with Ollama
☆11Updated 7 months ago
superagi / Veagle
Enhancement in Multimodal Representation Learning.
☆40Updated last year
matthewrenze / jhu-concise-cot
The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models
☆22Updated 7 months ago
ElleLeonne / Lightning-ReLoRA
A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.
☆33Updated last year
slashml / awesome-small-language-models
☆41Updated 10 months ago
severian42 / Proteus-The-Genesis-LLM
Proteus is an experimental platform that combines the power of Large Language Models with the Genesis physics engine
☆23Updated 7 months ago
diicellman / dynamite-dogs
BH hackathon
☆14Updated last year
camenduru / MoE-LLaVA-jupyter
☆16Updated last year