DolbyUUU / Logic-RL-Lite
Lightweight replication study of DeepSeek-R1-Zero. Interesting findings include "No Aha Moment", "Longer CoT ≠ Accuracy", and "Language Mixing in Instruct Models".
☆50Updated 4 months ago
Alternatives and similar repositories for Logic-RL-Lite
Users who are interested in Logic-RL-Lite are comparing it to the repositories listed below
- Be your lucky charm!☆167Updated last month
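- A modern decentralized finance (DeFi) asset management dashboard that supports connecting a wallet to view real asset balances and transaction history.☆132Updated 3 weeks ago
- A bot built on OICQ☆174Updated 2 months ago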
- Cosmos-AI is an innovative solution that leverages state-of-the-art LLMs and artificial intelligence algorithms for intelligent task deco…☆79Updated 5 months ago
- Performance optimization for Centris software☆566Updated 4 months ago
- Re-movery☆532Updated 4 months ago
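- A general-purpose development framework designed for building enterprise-grade MCP servers☆178Updated 3 months ago
- PrettySQL is a lightweight IntelliJ IDEA plugin dedicated to improving the SQL development experience. It integrates SQL formatting, highlighted table-structure hints, SQL checking, and execution report analysis, giving users who frequently work with SQL in day-to-day development convenient, clear, visual assistance.☆272Updated last month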
- Code for Formfactory Benchmark☆115Updated last month
- Study notes covering multiple areas such as SDN, PDP, and Go, spanning both research and development☆146Updated this week
- ☆506Updated 7 months ago
- Enterprise-grade modular framework for Minecraft server development with multi-tier caching, Redis Streams, resilient data processing, an…☆667Updated this week
- A decentralized agent network for building collaborative, LLM-powered agent-to-agent (A2A) systems.☆262Updated 2 weeks ago
- Multi-Agent System Framework For Complex Tasks☆571Updated this week
- ☆125Updated 7 months ago
- A Pair Programming Framework for Code Generation via Multi-Plan Exploration and Feedback-Driven Refinement, ASE 2024 (Distinguished Pape…☆112Updated 8 months ago
- OD-FinLLM is a refined model derived from the LLaMA series, with specific enhancements for Chinese financial knowledge. This model is bui…☆269Updated 11 months ago
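- KGminerproxy, MinerProxy, minerproxy: original and genuine, high-performance across all coins; professional mining-pool fee extraction, pool relay, pool proxy, relay software, relay setup, fee-extraction software, and mining-farm operations tooling; an essential assistant for raising mining-farm profits. minerproxy, minerproxy, minerproxy, min…☆130Updated 2 weeks ago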
- Efficient Steganalysis System☆131Updated 4 months ago
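- Dynamic art posters built with p5.js☆121Updated 10 months ago
- Kafka Web UI By LCC is a graphical user interface (GUI) designed to simplify the management and operation of Apache Kafka clusters. The project aims to give developers, operations staff, and data scientists an intuitive, easy-to-use platform for tasks such as Kafka topic management, message production and consumption, and cluster monitoring. Through this…☆148Updated 4 months ago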
- ☆137Updated 8 months ago
- ☆311Updated last year
- A message queue based on the AMQP model, implemented in C++☆347Updated 4 months ago
- How to top up ChatGPT-Plus at a low price☆57Updated 7 months ago
- Object_Detection_Dataset_Conversion☆140Updated 7 months ago
- Creating a simple Go module for Backend Teams' DevOps Workflow☆104Updated 7 months ago
- A physics-guided hierarchical deep network (PhyRes-LSTM) framework, which integrates external knowledge with deep neural networks to guid…☆16Updated 11 months ago
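- Nestjs lottery system, Nodejs lottery system, chain-of-responsibility lottery strategy, decision-tree lottery strategy☆19Updated 8 months ago
- [🚧 The project is still under development and not yet complete; please check back later] 寒霜物联: a unified IoT device access platform that supports lightweight, fast onboarding☆326Updated last month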