Implement llm model in pytorch, support MoE and RoPE
☆64Apr 7, 2026Updated last week
Alternatives and similar repositories for llm_model
Users that are interested in llm_model are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A python implementation of PROCLUS: PROjected CLUStering algorithm.☆10Jan 12, 2015Updated 11 years ago
- ☆27Dec 11, 2025Updated 4 months ago
- 实现《Multiway Attention Networks for Modeling Sentence Pairs》中的网络模型,可用于问答,句子逻辑推理☆11Apr 13, 2020Updated 6 years ago
- Experimental syslog template mining module☆11Aug 29, 2016Updated 9 years ago
- Taylor moment expansion in Python (JaX and SymPy) and Matlab☆11Nov 26, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- [ICLR 2022] Denoising Likelihood Score Matching for Conditional Score-based Data Generation☆11Jan 2, 2025Updated last year
- tbb, gpu things for robotics☆13Sep 30, 2024Updated last year
- Overlapping Reads COmpression with Minimizers☆16May 19, 2022Updated 3 years ago
- STRODE: Stochastic Boundary Ordinary Differential Equation☆13Jul 20, 2021Updated 4 years ago
- [WACV 2026] SceneEdited: A City-Scale Benchmark for 3D HD Map Updating via Image-Guided Change Detection☆16Updated this week
- Plugin to display surfel clouds in the ROS visualizer RViz☆13Aug 8, 2019Updated 6 years ago
- ☆13Oct 24, 2021Updated 4 years ago
- 3rd Eye Scene is a generalised visual debugger and debugging aid in the vein of rviz.☆10Feb 22, 2023Updated 3 years ago
- ☆14Aug 18, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Code used for the AAAI 2020 paper "System Identification with Time-Aware Neural Sequence Models"☆16Nov 22, 2019Updated 6 years ago
- CamVox相关论文、代码中文注释以及代码改动☆13Aug 17, 2021Updated 4 years ago
- ☆16May 12, 2023Updated 2 years ago
- SparseWorld-TC: Trajectory-Conditioned Sparse Occupancy World Model☆27Dec 17, 2025Updated 3 months ago
- 网页剪报☆13Jan 30, 2016Updated 10 years ago
- Source code for UQnet☆16May 23, 2024Updated last year
- Convolutional Channel Features + Online boosting-based person identification for mobile robots☆12Jun 25, 2021Updated 4 years ago
- Like a Crystal Ball: Self-Supervised Learning to Predict the Future of Dynamic Scenes for Indoor Navigation.☆17Dec 12, 2022Updated 3 years ago
- ☆15Aug 22, 2024Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Implementation of MTAD-TF: Multivariate Time Series Anomaly Detection Using the Combination of Temporal Pattern and Feature Pattern☆16Feb 21, 2021Updated 5 years ago
- Interpolation between Residual and Non-Residual Networks, ICML 2020. https://arxiv.org/abs/2006.05749☆26Aug 16, 2020Updated 5 years ago
- Port from SVN. Required for PR2's webui☆12Mar 21, 2018Updated 8 years ago
- SLAM with Moving Object Removal☆13Jun 29, 2018Updated 7 years ago
- PyTorch implementation of the NCDSSM models presented in the ICML '23 paper "Neural Continuous-Discrete State Space Models for Irregularl…☆26Jul 9, 2023Updated 2 years ago
- A lightweight (experimental) point cloud visualization library☆18Jul 29, 2022Updated 3 years ago
- avp mapping algorithm using multi-camera system☆16Dec 10, 2022Updated 3 years ago
- ☆11Mar 3, 2020Updated 6 years ago
- Implementation of the "Deep Hierarchical Representation of Point Cloud Videos via Spatio-Temporal Decomposition" paper.☆21Apr 13, 2021Updated 5 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- jfinalQ对应的代码生成工具☆16Sep 28, 2015Updated 10 years ago
- [ICCV 2023 & IJCV 2026] PARTNER: Level up the Polar Representation for LiDAR 3D Object Detection☆22Aug 12, 2024Updated last year
- Reinforcement for state of the art visual slam algorithms with Deep Learning based solutions. Part of my master thesis at University of F…☆10Jul 29, 2018Updated 7 years ago
- Train, visualize, and evaluate RL policies for the Terra environment.☆18Feb 10, 2026Updated 2 months ago
- ☆16Dec 18, 2025Updated 3 months ago
- Collection of resources that combine dynamic systems, control with deep learning.☆29May 18, 2021Updated 4 years ago
- Materials for the State Estimation for Robotics course.☆11Nov 22, 2021Updated 4 years ago