Paper reading: Jamba — Hybrid Transformer-Mamba LM (SSM → S4 → S6 → Jamba)
☆15May 22, 2024Updated last year
Alternatives and similar repositories for Jamba_Paper_Reading
Users that are interested in Jamba_Paper_Reading are comparing it to the libraries listed below.
- Data for ArXiv 2024 paper "Large Language Models as Zero-Shot Keyphrase Extractors: A Preliminary Empirical Study".☆23Mar 10, 2024Updated 2 years ago
- CLongEval: A Chinese Benchmark for Evaluating Long-Context Large Language Models☆49Mar 7, 2024Updated 2 years ago
- ☆12Jun 14, 2019Updated 6 years ago
- ☆17Oct 1, 2021Updated 4 years ago
- Presentations & Notes☆11May 14, 2022Updated 3 years ago
- Paper: "Spatial-Temporal Transformer Networks for Traffic Flow Forecasting"☆12Oct 11, 2020Updated 5 years ago
- [EMNLP 2021] Code for our EMNLP 2021 paper “Heterogeneous Graph Neural Networks for Keyphrase Generation”☆14Nov 13, 2021Updated 4 years ago
- Counting-Stars (★)☆83Nov 24, 2025Updated 5 months ago
- ☆11Aug 8, 2022Updated 3 years ago
- ☆16Feb 21, 2023Updated 3 years ago
- Code for CIKM 2022 paper "A Preliminary Exploration of Extractive Multi-Document Summarization in Hyperbolic Space".☆12Dec 2, 2022Updated 3 years ago
- ☆17Oct 18, 2022Updated 3 years ago
- Code for paper "Incorporating Multimodal Information in Open-Domain Web Keyphrase Extraction"☆19Jan 28, 2021Updated 5 years ago
- This is a list of publications regarding deep learning-based image and video compression. The list is maintained by the USTC-FVC research…☆16Sep 3, 2025Updated 8 months ago
- ☆13Jun 21, 2018Updated 7 years ago
- Some tips on paper writing skills.☆15May 25, 2022Updated 3 years ago
- [CVPR 2024] Code and models for pi-ViT, a video transformer for understanding activities of daily living☆31Nov 12, 2025Updated 5 months ago
- 🤔 When in Doubt: Improving Classification Performance with Alternating Normalization [Findings of EMNLP2021]☆14Oct 29, 2021Updated 4 years ago
- ☆20Jan 9, 2025Updated last year
- Official implementation of the paper "Vessel trajectory prediction with recurrent neural networks: An evaluation of datasets, features, a…☆22Jan 26, 2024Updated 2 years ago
- ☆11Nov 21, 2024Updated last year
- ☆11Sep 18, 2020Updated 5 years ago
- CVPR2022 update everyday!☆11Apr 12, 2022Updated 4 years ago
- Echo (回声): an AI copywriting assistant☆10May 6, 2023Updated 3 years ago
- Artificial Intelligence project☆10Mar 11, 2016Updated 10 years ago
- Multi-span Style Extraction for Generative Reading Comprehension☆10Apr 2, 2021Updated 5 years ago
- Code used in the paper "Learning to Learn from Web Data through Deep Semantic Embeddings" ECCV 2018 MULA Workshop☆11Aug 1, 2018Updated 7 years ago
- Vessel traffic data, or Automatic Identification System (AIS) data, are collected by the U.S. Coast Guard through an onboard navigation s…☆23Nov 3, 2019Updated 6 years ago
- ChatTTS is a generative speech model for daily dialogue.☆14Oct 21, 2024Updated last year
- ☆26Mar 1, 2025Updated last year
- A simple Bidirectional Mamba☆32Jul 25, 2025Updated 9 months ago
- ☆16May 31, 2023Updated 2 years ago
- Self-reproduction code for the paper "Reducing Transformer Key-Value Cache Size with Cross-Layer Attention" (MIT CSAIL)☆17May 24, 2024Updated last year
- In this repository I will be running various experiments on fine-tuning different parts of xtts☆15Jun 22, 2024Updated last year
- A large scale dataset for Video Captioning in Italian☆13May 16, 2023Updated 2 years ago
- Set up clash tun mode in a Linux environment to achieve a global proxy☆12Oct 5, 2023Updated 2 years ago
- ☆19May 11, 2024Updated last year
- Repository for KPTimes corpus☆35Feb 9, 2025Updated last year
- ☆11Aug 7, 2024Updated last year