Implementation of the sparse attention pattern proposed by the Deepseek team in their "Native Sparse Attention" paper
SpargeAttention: A training-free sparse attention method that accelerates inference for any model.
Neighborhood Attention Extension. Bringing attention to a neighborhood near you!
[ICML 2025 Spotlight] ShadowKV: KV Cache in Shadows for High-Throughput Long-Context LLM Inference
Efficient Triton implementation of Native Sparse Attention.
The official implementation of the paper "MoA: Mixture of Sparse Attention for Automatic Large Language Model Compression"
Code for the paper [ICLR 2025 Oral] "FlexPrefill: A Context-Aware Sparse Attention Mechanism for Efficient Long-Sequence Inference"
Demo code for CVPR2023 paper "Sparsifiner: Learning Sparse Instance-Dependent Attention for Efficient Vision Transformers"
[TIP-2025] PyTorch implementation of "Structural Similarity-Inspired Unfolding for Lightweight Image Super-Resolution"
Building Native Sparse Attention
Text Summarization Modeling with three different Attention Types
Integrating QC techniques into Sparse Attention for Transformers
Dynamic Attention Mask (DAM) generates adaptive sparse attention masks per layer and head for Transformer models, enabling long-context inference with lower compute and memory overhead, without fine-tuning.
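Most of the repositories above implement variations of the same core idea: restrict each query to a subset of keys through a sparsity mask that may differ per layer and per head. The following is a minimal, illustrative PyTorch sketch of that idea; the banded per-head windows are hypothetical, and this is not the API of any repository listed here.

import torch

def sparse_attention(q, k, v, keep_mask):
    # q, k, v: [batch, heads, seq, dim]; keep_mask: [heads, seq, seq] bool,
    # True where a query position is allowed to attend to a key position.
    scale = q.size(-1) ** -0.5
    scores = torch.matmul(q, k.transpose(-2, -1)) * scale            # [b, h, s, s]
    scores = scores.masked_fill(~keep_mask.unsqueeze(0), float("-inf"))
    return torch.matmul(torch.softmax(scores, dim=-1), v)

# Example: a banded (local-window) mask with a different window per head,
# loosely in the spirit of per-head sparsity patterns.
batch, heads, seq, dim = 2, 4, 128, 64
q, k, v = (torch.randn(batch, heads, seq, dim) for _ in range(3))
idx = torch.arange(seq)
windows = [8, 16, 32, 64]  # hypothetical per-head window sizes
keep_mask = torch.stack([(idx[:, None] - idx[None, :]).abs() <= w for w in windows])
out = sparse_attention(q, k, v, keep_mask)                            # [2, 4, 128, 64]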