
Massive-activations-VLMs


Setup

conda create -n fastv python=3.10
conda activate fastv
cd src
bash setup.sh

Visualization: Inefficient Attention over Visual Tokens

We provide a script (./src/FastV/inference/visualization.sh) to reproduce the visualization results for each LLaVA model layer, given an image and a prompt.

bash ./src/FastV/inference/visualization.sh

or

python ./src/FastV/inference/plot_inefficient_attention_massive.py \
    --model-path "PATH-to-HF-LLaVA1.5-Checkpoints" \
    --image-path "./src/LLaVA/images/llava_logo.png" \
    --prompt "Describe the image in details." \
    --output-path "./output_example"

This produces a JSON file containing the massive activation weights.
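As a quick sanity check before plotting, you can inspect the dumped activations directly. This is a minimal sketch, assuming (hypothetically) that the JSON maps layer names to lists of activation magnitudes; adjust the keys to match the file actually written by plot_inefficient_attention_massive.py.

```python
import json


def top_massive_activations(path, k=5):
    """Return the k largest activation magnitudes per layer.

    Assumed (hypothetical) file layout:
        {"layer_0": [0.1, 9.5, ...], "layer_1": [...], ...}
    """
    with open(path) as f:
        data = json.load(f)
    # Massive activations stand out as a few values that are orders of
    # magnitude larger than the rest, so the top-k per layer is informative.
    return {layer: sorted(vals, reverse=True)[:k] for layer, vals in data.items()}
```

Layers whose top values dwarf the rest are the ones where attention sinks are expected to form.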

Visualization

python plt_massive.py

Citation

@article{zhang2026drives,
  title={What drives attention sinks? A study of massive activations and rotational positional encoding in large vision--language models},
  author={Zhang, Xiaofeng and Zhu, Yuanchao and Gu, Chaochen and Cao, Jiawei and Cheng, Hao and Wu, Kaijie},
  journal={Information Processing \& Management},
  volume={63},
  number={2},
  pages={104431},
  year={2026},
  publisher={Elsevier}
}

About

[ACL 2026 & IPM 2026] Code for paper: What Drives Attention Sinks? A Study of Massive Activations and Rotational Positional Encoding in Large Vision-Language Models
