Look Every Frame All at Once: Video-Ma2mba for Efficient Long-form Video Understanding with Multi-Axis Gradient Checkpointing

Integrated Vision and Language Lab, KAIST
*Indicates Equal Contribution

COMING SOON