Jiuxiang Gu
Home
News
Publications
CV
Tensor Attention Training: Provably Efficient Learning of Higher-order Transformers (arXiv 2024)
Jiuxiang Gu
,
Yingyu Liang
,
Zhenmei Shi
,
Zhao Song
,
Yufa Zhou
January 2024
Type
Conference paper
Publication
arXiv
Cite
×