MGDoc: Pre-training with multi-granular hierarchy for document image understanding ( 2022)