My name is Jiefeng Ma, but you may also call me Jeffrey. I am a third-year PhD candidate at the NERC-SLIP Lab, where I have the privilege of working under the guidance of Prof. Du. My research focuses on document intelligence, multimodal learning, and AI-Generated Content (AIGC).
AIGC is a growing field that covers the creation and optimization of content using advanced artificial intelligence techniques. I am dedicated to contributing to this area through rigorous investigation and innovative problem-solving as I progress in my academic pursuits.
Responsibilities include:
Collaborators and Friends: Jianshu Zhang (张建树), Zhenrong Zhang (张镇荣)
Responsibilities include:
Collaborators and Friends: Haoyu Cao (曹浩宇), Zhongzhong Li (李中中), Sheng Kang (康昇), Wenwen Yu (余文文)
We built a large-scale dataset named HRDoc, which consists of 2,500 multi-page documents with nearly 2 million semantic units. We also proposed an encoder-decoder-based hierarchical document structure parsing system (DSPS) to tackle the document structure reconstruction task. The code and dataset are available at https://github.com/jfma-USTC/HRDoc.