Computer Science & Technology

Mutual Learning Offline Handwritten Mathematical Expression Recognition Based on Multi-Scale Feature Fusion

Expand
  • Faculty of Information Technology,Beijing University of Technology,Beijing 100124,China
付鹏斌(1967-),男,副教授,主要从事图形图像处理、模式识别等研究。E-mail:fupengbin@bjut.edu.cn

Received date: 2023-02-06

  Online published: 2023-06-20

Supported by

the National Natural Science Foundation of China(61772048);the Natural Science Foundation of Beijing(4153058);the Construction of High Quality Undergraduate Courseware for Beijing Education Commission(040000514122506)

Abstract

With complex two-dimensional structure, offline handwritten mathematical expressions is difficult to recognize due to the variable scale of their symbols and the various transformation of their writing styles. This paper proposed a mutual learning model based on multi-scale feature fusion. Firstly, to enhance the model for extracting fine-grained information from expressions and comprehending semantic information of global two-dimensional structures, multi-scale feature fusion was introduced in the encoding stage. Secondly, paired handwritten and printed mathematical expressions were introduced for training the mutual learning model, which includes decoder loss and context matching loss to learn LaTeX grammar as well as semantic invariance between handwritten and printed mathematical expressions respectively to improve the robustness of the model to different writing styles. Experimental validation was performed on the CROHME 2014/2016/2019 dataset. After introducing the multi-scale feature fusion mechanism, the expression correctness rate reaches 55.25%, 52.31%, 53.72%, respectively. After introducing the mutual learning mechanism, the expression correct rate reaches 55.43%, 53.53%, 53.79%, respectively. The expression correctness rate reaches 58.88%, 55.10%, 57.05% after introducing both mechanisms at the same time. It is proved experimentally that the proposed method can effectively extract the features in formulas at different scales and overcome the problems of different handwriting styles and small amount of data by mutual learning mechanism. In addition, the experimental results on the HME100K dataset verified the effectiveness of the proposed model.

Cite this article

FU Pengbin, XU Yu, YANG Huirong . Mutual Learning Offline Handwritten Mathematical Expression Recognition Based on Multi-Scale Feature Fusion[J]. Journal of South China University of Technology(Natural Science), 2024 , 52(2) : 23 -31 . DOI: 10.12141/j.issn.1000-565X.230034

References

1 MOUCHERE H, GAUDIN C V, ZANIBBI R,et al .ICFHR 2016 CROHME:competition on recognition of online handwritten mathematical expressions[C]∥Proceedings of the 2016 15th International Conference on Frontiers in Handwriting Recognition (ICFHR).Shenzhen:IEEE,2017:607-612.
2 靳简明,江红英,王庆人 .数学公式图像处理综述 [J].模式识别与人工智能200518(4):429-440.
  JIN Jian-ming, JIANG Hong-ying, WANG Qing-ren .Survey of mathematical expression image processing[J].Pattern Recognition and Artificial Intelligence200518(4):429-440.
3 SIMISTIRA F, PAPAVASSILIOU V, KATSOUROS V,et al .Recognition of spatial relations in mathematical formulas[C]∥Proceedings of the 2014 14th International Conference on Frontiers in Handwriting Recognition (ICFHR).Hersonissos:IEEE,2014:164-168.
4 NAZEMI A, TAVAKOLIAN N, FITZPATRICK D,et al .Offline handwritten mathematical symbol recognition utilising deep learning [EB/OL].(2019-10-22)[2023-01-09]..
5 LODS A, ANQUETIL E, MACE S .Fuzzy visibility graph for structural analysis of online handwritten mathematical expressions[C]∥Proceedings of the 2019 International Conference on Document Analysis and Recognition (ICDAR).Sydney:IEEE,2019:641-646.
6 LAVANYA K, BAJAJ S, TANK P,et al .Handwritten digit recognition using hoeffding tree,decision tree and random forests—a comparative approach[C]∥Proceedings of the 2017 International Conference on Computational Intelligence in Data Science (ICCIDS).Chennai:IEEE,2017:1-6.
7 ALTAN A, KARASU S,ZIO E .A new hybrid model for wind speed forecasting combining long short-term memory neural network,decomposition methods and grey wolf optimizer[J].Applied Soft Computing2021,106996/1-20.
8 陈路,陈道喜,陆一鸣,等 .基于注意力机制编码器-解码器的手写数学公式识别模型[J].计算机应用202343(4):1297-1302.
  CHEN Lu, CHEN Daoxi, LU Yiming,et al .Handwritten mathematical expression recognition model based on attention mechanism and encoder-decoder[J].Journal of Computer Applications202343(4):1297-1302.
9 ZHANG J, DU J, ZHANG S L,et al .Watch,attend and parse:an end-to-end neural network based approach to handwritten mathematical expression recognition [J].Pattern Recognition201771:196-206.
10 ZHANG J S, DU J, DAI L R .A GRU-based encoder-decoder approach with attention for online handwritten mathematical expression recognition[C]∥Proceedings of the 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR).Kyoto:IEEE,2017:902-907.
11 ZHANG J S, DU J, DAI L R .Multi-scale attention with dense encoder for handwritten mathematical expression recognition[C]∥Proceedings of the 2018 24th International Conference on Pattern Recognition (ICPR).Beijing:IEEE,2018:2245-2250.
12 WU J W, YIN F, ZHANG Y M,et al .Image-to-markup generation via paired adversarial learning [C]∥Proceedings of the Machine Learning and Knowledge Discovery in Databases.Cham:Springer,2018:18-34.
13 WU J W, YIN F, ZHANG Y M,et al .Handwritten mathematical expression recognition via paired adversarial learning[J].International Journal of Computer Vision2020128:2386-2401.
14 LE A D .Recognizing handwritten mathematical expressions via paired dual loss attention network and printed mathematical expressions[C]∥Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).Seattle:IEEE,2020:2413-2418.
15 ZHAO W Q, GAO L C, YAN Z Y,et al .Handwritten mathematical expression recognition with bidirectionally trained transformer[C]∥Proceedings of the Document Analysis and Recognition-ICDAR 2021.Cham:Springer,2021:570-584.
16 BIAN X H, QIN B, XIN X Z,et al .Handwritten mathematical expression recognition via attention aggregation based bi-directional mutual learning[EB/OL].(2022-09-04)[2023-01-03]..
17 ZHANG Y, XIANG T, HOSPEDALES T M,et al .Deep mutual learning[C]∥Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR).New York:IEEE,2018:4320-4328.
18 付鹏斌,李建君,杨惠荣 .基于粘连符号分割和多特征融合的手写公式识别[J].北京工业大学学报202147(8):842-853.
  FU Pengbin, LI Jianjun, YANG Huirong. Handwritten formula recognition based on segmentation of adhesive symbols and multi-feature fusion[J].Journal of Beijing University of Technology202147(8):842-853.
19 HUANG G, LIU Z, MAATEN V,et al .Densely connected convolutional networks[C]∥Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).Honolulu:IEEE,2017:2261-2269.
20 VASWANI A, SHAZEER N, PARMAR N,et al .Attention is all you need[EB/OL].(2021-01-23)[2023-01-16]..
21 ZHAO W Q, GAO L C. CoMER:modeling coverage for transformer-based handwritten mathematical expression recognition [EB/OL].(2022-07-13)[2023-01-15]..
22 CARION N, MASSA F, SYNNAEVE G,et al .End-to-end object detection with transformers[C]∥Proceedings of the 16th European Conference on Computer Vision.Glasgow:Springer,2020:213-229.
23 DENG Y T, KANERVISTO A, LING J,et al .Image-to-markup generation with coarse-to-fine attention[C]∥Proceedings of the 34th International Conference on Machine Learning.[S.l.]:JMLR,2016:980-989.
24 HINTON G, VINYALS O, DEAN J .Distilling the knowledge in a neural network[EB/OL].(2018-08-13)[2023-01-15]..
25 ZHANG J S, DU J, YANG Y X,et al .A tree-structured decoder for image-to-markup generation[C]∥Proceedings of the International Conference on Machine Learning (ICML).[S.l.]:PMLR,2020:11076-11085.
26 YUAN Y, LIU X, DIKUBAB W,et al .Syntax-aware network for handwritten mathematical expression recognition[EB/OL].(2022-06-18)[2023-02-01]..
Outlines

/