US 12,033,413 B2
Method and apparatus for data structuring of text
Dong Hwan Kim, Seoul (KR); You Kyung Kwon, Seoul (KR); So Young Ko, Seoul (KR); Sook Jin Roe, Seoul (KR); Ki Beom Kwon, Gyeonggi-do (KR); and Da Hea Moon, Seoul (KR)
Assigned to 42 Maru Inc., Seoul (KR)
Filed by 42Maru Inc., Seoul (KR)
Filed on Oct. 14, 2021, as Appl. No. 17/502,017.
Claims priority of application No. 10-2021-0135569 (KR), filed on Oct. 13, 2021.
Prior Publication US 2023/0110931 A1, Apr. 13, 2023
Int. Cl. G06K 9/00 (2022.01); G06F 16/953 (2019.01); G06F 40/20 (2020.01); G06V 30/12 (2022.01); G06V 30/19 (2022.01); G06V 30/412 (2022.01); G06V 30/413 (2022.01); G06V 30/414 (2022.01); G06V 30/416 (2022.01)
CPC G06V 30/413 (2022.01) [G06F 16/953 (2019.01); G06F 40/20 (2020.01); G06V 30/12 (2022.01); G06V 30/19093 (2022.01); G06V 30/412 (2022.01); G06V 30/414 (2022.01); G06V 30/416 (2022.01)] 13 Claims
OG exemplary drawing
 
1. An apparatus for data structuring of text, the apparatus comprising:
a processor; and
a memory storing instructions executable by the processor,
wherein the processor is configured to execute the instructions to:
extract text and location information of the text from an image based on an optical character recognition (OCR) technique;
generate a text unit based on the text and the location information;
classify a form of the image based on the text;
label the text unit as first text, second text, and third text respectively corresponding to an item name, an item value, and others based on the classified form of the image;
structure the text by mapping the second text corresponding to the item value and the first text corresponding to the item name; and
determine misrecognition of the first text and correct the first text determined to be misrecognized.