US 12,112,563 B2
Method of detecting, segmenting and extracting salient regions in documents using attention tracking sensors
Naftali Y Cohen, New York, NY (US); Sameena Shah, Scarsdale, NY (US); Natraj Raman, London (GB); Zhen Zeng, Ypsilanti, MI (US); Salwa Husam Alamir, Bournemouth (GB); Daniel Borrajo, Pozuelo de Alarcon (ES); and Alec Louis Clemente Candidato, Brooklyn, NY (US)
Assigned to JPMORGAN CHASE BANK, N.A., New York, NY (US)
Filed by JPMorgan Chase Bank, N.A., New York, NY (US)
Filed on Aug. 9, 2021, as Appl. No. 17/444,721.
Prior Publication US 2023/0042930 A1, Feb. 9, 2023
Int. Cl. G06V 30/413 (2022.01); G06F 3/01 (2006.01); G06F 40/289 (2020.01); G06V 10/46 (2022.01); G06V 30/18 (2022.01); G06V 30/412 (2022.01); G06V 30/414 (2022.01)
CPC G06V 30/413 (2022.01) [G06F 3/013 (2013.01); G06F 40/289 (2020.01); G06V 10/462 (2022.01); G06V 30/18143 (2022.01); G06V 30/412 (2022.01); G06V 30/414 (2022.01)]
18 Claims
 
1. A method for detecting, segmenting, and extracting salient regions in documents by using attention tracking sensors, the method being implemented by at least one processor, the method comprising:
receiving, by the at least one processor, an image that corresponds to a document;
receiving, by the at least one processor from a sensor, a sequence of measurements that correspond to a human reading of the document, wherein the sensor includes an eye-tracking sensor configured to detect a sequence of eye-gaze positions on the document as a function of time;
determining, by the at least one processor based on the received sequence of measurements, an identification of at least one of a title, a section header, a graph, and a table, included in the document;
outputting, by the at least one processor, the identification of the at least one of the title, the section header, the graph, and the table;
determining, by the at least one processor based on the received sequence of measurements, at least one region of the document as being a salient document region;
demarcating, by the at least one processor, the salient document region in an electronically displayable manner; and
outputting, by the at least one processor, a file that includes a displayable version of the document with the demarcated salient document region.
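
The claim does not say how a salient document region is derived from the eye-gaze sequence. As an illustration only, the sketch below assumes one simple possibility: accumulate dwell time on a coarse grid laid over the document image, then report bounding boxes around connected groups of cells whose dwell time meets a threshold. The names `GazeSample`, `dwell_grid`, and `salient_boxes`, the cell size, and the threshold are all hypothetical and do not come from the patent.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class GazeSample:
    """One eye-gaze measurement (hypothetical layout, not from the patent)."""
    t: float  # timestamp in seconds
    x: float  # horizontal position in image pixels
    y: float  # vertical position in image pixels


def dwell_grid(samples: List[GazeSample], width: int, height: int,
               cell: int) -> List[List[float]]:
    """Accumulate dwell time (seconds) per grid cell.

    Each sample is credited with the time until the next sample, so the
    final sample contributes nothing. Off-page samples are ignored.
    """
    cols = (width + cell - 1) // cell
    rows = (height + cell - 1) // cell
    grid = [[0.0] * cols for _ in range(rows)]
    for cur, nxt in zip(samples, samples[1:]):
        if 0 <= cur.x < width and 0 <= cur.y < height:
            grid[int(cur.y) // cell][int(cur.x) // cell] += nxt.t - cur.t
    return grid


def salient_boxes(grid: List[List[float]], cell: int,
                  min_dwell: float) -> List[Tuple[int, int, int, int]]:
    """Return (x0, y0, x1, y1) boxes around 4-connected groups of cells
    whose dwell time is at least min_dwell.

    Boxes are aligned to the grid, so they can extend past the image edge
    when the image size is not a multiple of the cell size.
    """
    rows, cols = len(grid), len(grid[0])
    seen = set()
    boxes = []
    for r in range(rows):
        for c in range(cols):
            if (r, c) in seen or grid[r][c] < min_dwell:
                continue
            # Flood-fill one connected group, tracking its extent.
            stack = [(r, c)]
            seen.add((r, c))
            r0 = r1 = r
            c0 = c1 = c
            while stack:
                cr, cc = stack.pop()
                r0, r1 = min(r0, cr), max(r1, cr)
                c0, c1 = min(c0, cc), max(c1, cc)
                for nr, nc in ((cr + 1, cc), (cr - 1, cc),
                               (cr, cc + 1), (cr, cc - 1)):
                    if (0 <= nr < rows and 0 <= nc < cols
                            and (nr, nc) not in seen
                            and grid[nr][nc] >= min_dwell):
                        seen.add((nr, nc))
                        stack.append((nr, nc))
            boxes.append((c0 * cell, r0 * cell,
                          (c1 + 1) * cell, (r1 + 1) * cell))
    return boxes
```

Under these assumptions, the returned boxes would be the regions to demarcate on the displayable version of the document, for example by drawing outlines at those pixel coordinates. Identifying titles, section headers, graphs, or tables from the same measurements is a separate step that this sketch does not attempt.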