Abstract: A system of identifying one or more fillable fields of an electronic form may include an electronic device, and a computer-readable storage medium that includes one or more programming instructions. The programming instructions are configured to instruct the electronic device to receive an electronic form, identify fillable field candidates of the electronic form, and determine, for each fillable field candidate, whether the fillable field candidate is a fillable field. The system updates metadata associated with the electronic form by applying a sequencing framework to only the fillable fields by obtaining position information for each fillable field that indicates a position of the fillable field on the document, sorting the fillable fields based on the position information to form a sequence of fillable fields, determining a designator to each fillable field that indicates a position of a corresponding fillable field in the sequence, and storing the designator.
Abstract: A system of associating text items with fillable fields of an electronic fillable form may identify an electronic fillable form comprising one or more fillable fields and one or more text items, and for one or more pairs of fillable fields and text items, determine a confidence value associated with the pair. The system may determine a first configuration for the electronic fillable form, determine an overall confidence value for the first configuration, and identify one or more orphan fields from the determined configuration. The system may determine a second configuration for the electronic form, determines an overall confidence value for the second configuration, and in response to the overall confidence value for the first configuration exceeding the overall confidence value for the second configuration, selects the first configuration, otherwise, selects the second configuration. The system creates an association map for the selected configuration.
Abstract: An image processing system for identifying table captions may include a processor and a non-transitory computer-readable storage medium comprising one or more instructions configured to cause the processor to perform image processing of a scanned document to identify boxes of an electronic fillable form that corresponds to the scanned document. The system creates and stores a data structure that includes a box list. The system searches the box list to identify an empty box, uses an adjacency list determine whether the empty box has adjacent boxes, and in response to determining that the empty box has one or more adjacent boxes, creates a table box list that includes the empty box and one or more other boxes from the table to which the empty box belongs. The system uses the adjacency list to identify a row caption and a column caption associated with the empty box.