On-Device Information Extraction from Screenshots in form of tags

2020 
We propose a method to make mobile Screenshots easily searchable. In this paper, we present the workflow in which we: 1) pre-processed a collection of screenshots, 2) identified script present in image, 3) extracted unstructured text from images, 4) identified language of the extracted text, 5) extracted keywords from the text, 6) identified tags based on image features, 7) expanded tag set by identifying related keywords, 8) inserted image tags with relevant images after ranking and indexed them to make it searchable on device. We made the pipeline which supports multiple languages and executed it on-device, which addressed privacy concerns. We developed novel architectures for components in the pipeline, optimized performance and memory for on-device computation. We observed from experimentation that the solution developed can reduce overall user effort and improve end user experience while searching, whose results are published.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    16
    References
    3
    Citations
    NaN
    KQI
    []