OCR
OCR stands for Optical Character Recognition. It is the electronic identification and digital encoding of printed or handwritten characters by means of an optical scanner and specialized software.
Extracted Text
Text that already exists in the native files such as MS Word, MS Excel, HTML files and PDFs.