Skip to main content

Optical character recognition (OCR)


OCR (optical character recognition) is the recognition of printed or handwritten text characters by a computer. The basic process of OCR involves examining the text of a document and translating the characters into character codes a computer program can understand.

OCR systems are used to convert physical documents into machine-readable text. Software features can also take advantage of artificial intelligence (AI) to implement more advanced methods of intelligent character recognition (ICR), like identifying languages or styles of handwriting.
The process of OCR is most commonly used to turn hard copy legal or historic documents into PDFs. Once digitized, the document can be interacted with as if it was created with a word processor. This is why OCR is sometimes also referred to as text recognition.

How optical character recognition works

The first step of OCR is to scan the physical document. OCR programs typically target one character, word or block of text at a time. When a character is identified, it is converted into ASCII code.
Characters are typically identified using one of two algorithms:
  • Pattern recognition - OCR programs are fed examples of text in various fonts and formats which are then used to compare, and recognize, characters in the scanned document.
  • Feature detection - OCR programs apply rules regarding the features of a specific letter or number to recognize characters in the scanned document. Features could include the number of angled lines, crossed lines or curves in a character for comparison. For example, the capital letter "A" may be stored as two diagonal lines that meet with a horizontal line across the middle.

Optical character recognition use cases

OCR can be used for a variety of applications, including:
  • Indexing print material for search engines.
  • Deciphering handwritten documents into text that can be read aloud to visually-impaired or blind users.
  • Archiving historic information, such as newspapers, magazines or phonebooks, in searchable formats.
  • Electronically depositing checks.
  • Recognizing text, such as license plates, with a camera or software.
  • Sorting letters for mail delivery.
  • Translating words within an image into a specified language.


Comments

Popular posts from this blog

Black swan

A  black swan event  is an incident that occurs randomly and unexpectedly and has wide-spread ramifications. The event is usually followed with reflection and a flawed rationalization that it was inevitable. The phrase illustrates the frailty of inductive reasoning and the danger of making sweeping generalizations from limited observations. The term came from the idea that if a man saw a thousand swans and they were all white, he might logically conclude that all swans are white. The flaw in his logic is that even when the premises are true, the conclusion can still be false. In other words, just because the man has never seen a black swan, it does not mean they do not exist. As Dutch explorers discovered in 1697, black swans are simply outliers -- rare birds, unknown to Europeans until Willem de Vlamingh and his crew visited Australia. Statistician Nassim Nicholas Taleb uses the phrase black swan as a metaphor for how humans deal with unpredictable events in his 2007...

A Graphics Processing Unit (GPU)

A graphics processing unit (GPU) is a computer chip that performs rapid mathematical calculations, primarily for the purpose of rendering images. A GPU may be found integrated with a central processing unit (CPU) on the same circuit, on a graphics card or in the motherboard of a personal computer or server. In the early days of computing, the CPU performed these calculations. As more graphics-intensive applications such as AutoCAD were developed; however, their demands put strain on the CPU and degraded performance. GPUs came about as a way to offload those tasks from CPUs, freeing up their processing power. NVIDIA, AMD, Intel and ARM are some of the major players in the GPU market. GPU vs. CPU A graphics processing unit is able to render images more quickly than a central processing unit because of its parallel processing architecture, which allows it to perform multiple calculations at the same time. A single CPU does not have this capability, although multi...

6G (sixth-generation wireless)

6G (sixth-generation wireless) is the successor to 5G cellular technology. 6G networks will be able to use higher frequencies than 5G networks and provide substantially higher capacity and much lower latency. One of the goals of the 6G Internet will be to support one micro-second latency communications, representing 1,000 times faster -- or 1/1000th the latency -- than one millisecond throughput. The 6G technology market is expected to facilitate large improvements in the areas of imaging, presence technology and location awareness. Working in conjunction with AI, the computational infrastructure of 6G will be able to autonomously determine the best location for computing to occur; this includes decisions about data storage, processing and sharing.  Advantages of 6G over 5G 6G is expected to support 1 terabyte per second (Tbps) speeds. This level of capacity and latency will be unprecedented and wi...