Google Releases Tesseract OCR Software

Sep 6, 2006 | 2,477 views | by Navneet Kaushal
VN:F [1.9.13_1145]
Rating: 0.0/5 (0 votes cast)

Google has recently re-released the Tesseract OCR software to the open source community. OCR or optical character recognition is a sophisticated technique that helps digitally converting physical text into computer based text. Physical text is passe. With the OCR software you can now store a bulk of your earlier papers in digital formats.

Google has also reported that they are not the original developer of the OCR software. This particular Tesseract OCR software was originally developed at the Hewlett Packard Laboratories during 1985 – 1995. But unfortunately HP got out of the Tesseract OCR software business and the software was unused till Google's recent re-launch of the software.

The Tesseract OCR software supports only one language, i.e. English. The software may not include a page layout analysis module but it's far more accurate than any Open Source OCR package available in the market.

Recommend this story

Navneet Kaushal

About the author:

Navneet Kaushal, CEO PageTraffic is a trusted authority in the search engine marketing industry. He is a featured author at Web Pro News, Search Newz, Website Notes, DevWebPro, SEO Article and Web Help Now among many others.

Related Articles

  • No Related Post

{ 2 comments… read them below or add one }

Cyber and Technology December 11, 2009 at 01:09

Good article, can you add me as a friend?

Reply

Chris webb January 12, 2010 at 11:18

Thanks for sharing this, somehow I think a new breed of OCR usage applications maybe on the horizon.

Reply

Leave a Comment