Rub shoulders with leading experts and industry disruptors at TNW Conference →

The heart of tech

This article was published on March 1, 2011


Google Docs OCR now available in 34 languages

Google Docs OCR now available in 34 languages
Francis Tan
Story by

Francis Tan

Francis Tan is the Asia editor of TNW, who is based in the Philippines. He is particularly interested in Asian Internet startups, social me Francis Tan is the Asia editor of TNW, who is based in the Philippines. He is particularly interested in Asian Internet startups, social media and e-commerce. Get in touch with him via Twitter @francistan or Email [email protected].

Google Docs introduced its Optical Character Recognition (OCR) feature last year, which allows users to upload images and PDF files that are automatically converted to editable text. Google just recently announced that the feature is being extended to 29 additional character sets, including Asian and European languages–a grand total of 34 languages.


You can check out the full list in the upload page. According to the Google blog, this update will result in an improvement in OCR quality for languages that were previously supported, as well as improvements to the way it imports formatting, fonts, and alignment from your documents. Naturally, users can get best results with sharp, high-resolution images or PDF files.

The company said it will keep adding languages and continue to improve speed and accuracy.

Also tagged with