Google Acquires reCAPTCHA

Thursday, September 17, 2009 13:27
Posted in category Google, News
Google acquires reCAPTCHA

Google acquires reCAPTCHA

The image above is a CAPTCHA — you can read it, but computers have a harder time interpreting the letters. We tried to make it hard for computers to recognize because we wanted to give humans the scoop first, but we’re happy to announce to everybody now that Google has acquired reCAPTCHA, a company that provides CAPTCHAs to help protect more than 100,000 websites from spam and fraud.

Since computers have trouble reading squiggly words like these, CAPTCHAs are designed to allow humans in but prevent malicious programs from scalping tickets or obtain millions of email accounts for spamming. But there’s a twist — the words in many of the CAPTCHAs provided by reCAPTCHA come from scanned archival newspapers and old books. Computers find it hard to recognize these words because the ink and paper have degraded over time, but by typing them in as a CAPTCHA, crowds teach computers to read the scanned text.

In this way, reCAPTCHA’s unique technology improves the process that converts scanned images into plain text, known as Optical Character Recognition (OCR). This technology also powers large scale text scanning projects like Google Books and Google News Archive Search. Having the text version of documents is important because plain text can be searched, easily rendered on mobile devices and displayed to visually impaired users. So we’ll be applying the technology within Google not only to increase fraud and spam protection for Google products but also to improve our books and newspaper scanning process.

That’s why we’re excited to welcome the reCAPTCHA team to Google, and we’re committed to delivering the same high level of performance that websites using reCAPTCHA have come to expect. Improving the availability and accessibility of all the information on the Internet is really important to us, so we’re looking forward to advancing this technology with the reCAPTCHA team.

Source: googleblog.blogspot.com/2009/09/teaching-computers-to-read-google.html

Share and Enjoy:
  • Print
  • Digg
  • Propeller
  • del.icio.us
  • Facebook
  • Mixx
  • Google Bookmarks
  • Sphinn
  • Live
  • MySpace
  • Netvibes
  • NewsVine
  • StumbleUpon
  • Technorati
  • Twitter
  • Wikio
  • Yahoo! Bookmarks
You can leave a response, or trackback from your own site.

5 Responses to “Google Acquires reCAPTCHA”

  1. Timothy says:

    September 17th, 2009 at 2:57 pm

    :-( … It’s like adding a spy code to 100K sites instantly..

  2. Carman says:

    September 17th, 2009 at 2:58 pm

    Where’s the DoJ and EU anti-trust when you need them? Cannot believe this. First Urchin, now this. Is anything going to remain independent anymore?

  3. Saddy says:

    September 17th, 2009 at 2:58 pm

    I read the other day how (apparently) some software can happly read captchas quite well, and this has been going on for a while now.

  4. Chris says:

    September 17th, 2009 at 2:59 pm

    That makes sense. ReCaptcha’s aim was to use the captchas to improve algos for ocr technology, kind of self defeating, but I guess there was a bigger picture. I imagine it will now be put to good use scanning books.

  5. John says:

    September 17th, 2009 at 3:01 pm

    The best captchas are the ones you create yourself. A simple question will most likely suffice (e.g. What color is the sky?). I don’t think hackers will waste their time hacking one specific algorithm, or system, unless you have a super popular website. Even then, you can simply rotate the questions.

    Another good one is to require a person to enter a specific word that can be found by clicking on a link.

Leave a Reply