Keep the Internet free and open
If you agree and want to support a free and open Internet too, I invite you to join us by signing the petition at google.com/takeaction. Please make your voice heard and spread the word.
Thanks to the Internet, trade has never been easier. The ability to trade goods and services online has helped companies large and small to reach a global marketplace. And the web has also enabled another important cross-border transaction: the free flow of information without restriction.
This month, yet another country acknowledged the importance of having a consistent framework for cross-border flows of goods, services, and information. Mauritius is the first African country to sign a joint agreement with the U.S. that supports government transparency, open Internet networks, and cross-border information flows. This agreement has significant implications for Mauritius’ economy. While South Africa hasn’t yet fully embraced the Internet, the sector already contributes up to 2 percent (or $7.1 billion/R59-billion) of the country’s GDP, according to a recent report by World Wide Worx. As the Internet grows, countries that are open to the free flow of goods and information will enable their businesses to trade, negotiate and advertise freely. In the long run, these solid business practices will lead to more exports and more jobs. We encourage more governments and industries to take action so that their citizens have access to the Internet and their businesses are able to sell goods and services across borders, with the help of the Internet.
(Cross-posted on the Official Google Blog)
We find that there are relatively few malicious players, who make multiple attempts to bypass our defenses to defraud users. As we get better and faster at catching these advertisers, they redouble their efforts and create more accounts at an even faster rate.
Even in this ever-escalating arms race, our efforts are working. One method we use to test the success of our efforts is to ask human raters to tell us how we’re doing. These human raters review a set of sites that are advertised on Google. We use a large set of sites in order to get an accurate statistical reading of our efforts. We also weight the sites in our statistical sample based on the number of times a particular site was displayed so that if a particular site is shown more often, it’s more likely to be in our sample set. By using human raters, we can calibrate our automated systems and ensure that we’re improving our efforts over time. In 2011, we reduced the percentage of bad ads by more than 50 percent compared with 2010. That means the proportion of bad ads that are showing on Google was halved in just a year.
Google’s long-term success is based on people trusting our products. We want to make sure that the ads on Google are safe and trustworthy, and we’re not satisfied until we do.
For example, in the case of a site that is selling counterfeit goods, this three-pronged approach aims to look for patterns that would flag such a site and help prevent ads from showing. Ad review notices patterns in the ads and keywords selected by the advertiser. Site review analyzes the entire site to determine if it is selling counterfeit goods. Account review aims to determine if a new advertiser is truly new, or is simply a repeat offender trying to abuse Google’s advertising system. Here’s more detail on how we review each of these three components.
Ad Review
An ad is the snippet of information presented to a user, along with a link to a specific webpage, or landing page. The ads review system inspects individual ads and landing pages, and is probably the system most familiar to advertisers. When an advertiser submits an ad, our system immediately performs a preliminary examination. If there’s nothing in the ad that flags a need for further review, we tell the advertiser the ad is “Eligible” and show the ad only on google.com to users who have SafeSearch turned off. If the ad is flagged for further review, in most cases we refer to the ad as “Under Review” and don’t show the ad at all. From there, the ad enters our automated pipeline, where we employ machine learning models, a rules engine and landing page analysis to perform a more extensive examination. If our automated system determines an outcome with a high degree of confidence, we will either approve the ad to run on Google and all of our partners (“Approved”), approve the ad to show for appropriate users in specific locations (“Approved - Limited”) or reject the ad (“Disapproved”). If our automated system isn’t able to determine the outcome, we send the ad to a real person to make a final decision.
Site Review
A site has many different pages, each of which could be pointed to by different ads, often known as a domain. Our site review system identifies policy issues which apply to the whole site. It aggregates sites across all ads from all advertisers and regularly crawls them, building a repository of information that’s constantly improving as new scams and new sites are examined. We store the content of advertised sites and use both machine learning models and a rules engine to analyze the sites. The magic of the site review system is it understands the structure of language on webpages in order to classify the content of sites. Site review will determine whether or not an entire site should be disabled, which would prevent any ads leading to that site showing from any account. When the automated system isn’t able to determine the outcome with a high degree of confidence, we send it to a real person to make a decision. When a site is disabled, we tell the advertiser that it’s in violation of “Site Policy.”
Account Review
An account is one particular advertiser’s collection of ads, plus the advertiser’s selections for targeting and bidding on those ads. An account may have many ads which may point to several different sites, for example. The account review system constantly evaluates individual advertiser accounts to determine if the whole account should be inspected and shut down for policy violations. This system “listens” to a variety of signals, such as ads and keywords submitted by the advertiser, budget changes, the advertiser’s address and phone number, the advertiser’s IP address, disabled sites connected to this account, and disapproved ads. The system constantly re-evaluates all accounts, incorporating new data. For example, if an advertiser logs in from a new IP address, the account is re-evaluated to determine if that new signal suggests we should take a closer look at the content of the advertiser’s account. If the account review system determines that there is something suspect about a particular account with a high degree of confidence, it automatically suspends the account. If the system isn’t sure, it stops the account from showing any ads at all and asks a real person to decide if the account should be suspended.
Even with all these systems and people working to stop bad ads, there still can be times when an ad slips through that we don’t want. There are many malicious players who are very persistent—they seek to abuse Google’s advertising system in order to take advantage of our users. When we shut down a thousand accounts, they create two thousand more using different patterns. It’s a never-ending game of cat and mouse.
We’ve put a great deal of effort and expense into building these systems because Google’s long-term success is based on the trust of people who use our products. I’ve focused my time and energy in this area for many years. I find it inspiring to fight the good fight, to focus on the user, and do everything we can to help prevent bad ads from running. I’ll continue to post here from time to time with additional thoughts and greater information about how we make ads safer by detecting and removing scam ads.