Measuring Porn on the Web?

Fri Nov 17, 2006 6:35AM EST

See Comments (3)

Measuring anything on the web is part art and part science. Measuring how much pornography is on the web taxes the art of both. In part, that's because pornography is, at best, loosely defined and subject to local standards. In part, it's because some part of the pornography that's found on the web is not found in the places you might normally measure.

And yet, a U.S. government-commissioned study just identified the percentage of pornography on the web. The study says that about 1 percent of the web contains sexually explicit content.

What is 1 percent of the web? Tough to say. If you do a search on the letter a, the most common letter in the English language, you get 18.5 billion pages. Search on the word porn and you get 88 million pages. Search on porn or xxx and you get 186 million. Roughly 1 percent. If you add the word sex and search for porn or xxx or sex you get something more like 4.4 percent. The first probably undercounts and the second probably overcounts; the truth is probably in the middle. Not a trivial number of pages, but sexually explicit material probably comprises only a small percentage of the entire web.

But that's not the only problem with the study. I can't help but think that if 1 percent of indexed web pages contains sexually explicit content, it's either a very "in your face" 1 percent or that the study might have missed major sources of explicit material.

What sorts of "others" should they factor in? Ads, for one. As I was researching this column for news stories, the ad search results on Google displayed an ever-changing array of ads for pornographic material. Peer-to-peer networks, for another. Sexually explicit images are readily available on many of these networks. Social networking sites like MySpace pages, which also wouldn't show up in an indexed search, might also contain sexually explicit images. A recent survey done at Fresno State that looked at 700 MySpace pages found that 59 percent of the individual pages included risqué/sexual poses, 9 percent included links to pornographic sites, and 6 percent had full frontal nudity of females. 

There are plenty of other reasons for materials not to be indexed as pages: eBay-like sites that compile a set of products for you and are indexed differently. Finally, sites can simply ask that their pages not be indexed by a search engine. I'm sure that others can add to my list.

The study was introduced in court this week as evidence in a complicated ongoing case where the Justice Department is hoping to revive the 1998 Child Online Protection Act. The Act would have required commercial web sites to collect a credit card number or other proof of age before allowing Internet users to view material deemed "harmful to minors." The law was blocked by the Supreme Court in 2004, when it ruled that free speech rights of adults would be hurt and that technologies like filtering software might work better than any legislation.

While the study offers a data point, it needs to be taken within the larger context of how people really see what they see on the Internet. I don't know about you, but if 99 percent of the web is free of sexually explicit material I must be hanging out in the wrong percent.

Top 5 Posts

Comments on Measuring Porn on the Web?

Post a Comment

Join in the discussion. Here you'll see the comments in the order they were posted.

  • 1 Posted by buckthebigman on Thu Sep 3, 2009 3:15PM EDT Report Abuse

    Very interesting, but one correction. "E" is the most commonly used letter in the English language, "A" is actually third, behind "T". A simple check of any letter frequency table will confirm this. I found the article both useful and informative. good reading!

  • 2 Posted by ytech_robinraskin on Thu Sep 3, 2009 10:58PM EDT Report Abuse

    checking to see if that changes things, thanks. Ever since Google stopped reporting "The McDonald's ay" (I think they dropped off somewhere around 8.5 million pages served) I've been looking for a way to count.

  • 3 Posted by maxwell_burke on Thu Sep 3, 2009 7:12PM EDT Report Abuse

    Here's a slightly different take on the same study: http://www.networkworld.com/community/?q=node/9224

More Posts: 1

Post a Comment


My Tech

Please enable your browser's cookies to activate the My Tech column.

Also on Yahoo! Tech

Computers Home Office Wi-Fi & Networking Phones & PDAs Cameras & Camcorders TV & Home Theater Portable Audio
 

Question and Answer content at Yahoo! Tech is written by Yahoo! users at Yahoo! Answers. Yahoo! does not evaluate or guarantee the accuracy of any Yahoo! Answers content. For more information, read the Full Disclaimer.

Opinions expressed by the Advisors are their own and do not necessarily reflect the views of Yahoo! Inc. Yahoo! receives no compensation from any manufacturer or distributor nor does it compensate any Advisor for the coverage of any product or service in any Advisor's content.