Searching Deep and Dark: Building A Google for The Less Visible Parts of The Web
Christian Mattmann, 11 Jan 17
       

A geographical map depicting hotbeds of dark web activity related to illegal products. Larger circles indicate more activity. Christian Mattmann, CC BY-SA

In today’s data-rich world, companies, governments and individuals want to analyze anything and everything they can get their hands on – and the World Wide Web has loads of information. At present, the most easily indexed material from the web is text. But as much as 89 to 96 percent of the content on the internet is actually something else – images, video, audio, in all thousands of different kinds of nontextual data types.

Further, the vast majority of online content isn’t available in a form that’s easily indexed by electronic archiving systems like Google’s. Rather, it requires a user to log in, or it is provided dynamically by a program running when a user visits the page. If we’re going to catalog online human knowledge, we need to be sure we can get to and recognize all of it, and that we can do so automatically.

How can we teach computers to recognize, index and search all the different types of material that’s available online? Thanks to federal efforts in the global fight against human trafficking and weapons dealing, my research forms the basis for a new tool that can help with this effort.

Understanding what’s deep

The “deep web” and the “dark web” are often discussed in the context of scary news or films like “Deep Web,” in which young and intelligent criminals are getting away with illicit activities such as drug dealing and human trafficking – or even worse. But what do these terms mean?

Sign in to view full article

       
How To Calculate The Economic Impact Of Grief
The death of a child is one of the most traumatic experiences that a parent can experience. Those who do ...
Gerard Van den Berg
Sat, 14 Jan 17
Your Next Social Network Could Pay You For Posting
You may well have found this article through Facebook. An algorithm programmed by one of the world’s biggest companies now ...
Jelena Dzakula
Wed, 1 Feb 17
Petition Urges Xi Jinping to End Forced Organ Harvesting of Falun Gong Practitioners
NEW YORK—A petition that has garnered nearly 6,000 signatures in just 2 days calls for President Donald Trump to help ...
Bowen Xiao
Mon, 10 Apr 17
Young Workers Expect Their Older Colleagues to Get Out of The Way
There are many names for the narratives pitting the older generation against the younger: Gen-Y versus Baby Boomers, “Generation Me” ...
Michael North
Wed, 15 Mar 17
Singapore’s Ageing Population, a Challenge for Hospitals and Nurses
The increase in hospital admission and ensuing demands on intensive medical care will trigger the need for more hospital beds: ...
Epoch Newsroom
Mon, 2 Jan 17
An Epoch Times Survey
Advertise with Us
An Epoch Times Survey
Read about Forced Organ Harvesting
Sports Elements
Sports Elements