Searching Deep and Dark: Building A Google for The Less Visible Parts of The Web
Christian Mattmann, 11 Jan 17
       

A geographical map depicting hotbeds of dark web activity related to illegal products. Larger circles indicate more activity. Christian Mattmann, CC BY-SA

In today’s data-rich world, companies, governments and individuals want to analyze anything and everything they can get their hands on – and the World Wide Web has loads of information. At present, the most easily indexed material from the web is text. But as much as 89 to 96 percent of the content on the internet is actually something else – images, video, audio, in all thousands of different kinds of nontextual data types.

Further, the vast majority of online content isn’t available in a form that’s easily indexed by electronic archiving systems like Google’s. Rather, it requires a user to log in, or it is provided dynamically by a program running when a user visits the page. If we’re going to catalog online human knowledge, we need to be sure we can get to and recognize all of it, and that we can do so automatically.

How can we teach computers to recognize, index and search all the different types of material that’s available online? Thanks to federal efforts in the global fight against human trafficking and weapons dealing, my research forms the basis for a new tool that can help with this effort.

Understanding what’s deep

The “deep web” and the “dark web” are often discussed in the context of scary news or films like “Deep Web,” in which young and intelligent criminals are getting away with illicit activities such as drug dealing and human trafficking – or even worse. But what do these terms mean?

Sign in to view full article

       
These Three Firms Own Corporate America
A fundamental change is underway in stock market investing, and the spin-off effects are poised to dramatically impact corporate America.
Jan Fichtner, Eelke Heemskerk, Javier Garcia
Tue, 16 May 17
When Things Go Wrong In An Automated World, Would We Still Know What To Do?
We live in a world that is both increasingly complex and automated. So just as we are having to deal ...
Peter Fisher
Mon, 27 Mar 17
Sustainable Shopping: For Eco-Friendly Jeans, Stop Washing Them So Often
Denim jeans – whether ripped, straight, flared, vintage or raw – are one of the world’s most-loved garments. But from ...
Alice Payne, Susannah Kate Devitt
Thu, 1 Jun 17
Searching Deep and Dark: Building A Google for The Less Visible Parts of The Web
In today’s data-rich world, companies, governments and individuals want to analyze anything and everything they can get their hands on ...
Christian Mattmann
Wed, 11 Jan 17
The Dead End of Communism
Communism is estimated to have killed at least 100 million people, yet its crimes have not been compiled and its ...
Epoch Times
Sat, 11 Feb 17
Join us today!
AcuSLIM - Acupuncture Weight Loss Programme
At Epoch Times, We Care :o)
Read about Forced Organ Harvesting
BUCHERER
Sports Elements