Dong Wang is an Assistant Professor at the Department of Computer Science and Engineering, the University of Notre Dame. He received his Ph.D. in Computer Science from University of Illinois at Urbana Champaign (UIUC) in 2012, an M.S. degree from Peking University in 2007 and a B.Eng. from the University of Electronic Science and Technology of China in 2004, respectively. Dong Wang has published over 30 technical papers in conferences and journals, including IPSN, ICDCS, IEEE JSAC, IEEE J-STSP, and ACM ToSN. His research on social sensing resulted in software tools that found applications in academia, industry, and government research labs. His work was widely reported in talks, keynotes, panels, and tutorials, including at IBM Research, ARL, CPSWeek, RTSS, IPSN, and the University of Michigan, to name a few. Wang's interests lie in developing analytic foundations for reliable information distillation systems, as well as the foundations of data credibility analysis, in the face of noise and conflicting observations, where evidence is collected by both humans and machines.
Increasingly, human beings are sensors engaging directly with the mobile Internet. Individuals can now share real-time experiences at an unprecedented scale. Social Sensing: Building Reliable Systems on Unreliable Data looks at recent advances in the emerging field of social sensing, emphasizing the key problem faced by application designers: how to extract reliable information from data collected from largely unknown and possibly unreliable sources. The book explains how a myriad of societal applications can be derived from this massive amount of data collected and shared by average individuals. The title offers theoretical foundations to support emerging data-driven cyber-physical applications and touches on key issues such as privacy. The authors present solutions based on recent research and novel ideas that leverage techniques from cyber-physical systems, sensor networks, machine learning, data mining, and information fusion. - Offers a unique interdisciplinary perspective bridging social networks, big data, cyber-physical systems, and reliability- Presents novel theoretical foundations for assured social sensing and modeling humans as sensors- Includes case studies and application examples based on real data sets- Supplemental material includes sample datasets and fact-finding software that implements the main algorithms described in the book
A new information age
Abstract
Social sensing broadly refers to a set of sensing and data collection paradigms where data are collected from humans or devices on their behalf. In this chapter, we first give an overview of social sensing as an emerging research field and identify the data reliability problem as a fundamental research challenge in this field. This challenge, if successfully addressed, engenders a paradigm shift in social sensing by allowing development of dependable applications with guaranteed correctness properties that rely on the collective observations of untrained, average, and largely unreliable sources. Followed by the overview, we go over the motivations of social sensing applications and discuss several key challenges and state-of-the-art techniques centered on the data reliability problem. In the end of the chapter, we review the organization of the whole book chapter by chapter.
Keywords
Social sensing
New information age
Introduction
Motivation
Challenges
State-of-the-art
Twenty years ago, your best bet to find information was to go to a library, search through a pile of index cards, find your way through rows of shelves, and borrow a book. Ten years ago, your best bet to let your friends know of your latest endeavors was to call them. These days are long gone.
One of the most remarkable advances in the last decade was the advent of an age of real-time broadcast. An author of this book remembers a recent ride in a hot air balloon that he happened to share with a newly-wed couple. They were taking a lot of pictures. When the ride was over, they were met with friends. The first question they asked their friends was: “Have you seen our air pictures on Facebook?.” Indeed, information about the experience had already preceded them to the ground.
The age of modern broadcast is enabled by a confluence of three technological advances. First, we now have an unprecedented connectivity on the move, apparently, even while in a hot air balloon. Second, we have an increasingly rich set of options for broadcasting information. Twitter, Facebook, YouTube, Instagram, Flickr, and Foursquare are just a few examples. Finally, we live in a world of information acquisition (i.e., sensing) devices that we use on daily basis. Cameras, GPS devices, fitbits, smart watches, and Internet-connected cars are generating significant amounts of data that we may or may not choose to share.
A direct consequence of the age of broadcast is information overload. In point-to-point communication (such as a phone call), a minute consumed by the initiator corresponds to a minute consumed by the responder. Hence, a balance exists between the collective capacity consumed at sources and the collective capacity consumed at sinks. In contrast, on broadcast channels, for every minute consumed at the broadcast source, hours may be collectively consumed by the community of receivers. For example, if a 1-minute broadcast message is read by 1000 recipients, then 1000 minutes are collectively consumed at the receivers for the one minute spent at the source. This is fine when the number of sources is a lot smaller than the number of receivers (e.g., think of the number of radio stations compared to the number of listeners). However, in the current age of democratized real-time broadcast, everyone can be a source. A survey in 2014, suggested that more than 1 Billion users were on Twitter, more than 1.3 Billion on Facebook, and more than 1.5 Billion on Google+. On Twitter, your message is visible to all. The underlying paradigm is one of global broadcast. Given that everyone can be a broadcast source, the balance between production and consumption is disrupted. The proliferation of sensing devices further exacerbates the imbalance. Consequently, a gap widens between the capacity of sources to generate information and the capacity of sinks to consume it. This widening gap heralds a new era of services whose main function is to summarize large amounts of broadcast data into a smaller amount of actionable information for receivers.
A key class of summarization services in the new age of real-time broadcast are services that attain situation awareness. Much of the information uploaded on social media constitutes acts of sensing. In other words, sources report observations they made regarding their physical environment and found worthwhile to comment on. We call this phenomenon, social sensing. In their raw form, however, these observations are not very useful and generally lack reliability. There are conflicting sentiments, conflicting claims, missing data, purposeful pieces of misinformation, and other noise.
Table 1.1 shows examples of tweets collected from Syria on Twitter in August 2013, in the week following the sudden deaths of thousands of citizens in Ghouta; a suburb near the capital, Damascus. It can be seen that many different claims are posted, some of which are true but others are pure conjecture, rumors, or even intentionally planted misinformation. Hence, the use of social sensing data, such as tweets on Twitter, for enhancing situation awareness must be done with care. A key problem is to extract reliable information from large amounts of generally less reliable social sensing data. Recently, significant advances were made on this topic.
Table 1.1
A Twitter Example
Medecins Sans Frontieres says it treated about 3,600 patients with ‘neurotoxic symptoms’ in Syria, of whom 355 died http://t.co/eHWY77jdS0
Weapons expert says #Syria footage of alleged chemical attack “difficult to fake” http://t.co/zfDMujaCTV
U.N. experts in Syria to visit site of poison gas attack http://t.co/jol8OlFxnf via @reuters #PJNET
Syria Gas Attack: ‘My Eyes Were On Fire’ http://t.co/z76MiHj0Em
Saudis offer Russia secret oil deal if it drops Syria via Telegraph http://t.co/iOutxSiaRs
Long-term nerve damage feared after Syria chemical attack http://t.co/8vw7BiOxQR
Syrian official blames rebels for deadly attack http://t.co/76ncmy4eqb
Assad regime responsible for Syrian chemical attack, says UK government http://t.co/pMZ5z7CsNZ
Syrian Chemical Weapons Attack Carried Out by Rebels, Says UN (UPDATE) http://t.co/lN4CkUePUj #Syria http://t.co/tTorVFUfZF
US forces move closer to Syria as options weighed: WASHINGTON (AP) – U.S. naval forces are moving closer to Sy…http://t.co/F6UAAXLa2M
Putin Orders Massive Strike Against Saudi Arabia If West Attacks Syria http://t.co/SFLJ9ghwbt
400 tonnes of arms sent into #Syria through Turkey to boost Syria rebels after CW attack in Damascus – > http://t.co/KLwESYChCc
UN Syria team departs hotel as Assad denies attack http://t.co/O3SqPoiq0x
Vehicle of UN #Syria #ChemicalWeapons team hit by sniper fire. Team replacing vehicle & then returning to area.
International weapons experts leave Syria, U.S. prepares attack. More http://t.co/4Z62RhQKOE
Military strike on Syria would cause retaliatory attack on Israel, Iran declares http://t.co/M950o5VcgW
Asia markets fall on Syria concerns: Asian stocks fall, extending a global market sell-off sparked by growing …http://t.co/06A9h2xCnJ
Syria Warns of False Flag Chemical Attack!
UK Prime Minister Cameron loses Syria war vote (from AP) http://t.co/UlFF1wY9gx
We formally define social sensing as the act of collection of observations about the physical environment from humans or devices acting on their behalf. Assessing data reliability is a fundamental research challenge in this field. This challenge, if successfully addressed, may engender a paradigm shift in situation awareness by allowing development of dependable applications with guaranteed correctness properties that rely on the collective observations of untrained, average, and largely unreliable sources. In this chapter, we introduce this problem, go over the underlying technical enablers and motivations of social sensing applications, and discuss several key challenges and state-of-the-art solutions. We then review the organization of the book, chapter by chapter, to offer a reading guide for the remainder of the book.
1.1 Overview
The idea of leveraging the collective wisdom of the crowd has been around for some time [1, 2]. Today, massive amounts of data are continually being collected and shared (e.g., on social networks) by average individuals that may be used for a myriad of societal applications, from a global neighborhood watch to reducing transportation delays and improving the efficacy of disaster response. Little is analytically known about data validity in this new sensing paradigm, where sources are noisy, unreliable, erroneous, and largely unknown. This motivates a closer look into recent advances in social sensing with an emphasis on the key problem faced by application designers; namely, how to extract reliable information from data collected from largely unknown and possibly unreliable sources? Novel solutions that leverage techniques from machine learning, information fusion, and data mining recently offer significant progress on this problem and are described in this book.
In situations, where the reliability of sources is known, it is easy to compute the probability of correctness of different observations. Among other alternatives, one can use, say, Bayesian analysis to fuse data from...
Erscheint lt. Verlag | 17.4.2015 |
---|---|
Sprache | englisch |
Themenwelt | Informatik ► Datenbanken ► Data Warehouse / Data Mining |
Informatik ► Theorie / Studium ► Künstliche Intelligenz / Robotik | |
Mathematik / Informatik ► Informatik ► Web / Internet | |
ISBN-10 | 0-12-801131-9 / 0128011319 |
ISBN-13 | 978-0-12-801131-7 / 9780128011317 |
Haben Sie eine Frage zum Produkt? |
Größe: 14,2 MB
Kopierschutz: Adobe-DRM
Adobe-DRM ist ein Kopierschutz, der das eBook vor Mißbrauch schützen soll. Dabei wird das eBook bereits beim Download auf Ihre persönliche Adobe-ID autorisiert. Lesen können Sie das eBook dann nur auf den Geräten, welche ebenfalls auf Ihre Adobe-ID registriert sind.
Details zum Adobe-DRM
Dateiformat: PDF (Portable Document Format)
Mit einem festen Seitenlayout eignet sich die PDF besonders für Fachbücher mit Spalten, Tabellen und Abbildungen. Eine PDF kann auf fast allen Geräten angezeigt werden, ist aber für kleine Displays (Smartphone, eReader) nur eingeschränkt geeignet.
Systemvoraussetzungen:
PC/Mac: Mit einem PC oder Mac können Sie dieses eBook lesen. Sie benötigen eine
eReader: Dieses eBook kann mit (fast) allen eBook-Readern gelesen werden. Mit dem amazon-Kindle ist es aber nicht kompatibel.
Smartphone/Tablet: Egal ob Apple oder Android, dieses eBook können Sie lesen. Sie benötigen eine
Geräteliste und zusätzliche Hinweise
Zusätzliches Feature: Online Lesen
Dieses eBook können Sie zusätzlich zum Download auch online im Webbrowser lesen.
Buying eBooks from abroad
For tax law reasons we can sell eBooks just within Germany and Switzerland. Regrettably we cannot fulfill eBook-orders from other countries.
Größe: 17,1 MB
Kopierschutz: Adobe-DRM
Adobe-DRM ist ein Kopierschutz, der das eBook vor Mißbrauch schützen soll. Dabei wird das eBook bereits beim Download auf Ihre persönliche Adobe-ID autorisiert. Lesen können Sie das eBook dann nur auf den Geräten, welche ebenfalls auf Ihre Adobe-ID registriert sind.
Details zum Adobe-DRM
Dateiformat: EPUB (Electronic Publication)
EPUB ist ein offener Standard für eBooks und eignet sich besonders zur Darstellung von Belletristik und Sachbüchern. Der Fließtext wird dynamisch an die Display- und Schriftgröße angepasst. Auch für mobile Lesegeräte ist EPUB daher gut geeignet.
Systemvoraussetzungen:
PC/Mac: Mit einem PC oder Mac können Sie dieses eBook lesen. Sie benötigen eine
eReader: Dieses eBook kann mit (fast) allen eBook-Readern gelesen werden. Mit dem amazon-Kindle ist es aber nicht kompatibel.
Smartphone/Tablet: Egal ob Apple oder Android, dieses eBook können Sie lesen. Sie benötigen eine
Geräteliste und zusätzliche Hinweise
Zusätzliches Feature: Online Lesen
Dieses eBook können Sie zusätzlich zum Download auch online im Webbrowser lesen.
Buying eBooks from abroad
For tax law reasons we can sell eBooks just within Germany and Switzerland. Regrettably we cannot fulfill eBook-orders from other countries.
aus dem Bereich