Program
Tutorial 1:
Data Stream Management Systems - Concepts, Prototypes, and Applications
Vera Goebel and Thomas Plagemann - University of Oslo
Tutorial 2:
Advances in Mass Media Delivery to Mobiles
Rod Walsh, Toni Paila, and Harsh Mehta - Nokia Corp.
Tutorial 1:
Data Stream Management Systems - Concepts, Prototypes, and Applications
Speakers
Vera Goebel and Thomas Plagemann
University of OsloDepartment of Informatics
http://heim.ifi.uio.no/~{goebel,plageman}
Files
[Slides, References]
Abstract
In the last decade, a new class of data-intensive applications, like sensor networks, network traffic analysis, financial tickers, web or telecommunications transaction log analysis, has become widely recognized. These applications require support for on-line analysis of rapidly changing data streams. However, traditional database management systems (DBMSs) have no pre-defined notion of time and cannot handle data on-line (i.e., in main memory without storing the data on disk) in near real-time. During the last five years, Data Stream Management Systems (DSMSs) have been developed to handle transient data streams on-line and to process continuous queries on these data streams. Currently, first prototypes and systems are becoming available.
The fundamental difference between a classical DBMS and a DSMS is the data stream model. Instead of processing a query over a persistent set of data that is stored in advance on disk, queries are performed in DSMSs over a data stream. In a data stream, data elements arrive on-line and stay only for a limited time period in memory. Consequently, the DSMS has to handle the data elements before the buffer is overwritten by new incoming data elements. The order in which the data elements arrive cannot be controlled by the system. Once a data element has been processed it cannot be retrieved again without storing it explicitly. The size of data streams is potentially unbounded and can be thought of as an open-ended relation. In DSMSs, continuous queries evaluate continuously the arriving data elements. Standard operator types that are supported by most existing DSMSs are filtering, mapping, aggregates, and joins. Since continuous streams may not end, intermediate results of continuous queries are often generated over a predefined window and then either stored, updated, or used to generate a new data stream of intermediate results. Window techniques are especially important for aggregation and join queries. Examples for DSMSs include STREAM, GigaScope, and TelegraphCQ.
Since DSMSs are a promising technology, for example for traffic analysis, we have performed during our sabbatical at Institute Eurecom in 2003/2004 an experimental study to analyze the advantages and limitations of using public domain DSMSs for concrete network tomography problems. In a nutshell, the results of this study demonstrated that both on-line analysis with DSMSs and off-line analysis with DBMSs of network traces could be very helpful, but to utilize these tools efficiently, systematic approaches and methodologies have to be developed, and current DSMS prototypes have to be improved, in particular with respect to performance and scalability.
Goal
It is the goal of this tutorial to explain the main concepts of DSMS, give an overview of the state-of-the-art and the existing prototypes and to show applications of DSMS technology. Since we gained practical experience with using the DSMS TelegraphCQ for network traffic analysis, we will go into more depth with examples of this application domain, but we will also give an overview on other application domains for DSMSs. Thus, the participants should be enabled to evaluate whether current or future DSMSs might be helpful for their research.
Preliminary outline
- Introduction
- Concepts
- DBMS vs. DSMS
- Data models
- Query languages
- Sliding windows
- Data reduction algorithms
- Prototypes
- TelegraphCQ
- Stream
- GigaScope
- Applications
- Network traffic analysis
- Sensor networks
- Summary and outlook
Target audience
Researchers that are interested in this new technology, either just to learn about it, or to understand whether and how DSMSs might be used for the current and future research. A basic understanding of the core notions of relational DBMSs will be helpful, but are not mandatory.
Tutorial 2:
Advances in Mass Media Delivery to Mobiles
Speakers
Rod Walsh, Toni Paila, and Harsh Mehta
Nokia Corp.Files
[Slides]
Abstract
Data delivery has been recognised as a critical enabler for mass media applications on mobile terminals. Standards bodies such as the IETF, 3GPP, ETSI-DVB and OMA have set about creating a set of standards for mass media delivery, applications and the related technologies. The first standards based on these technologies are now becoming available. Also, several implementations of these technologies (open and otherwise) exist.
We examine the need for multicast technologies and applications in the mobile domain and present the need for standardized solutions for these technologies.
Starting from the application layer, Internet Media Guides (IMG) and means for content discovery play an important role in multicast applications and user access to these applications. The Internet Media Guides are currently being developed in the IETF. This includes work on the requirements, framework for IMGs, protocol work, the system architecture and the data models and metadata.
FLUTE and RTP are the delivery methods currently endorsed by the IETF (as well as other bodies) for file delivery and streaming respectively. Other delivery methods are also discussed. While traditional multicast technologies assume a two-way end-to-end channel, mass media multicast technologies require that transport techniques can handle the unidirectional environment.
At the physical layer, architecture, use cases and other specialisations for IP Datacast systems have been proposed in DVB (DVB-H). The primary use case of IP Datacst is for applications such as large-scale file delivery or real-time streaming to mobile terminals. 3GPP is also currently working on architecture, use cases and specialisations for mass multimedia based on 3rd generation cellular technologies. Deployments of many of the IP-based building blocks of both the MBMS and DVB IP Datacast systems currently exist.
There is now a strong need to stimulate involvement from a variety of resrach centres in identifying future areas of research and development contributions in the field.
Goal
The goal of this tutorial is to create awareness of the current standardization efforts surrounding the area of Mass Media Delivery for mobile devices. The standardisation fora dealt with in this presentation are the IETF, 3GPP and DVB. The presentation provides an overview of the requirements and existing work in the area. Further, we provide a detailed view on the currently standardised solutions as well as other existing solutions. We also outline the timeline in which the standards are set to be finalised as well as the expected future developments and research in the area.
Preliminary outline
- Mass Media and the need for multicast applications
- Content discovery and Internet Media Guides
- existing work
- requirements
- framework
- protocol work
- system architecture
- introducing data models and metadata
- Common delivery methods
- FLUTE, RTP and the out-of-favour alternatives
- in brief and notes on unidirectional vs bidirectional multicast
- IP Datacast (DVB-H) Systems
- architecture
- deployments
- primary use cases
- specialisations
- MBMS (3GPP Multicast) Systems
- architecture
- deployments
- primary use cases
- specialisations
- Timeline, future expectations and future research space
- expectations and stimulating involvement and contributions to the field
Target audience
Researchers interested in the current status of these technologies and the status of the standards and to use this as a basis for future research.