Please note: In order to keep Hive up to date and provide users with the best features, we are no longer able to fully support Internet Explorer. The site is still available to you, however some sections of the site may appear broken. We would encourage you to move to a more modern browser like Firefox, Edge or Chrome in order to experience the site fully.

Data Stream Management, PDF eBook

Data Stream Management PDF

Part of the Synthesis Lectures on Data Management series

PDF

Please note: eBooks can only be purchased with a UK issued credit card and all our eBooks (ePub and PDF) are DRM protected.

Description

Many applications process high volumes of streaming data, among them Internet traffic analysis, financial tickers, and transaction log mining.

In general, a data stream is an unbounded data set that is produced incrementally over time, rather than being available in full before its processing begins.

In this lecture, we give an overview of recent research in stream processing, ranging from answering simple queries on high-speed streams to loading real-time data feeds into a streaming warehouse for off-line analysis.

We will discuss two types of systems for end-to-end stream processing: Data Stream Management Systems (DSMSs) and Streaming Data Warehouses (SDWs).

A traditional database management system typically processes a stream of ad-hoc queries over relatively static data.

In contrast, a DSMS evaluates static (long-running) queries on streaming data, making a single pass over the data and using limited working memory.

In the first part of this lecture, we will discuss research problems in DSMSs, such as continuous query languages, non-blocking query operators that continually react to new data, and continuous query optimization.

The second part covers SDWs, which combine the real-time response of a DSMS by loading new data as soon as they arrive with a data warehouse's ability to manage Terabytes of historical data on secondary storage.

Table of Contents: Introduction / Data Stream Management Systems / Streaming Data Warehouses / Conclusions

Information

Other Formats

Information

Also in the Synthesis Lectures on Data Management series