1. Huawei FusionInsight HD Documentation: The official product documentation confirms that the Streaming service is based on Apache Storm and is used for real-time stream data processing.
Source: Huawei FusionInsight HD 6.5.1 Product Description
Section: 2.3.5 Streaming, "Streaming is a distributed, high-reliability, and real-time stream data processing component developed based on Apache Storm." This section explicitly defines its purpose for real-time applications, contradicting the "offline" claim.
2. Official Apache Storm Project Documentation: The primary source for Apache Storm defines it as a real-time system.
Source: Apache Storm Official Website (storm.apache.org)
Reference: The homepage introduction states, "Apache Storm is a free and open source distributed realtime computation system. Apache Storm makes it easy to reliably process unbounded streams of data, doing for realtime processing what Hadoop did for batch processing." This directly contrasts Storm (real-time) with batch (offline) systems.
3. Peer-Reviewed Academic Publication: Academic literature consistently categorizes Storm as a stream processing engine.
Source: Toshniwal, A., et al. (2014). Storm @Twitter. Proceedings of the 2014 ACM SIGMOD International Conference on Management of Data.
DOI: https://doi.org/10.1145/2588555.2595641
Reference: Section 1 (Introduction), "Storm is a distributed and fault-tolerant real-time computation system... Storm is designed for use cases that require real-time analysis of streams of data." The paper consistently refers to Storm's real-time capabilities, which are distinct from offline batch processing.
4. University Courseware: Reputable academic courses on big data systems distinguish between stream (real-time) and batch (offline) processing frameworks.
Source: University of California, Berkeley, CS 186/286 Introduction to Database Systems.
Reference: Lecture notes on "Data Models and Query Languages for Big Data" typically categorize systems like Hadoop MapReduce as "Batch Processing" and systems like Apache Storm and Spark Streaming as "Stream Processing," highlighting the fundamental architectural and use-case differences between offline and real-time frameworks.