Hierarchical aggregation level

Time Series Hierarchies

When working with time series, a suitable definition of the time series must first be found. In addition to the granularity, for example, this also includes the selection of a suitable hierarchical level at which the data is to be aggregated. The term hierarchical level is illustrated with the following example.

A company has data on the sale of its products. The item description, the quantity sold, the customers and the date of sale are are recorded:

Article descriptionDate of saleQuantityCustomer
Article12020-01-0110Customer1
Article12020-01-015Customer2
Article12020-01-016Customer3
Article12020-01-0210Customer1
Article12020-01-027Customer4
Article12020-01-0320Customer1
Article12020-01-0410Customer1
Article12020-01-0520Customer1
Article22020-01-012Customer5
Article22020-01-022Customer5
Article22020-01-032Customer5
Article22020-01-042Customer5
Article22020-01-052Customer5
............

Various hierarchical levels are available for the definition of article time series. The lowest hierarchical level corresponds to the definition of a time series as sales figures of an item per customer. This definition leads to the creation of time series that differ according to all available characteristics (item description, sales date, quantity, customer).

A higher hierarchical level is the time series of article sales figures where all customers are summed up. Following the example above following the example above:

Article descriptionDate of saleQuantity
Article12020-01-0121
Article12020-01-0217
Article12020-01-0320
Article12020-01-0410
Article12020-01-0520
Article22020-01-012
Article22020-01-022
Article22020-01-032
Article22020-01-042
Article22020-01-052
.........

The "customer" attribute has been lost in this step.

In order to form time series, you need at least one piece of time information and a corresponding value. Thus, the time series of the highest hierarchical level in our example corresponds to the total sales quantity.

Date of saleQuantity
2020-01-0123
2020-01-0219
2020-01-0322
2020-01-0412
2020-01-0522

Choice of hierarchical levels

The higher the hierarchical level to which the data is aggregated, the greater the loss of information. However, there are other criteria that need to be considered and weighed up when choosing a suitable hierarchical level.

Motivation

A key aspect is the overall motivation behind data analysis. The primary goal should be to aggregate the data at a hierarchical level that is relevant for subsequent analyses. If multiple levels are of interest, there are several approaches to consider. If working at the lowest target level, results for higher levels can be generated through aggregation. Conversely, if starting at a higher level, time series data must first be disaggregated to the underlying levels. Finally, analyses and forecasts can also be conducted in parallel across different levels.

Predictability

The choice of a hierarchical level has a significant impact on the characteristics of the associated time series. There is often a strong correlation between the selected hierarchical level and predictability, meaning the expected quality of the forecasts.

At lower levels, such as the article-customer level, individual customer purchase behavior can be highly sporadic, resulting in time series that are similarly noisy and erratic. However, when these time series are aggregated across all customers, the resulting article-level time series may exhibit strong seasonal patterns or trends. These characteristics can be lost when operating at the item-customer level.

Performance

Performance can also be a crucial factor when selecting the aggregation level. The runtime of processing steps is closely tied to the number of time series to be processed. The lower the chosen hierarchical level, the more time series can be generated, and the greater the subsequent processing effort may become.

Uniqueness

The relationship between time series at different hierarchical levels is not always unique. For instance, if aggregation levels such as item-customer and item-customer_location are chosen, there may not always be a clear relationship between the resulting time series (e.g., a customer might have branches in multiple countries). Ambiguous relationships necessitate additional rules for aggregation and disaggregation.

Hierarchical forecasts

Hierarchical forecasts refer to predictions that take into account the distinct characteristics and interdependencies of the individual hierarchical levels.

Disaggregation based on quantity shares can be understood as a hierarchical forecasting method. For example, one might forecast sales quantities at the item level and then disaggregate them to the item-customer level, determining each customer's share based on their sales history.

More complex hierarchical forecasting methods make predictions at all hierarchical levels. Additionally, these methods aim to minimize the forecast error at each level while providing consistent predictions across levels (aggregating from the lower level results in the upper level).

You are about to leave our website via an external link. Please note that the content of the linked page is beyond our control.

Cookies und andere (Dritt-)Dienste

Diese Website speichert Cookies auf Ihrem Computer nur, wenn Sie dem ausdrücklich zustimmen. Bei Zustimmung werden insbesondere auch Dritt-Dienste eingebunden, die zusätzliche Funktionalitäten, wie beispielsweise die Buchung von Terminen, bereitstellen. Diese Cookies und Dienste werden verwendet, um Informationen darüber zu sammeln, wie Sie mit unserer Website interagieren, und um Ihre Browser-Erfahrung zu verbessern und anzupassen. Zudem nutzen wir diese Informationen für Analysen und Messungen zu unseren Besuchern auf dieser Website und anderen Medien. Weitere Informationen zu den von uns verwendeten Cookies und Dritt-Diensten finden Sie in unseren Datenschutzbestimmungen.